Skip to content

Instantly share code, notes, and snippets.

@harveyslash
Last active May 11, 2018 02:40
Show Gist options
  • Save harveyslash/04ccfb0f704a257a488b32d4a3bea4b2 to your computer and use it in GitHub Desktop.
Save harveyslash/04ccfb0f704a257a488b32d4a3bea4b2 to your computer and use it in GitHub Desktop.

Impersonet- Developing a unique writing style by learning from multiple authors [NLP]

By reading vastly different styles of text (such as fiction and philosopy), it may be possible to come up with a writing style that takes inspiration from both, resulting in a completely unique writing style. This is useful for artificial data generation.

Implementation so far/ideas

  • Character level neural language model
  • regularized by GANs
  • leraning by just raw sampling yeilds poor results, so need custom training algorithm (I have a promising one, but not implemented and tested)

Task

  • implement and improve custom training loop

Improving state of the art in steganography [Computer Vision]

NIPS Paper was published that pushed the boundaries of steg using Deep learning autoencoders. I have personally implemented that paper https://github.com/harveyslash/Deep-Steganography.

Implementation so far/ideas

My custom implementation yields good results. The quality of images can be improved by

  1. using adversarial learning
  2. using a more sophisticated loss like SSIM

Task

  • implement and improve results from paper using 1. and 2.

Image Search using Natural Language Queries [NLP/Computer Vision]

Natural queries like 'a brown car parked by the park' should find the images that match the queries best.

Implementation so far/ideas

Given a query, the model was trained to Generate a representation of the image such that the MSE between generated representation and actual representation of image was low.

Basic implementation was done. The results were somewhat working, but dissatisfactory. The model gets extremely sensitive to certain key words while ignoring other words almost completely.

Task

The issue with this approach is that the generated representation is too global. It will always favour key words like 'car' or 'duck' because these words will coresspond to the highest amount of activations.

The task is to figure out ways to look at different parts of an image individually with respect to the query.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment