- Introduces fastText, a simple and highly efficient approach for text classification.
- On par with deep learning models in terms of accuracy while being orders of magnitude faster to train and evaluate (a usage sketch follows the links below).
- Link to the paper
- Link to code
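A minimal usage sketch of this kind of classifier via the `fasttext` Python package; the package choice, the tiny `train.txt` file, its `__label__` format, and the hyperparameter values are illustrative assumptions rather than details taken from the summary.

```python
import fasttext

# Write a tiny training file in fastText's supervised format (made-up data).
with open("train.txt", "w") as f:
    f.write("__label__positive this movie was great\n")
    f.write("__label__positive wonderful acting and a clever script\n")
    f.write("__label__negative terrible plot and acting\n")
    f.write("__label__negative boring from start to finish\n")

# Linear classifier over bag-of-n-gram features; hyperparameters are illustrative.
model = fasttext.train_supervised(
    input="train.txt",
    lr=0.5,         # learning rate
    epoch=25,       # passes over the (tiny) training set
    wordNgrams=2,   # include word bigrams as features
)

# Predict the label and its probability for a new piece of text.
labels, probs = model.predict("the acting was wonderful")
print(labels, probs)
```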
- Introduces a new global log-bilinear regression model which combines the benefits of both global matrix factorization and local context window methods.
- Global matrix factorization methods decompose large matrices (e.g., of word co-occurrence counts) into low-rank approximations.
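The wording of the summary matches the GloVe model; assuming that, here is a numpy sketch of its weighted least-squares objective, which penalizes the squared difference between the dot product of two word vectors (plus biases) and the log co-occurrence count, weighted by f(X_ij). The toy matrix and vector dimensions are made up; `x_max=100` and `alpha=0.75` are the values usually quoted for the model.

```python
import numpy as np

def glove_loss(X, W, W_tilde, b, b_tilde, x_max=100.0, alpha=0.75):
    """Sum of f(X_ij) * (w_i . w~_j + b_i + b~_j - log X_ij)^2 over
    nonzero co-occurrence counts X_ij."""
    loss = 0.0
    for i, j in zip(*np.nonzero(X)):
        weight = min((X[i, j] / x_max) ** alpha, 1.0)          # f(X_ij)
        diff = W[i] @ W_tilde[j] + b[i] + b_tilde[j] - np.log(X[i, j])
        loss += weight * diff ** 2
    return loss

# Toy setup: 4-word vocabulary, 5-dimensional vectors (purely illustrative).
rng = np.random.default_rng(0)
X = rng.integers(0, 10, size=(4, 4)).astype(float)   # co-occurrence counts
W, W_tilde = rng.normal(size=(4, 5)), rng.normal(size=(4, 5))
b, b_tilde = rng.normal(size=4), rng.normal(size=4)
print(glove_loss(X, W, W_tilde, b, b_tilde))
```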
- Algorithm to compute the similarity between two nodes of a graph (or of a graph derived from any other kind of dataset); a sketch of the iteration follows below.
- Link to the paper
- Input: A directed graph G = (V, E) where V represents vertices and E represents edges.
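The description reads like SimRank; under that assumption, a minimal sketch of the iteration: every node is maximally similar to itself, and two distinct nodes are similar to the extent that their in-neighbours are similar, damped by a decay constant. The toy graph and the constant are illustrative.

```python
import itertools
from collections import defaultdict

def simrank(nodes, edges, C=0.8, iters=10):
    """Iterative pairwise similarity on a directed graph G = (V, E)."""
    in_nbrs = defaultdict(list)
    for u, v in edges:
        in_nbrs[v].append(u)
    sim = {(a, b): 1.0 if a == b else 0.0 for a in nodes for b in nodes}
    for _ in range(iters):
        new_sim = {}
        for a, b in itertools.product(nodes, nodes):
            if a == b:
                new_sim[(a, b)] = 1.0
            elif in_nbrs[a] and in_nbrs[b]:
                total = sum(sim[(u, v)] for u in in_nbrs[a] for v in in_nbrs[b])
                new_sim[(a, b)] = C * total / (len(in_nbrs[a]) * len(in_nbrs[b]))
            else:
                new_sim[(a, b)] = 0.0   # no in-neighbours to compare
        sim = new_sim
    return sim

# Toy graph (made up): two nodes pointed at by partly overlapping sources.
nodes = ["UnivA", "UnivB", "ProfX", "ProfY"]
edges = [("UnivA", "ProfX"), ("UnivB", "ProfX"), ("UnivA", "ProfY")]
print(simrank(nodes, edges)[("ProfX", "ProfY")])
```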
How NOT To Evaluate Your Dialogue System: An Empirical Study of Unsupervised Evaluation Metrics for Dialogue Response Generation
- The paper explores the strengths and weaknesses of different evaluation metrics for end-to-end dialogue systems (in an unsupervised setting).
- Link to the paper
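Word-overlap metrics such as BLEU are among the unsupervised metrics a study like this examines; for reference, this is how a sentence-level BLEU score for a generated response is typically computed. The use of NLTK and the example sentences are illustrative choices, not prescriptions from the paper.

```python
from nltk.translate.bleu_score import sentence_bleu, SmoothingFunction

# Toy dialogue turn (made up): reference response vs. model response.
reference = "i am doing well thanks for asking".split()
hypothesis = "i am fine thank you".split()

# Smoothing keeps short responses from scoring zero when higher-order
# n-grams never overlap with the reference.
score = sentence_bleu([reference], hypothesis,
                      smoothing_function=SmoothingFunction().method1)
print(f"BLEU: {score:.3f}")
```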
- Task of translating natural language queries into regular expressions without using domain-specific knowledge.
- Proposes a methodology for collecting a large corpus of natural language and regular expression pairs (an illustrative pair is shown below).
- Reports a performance gain of 19.6% over state-of-the-art models.
- Link to the paper
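A hypothetical (description, regular expression) pair of the kind such a corpus would contain, checked with Python's `re` module; the pair is made up for illustration and is not drawn from the paper's dataset.

```python
import re

description = "lines containing the word 'dog' somewhere before the word 'cat'"
regex = r"\bdog\b.*\bcat\b"     # a candidate translation of the description

for line in ["the dog chased the cat", "the cat chased the dog"]:
    matched = re.search(regex, line) is not None
    print(f"{line!r} -> {matched}")
```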
- Large-scale natural language understanding task: predict text values given a knowledge base.
- Accompanied by a large dataset generated using Wikipedia.
- Link to the paper
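The summary does not spell out the instance format, so the following is a purely hypothetical sketch of what a (document, property, value) prediction instance for such a task could look like; the field names and content are assumptions.

```python
# Hypothetical instance: read the document and the knowledge-base property,
# then produce the text value as the prediction target.
instance = {
    "document": "Folkart Towers are twin skyscrapers in the Turkish city of Izmir.",
    "property": "country",
    "answer": "Turkey",
}
print(instance["property"], "->", instance["answer"])
```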
- Presents WikiQA - a publicly available set of question and sentence pairs for open-domain question answering.
- Link to the paper
- 3047 questions sampled from Bing query logs.
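Presumably each question is paired with candidate answer sentences and a relevance judgement (an assumption here); a hypothetical example of pairs in that shape, not quoted from the dataset.

```python
# Made-up question/sentence pairs with binary relevance labels.
pairs = [
    ("how are glaciers formed",
     "Glaciers form when snow accumulates faster than it melts.", 1),
    ("how are glaciers formed",
     "Glaciers are found on every continent except Australia.", 0),
]
for question, sentence, label in pairs:
    print(f"[{label}] {question} -> {sentence}")
```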
- Builds a supervised reading comprehension dataset from a news corpus (a toy instance sketch follows the link below).
- Compares the performance of neural models against state-of-the-art natural language processing models on the reading comprehension task.
- Link to the paper
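One common way to build such a dataset from news articles is to pair an article with one of its summary points and blank out an entity to form a cloze-style query; whether this matches the paper's exact construction is an assumption here. A toy sketch of that idea, with made-up text:

```python
# Turn (article, highlight) into a cloze-style (context, query, answer) instance
# by removing an entity from the highlight.
article = ("The spacecraft launched from the Kennedy Space Center on Monday "
           "and is expected to reach orbit within hours.")
highlight = "The spacecraft launched from the Kennedy Space Center."
answer = "Kennedy Space Center"

instance = {
    "context": article,
    "query": highlight.replace(answer, "@placeholder"),
    "answer": answer,
}
print(instance["query"])
```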
- The paper presents a suite of benchmark tasks to evaluate end-to-end dialogue systems such that performing well on the tasks is a necessary (but not sufficient) condition for a fully functional dialogue agent.
- Link to the paper
- Created using large-scale, real-world sources: OMDB (Open Movie Database), MovieLens, and Reddit.
- The paper explains how to apply dropout to LSTMs and how it could reduce overfitting in tasks like language modelling, speech recognition, image caption generation and machine translation.
- Link to the paper
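The recipe amounts to applying dropout only to the non-recurrent connections (embedding output, between stacked layers, and before the decoder), never to the hidden-to-hidden transition. A sketch of that pattern in PyTorch, where the `dropout` argument of `nn.LSTM` acts only between stacked layers; the vocabulary size, layer sizes, and dropout rate are illustrative choices.

```python
import torch
import torch.nn as nn

class RegularizedLSTM(nn.Module):
    """Language-model-style LSTM with dropout on non-recurrent connections only."""
    def __init__(self, vocab_size=10000, embed_dim=200, hidden_dim=200,
                 num_layers=2, p_drop=0.5):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.drop = nn.Dropout(p_drop)
        # This dropout is applied between stacked LSTM layers, not on the
        # recurrent (hidden-to-hidden) connections.
        self.lstm = nn.LSTM(embed_dim, hidden_dim, num_layers,
                            dropout=p_drop, batch_first=True)
        self.decoder = nn.Linear(hidden_dim, vocab_size)

    def forward(self, tokens):
        x = self.drop(self.embed(tokens))      # dropout on the input side
        out, _ = self.lstm(x)
        return self.decoder(self.drop(out))    # dropout before the decoder

# Toy forward pass with random token ids (shapes are illustrative).
model = RegularizedLSTM()
logits = model(torch.randint(0, 10000, (4, 35)))
print(logits.shape)   # (batch=4, seq_len=35, vocab=10000)
```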
- Regularisation method that drops out (or temporarily removes) units in a neural network.
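A minimal numpy sketch of the mechanism: each unit is kept with probability `p_keep` and the survivors are rescaled so the expected activation is unchanged (the "inverted dropout" convention, a common implementation choice rather than something stated in the summary).

```python
import numpy as np

def dropout(activations, p_keep=0.5, rng=None):
    """Zero each unit with probability 1 - p_keep and rescale the rest."""
    rng = rng or np.random.default_rng()
    mask = rng.random(activations.shape) < p_keep
    return activations * mask / p_keep

h = np.ones((2, 8))                  # toy layer activations
print(dropout(h, p_keep=0.5, rng=np.random.default_rng(0)))
```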