## NLP-papers-tools-discussion
Anything useful goes here
### Possible research directions and fundamental discussions
[The 4 Biggest Open Problems in NLP Today](http://ruder.io/4-biggest-open-problems-in-nlp/)
[What innate priors should we build into the architecture of deep learning systems? A debate between Prof. Y. LeCun and Prof. C. Manning](https://www.abigailsee.com/2018/02/21/deep-learning-structure-and-innate-priors.html)
### New Libraries/SOTA we care about
[Flair - LM/embedding/general NLP library; one of the fastest-growing NLP projects on GitHub](https://github.com/zalandoresearch/flair)
### My Bag of Tricks for Deep Learning performance - add yours too:
[Mini-batch data parallelism, essentially the default in PyTorch](https://towardsdatascience.com/speed-up-your-algorithms-part-1-pytorch-56d8a4ae7051)
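A minimal sketch of the usual `nn.DataParallel` pattern in PyTorch (the model and tensor sizes here are placeholders):
```
import torch
import torch.nn as nn

model = nn.Linear(512, 2)                    # placeholder model; any nn.Module works
device = "cuda" if torch.cuda.is_available() else "cpu"
if torch.cuda.device_count() > 1:
    # replicate the model on every visible GPU; each replica gets a slice of the batch
    model = nn.DataParallel(model)
model = model.to(device)

x = torch.randn(64, 512, device=device)     # one mini-batch
out = model(x)                               # the batch is split across GPUs automatically
```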
[Numba: JIT-compiles numerical Python into highly optimized machine code, so hot loops no longer run through slow NumPy (it is often faster than Cython)](https://towardsdatascience.com/speed-up-your-algorithms-part-2-numba-293e554c5cc1)
Best of all, you still write plain Python; you just add a decorator on top of the time-consuming function:
```
from numba import jit, vectorize, int32

# Declare the argument and return types and compile in nopython mode:
# the function goes straight to machine code with no Python fallback
# (harder to debug, but MUCH faster). With parallel=True, supported
# array operations are distributed across the cores of your CPU.
@jit(int32(int32, int32), nopython=True, parallel=True)
def function(a, b):
    # your loop or numerically intensive computations
    return a + b

# Here we drop the explicit types: @vectorize turns a scalar function
# into a NumPy ufunc that is applied element-wise over whole arrays.
# Numba figures out the best way to compile and dispatch it.
@vectorize
def function2(c):
    # your loop or numerically intensive computations
    return c * 2
```
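Once compiled, `function` runs through its declared `int32` signature and `function2` behaves like a NumPy ufunc, so it can be called on whole arrays (a toy call against the placeholder bodies above):
```
import numpy as np

a = np.arange(10, dtype=np.int32)
print(function(3, 4))   # scalar call through the compiled int32 signature
print(function2(a))     # applied element-wise over the whole array
```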
[Dask - parallelizing NumPy, pandas, Python, scikit-learn, literally everything...](https://towardsdatascience.com/speeding-up-your-algorithms-part-4-dask-7c6ed79994ef)
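A tiny dask.dataframe sketch (the file glob and column name are placeholders); everything stays lazy until `.compute()` is called:
```
import dask.dataframe as dd

# read_csv accepts globs and builds a lazy, partitioned DataFrame
df = dd.read_csv("data/part-*.csv")
# work is scheduled across cores only when .compute() is called
mean_len = df["text"].str.len().mean().compute()
print(mean_len)
```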
[Modin: an alternative to Dask, but only for pandas - much simpler and lighter if I/O is what you need. It will process a 10 GB DataFrame in seconds.](https://github.com/modin-project/modin)
```
# replace the following line
#import pandas as pd
# with
import modin.pandas as pd
```
You are done! pandas is now 10-30 times faster, though it will sometimes crash :)
[ipyexperiments - will save you 20-30% of GPU memory and 10-15% of system memory](https://github.com/stas00/ipyexperiments)
[ipyexperiments usage examples in some Kaggle contest code I wrote](https://github.com/arjunnlp/NLP-papers-tools-discussion/blob/master/preprocess-dainis.ipynb)
Make sure to use IPyTorchExperiments throughout, or IPyCPUExperiments if you don't care about the GPU. If you are using a GPU, be sure to use IPyTorchExperiments and to check that the text printed after the cell confirms it is indeed using the GPU backend.
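A rough usage sketch; the class name follows the text above, but ipyexperiments has renamed its classes across versions, so check the repo's README for the exact API:
```
from ipyexperiments import IPyTorchExperiments  # name as used above; may differ by version

exp = IPyTorchExperiments()   # start tracking GPU and CPU memory for this scope
# ... run your training / preprocessing code here ...
del exp                       # reclaim objects created in the scope and print memory deltas
```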
### Visualization Software
General:
[TensorboardX for PyTorch](https://github.com/arjunnlp/tensorboardX)
[Visdom - similar to tensorboard](https://github.com/arjunnlp/visdom)
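A minimal tensorboardX logging sketch (the tag name and values are placeholders):
```
from tensorboardX import SummaryWriter

writer = SummaryWriter(log_dir="runs/exp1")   # hypothetical run directory
for step in range(100):
    loss = 1.0 / (step + 1)                   # placeholder metric
    writer.add_scalar("train/loss", loss, step)
writer.close()
```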
LSTM:
[LSTMVis: Visualizing LSTM](https://github.com/HendrikStrobelt/LSTMVis)
[Seq2Seq Vis: Visualization for Sequential Neural Networks with Attention](https://github.com/HendrikStrobelt/Seq2Seq-Vis)
### Paper and Technical Writing HOWTO
On writing research papers:
["How to Write an Introduction" by Dr. Om Gnawali](http://www2.cs.uh.edu/~gnawali/courses/cosc6321-s17/hw7.html)
Some of the best examples of technical writing (papers & blogs go hand in hand!):
[How to trick a neural network into thinking a panda is a vulture](https://codewords.recurse.com/issues/five/why-do-neural-networks-think-a-panda-is-a-vulture)
[Picking an optimizer for Style Transfer](https://blog.slavv.com/picking-an-optimizer-for-style-transfer-86e7b8cba84b)
[How do we 'train' Neural Networks?](https://towardsdatascience.com/how-do-we-train-neural-networks-edd985562b73)
### Must-read papers
[Deep Graph Methods Survey](https://arxiv.org/pdf/1901.00596.pdf)
[Disciplined Training of Neural Networks](https://arxiv.org/abs/1803.09820)
### Blogs You Should Follow
[Stanford AI Salon: bi-weekly discussion of important topics in AI/NLP/ML, closed to the public due to space restrictions, but notes and videos are now posted on a blog. Previous guests include LeCun, Hinton, Ng, and others](http://ai.stanford.edu/blog/)
[Andrej Karpathy](https://medium.com/@karpathy)
[Vitaliy Bushaev](https://towardsdatascience.com/@bushaev)
[Sylvain Gugger](https://sgugger.github.io/)
[Sebastian Ruder](http://ruder.io/)
[Ilya Sutskever](https://blog.openai.com/tag/ilya-sutskever/) - OpenAI released the GPT (Transformer language model) paper through this blog, not even on arXiv
[Jeremy Howard](https://twitter.com/jeremyphoward)