🌟 New features:
- Massive optimizations to LSI model training (@isamaru, #1620 & #1622)
- LSI model allows use of single precision (float32), to consume 40% less memory while being 40% faster.
- LSI model can now also accept CSC matrix as input, for further memory and speed boost.
- Overall, if your entire corpus fits in RAM: 3x faster LSI training (SVD) in 4x less memory!
# just an example; the corpus stream is up to you streaming_corpus = gensim.corpora.MmCorpus("my_tfidf_corpus.mm.gz")