Piotr Mlocek pimlock

## LLM.md

      
              2 files
            
          
              161 forks
            
          
              13 comments
            
          
              1614 stars
            
          
                rain-1
                / LLM.md
            
            
              Last active
              July 11, 2024 18:17
            
              
                LLM Introduction: Learn Language Models
              
          
    Purpose

Bootstrap knowledge of LLMs ASAP. With a bias/focus to GPT.
Avoid being a link dump. Try to provide only valuable well tuned information.
Prelude

Neural network links before starting with transformers.

  
## gist:8172796

      
              1 file
            
          
              404 forks
            
          
              23 comments
            
          
              1645 stars
            
          
                debasishg
                / gist:8172796
            
            
              Last active
              July 5, 2024 11:53
            
              
                A collection of links for streaming algorithms and data structures
              
          
    General Background and Overview


Probabilistic Data Structures for Web Analytics and Data Mining : A great overview of the space of probabilistic data structures and how they are used in approximation algorithm implementation.
Models and Issues in Data Stream Systems
Philippe Flajolet’s contribution to streaming algorithms : A presentation by Jérémie Lumbroso that visits some of the hostorical perspectives and how it all began with Flajolet
Approximate Frequency Counts over Data Streams by Gurmeet Singh Manku & Rajeev Motwani : One of the early papers on the subject.
[Methods for Finding Frequent Items in Data Streams](http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.187.9800&amp;rep=rep1&amp;t


## latency.txt
L1 cache reference                          0.5 ns
Branch mispredict                             5 ns
L2 cache reference                            7 ns             14x L1 cache
Mutex lock/unlock                            25 ns
Main memory reference                       100 ns             20x L2 cache, 200x L1 cache
Compress 1K bytes with Zippy              3,000 ns
Send 1K bytes over 1 Gbps network        10,000 ns    0.01 ms
Read 1 MB sequentially from memory      250,000 ns    0.25 ms
Round trip within same datacenter       500,000 ns    0.5  ms
Read 1 MB sequentially from SSD       1,000,000 ns    1    ms  4X memory
	L1 cache reference 0.5 ns
	Branch mispredict 5 ns
	L2 cache reference 7 ns 14x L1 cache
	Mutex lock/unlock 25 ns
	Main memory reference 100 ns 20x L2 cache, 200x L1 cache
	Compress 1K bytes with Zippy 3,000 ns
	Send 1K bytes over 1 Gbps network 10,000 ns 0.01 ms
	Read 1 MB sequentially from memory 250,000 ns 0.25 ms
	Round trip within same datacenter 500,000 ns 0.5 ms
	Read 1 MB sequentially from SSD 1,000,000 ns 1 ms 4X memory