Skip to content

Instantly share code, notes, and snippets.

@veekaybee
Last active May 9, 2024 07:47
Show Gist options
  • Save veekaybee/be375ab33085102f9027853128dc5f0e to your computer and use it in GitHub Desktop.
Save veekaybee/be375ab33085102f9027853128dc5f0e to your computer and use it in GitHub Desktop.
Normcore LLM Reads

Anti-hype LLM reading list

Goals: Add links that are reasonable and good explanations of how stuff works. No hype and no vendor content if possible. Practical first-hand accounts of models in prod eagerly sought.

Foundational Concepts

Screenshot 2023-12-18 at 10 40 27 PM

Pre-Transformer Models

Screenshot 2023-12-18 at 8 25 42 PM

Building Blocks

Foundational Deep Learning Papers (in semi-chronological order)

The Transformer Architecture

Screenshot 2023-12-18 at 8 37 44 PM

Attention

GPT

Screenshot 2023-12-18 at 8 37 44 PM

Significant OSS Models

LLMs in 2023

Screenshot 2023-12-18 at 10 07 57 PM

Training Data

Pre-Training

RLHF and DPO

Screenshot 2023-12-18 at 10 07 57 PM

Fine-Tuning and Compression

Small and Local LLMs

Deployment and Production

LLM Inference and K-V Cache

Prompt Engineering and RAG

GPUs

Screenshot 2023-12-18 at 10 02 48 PM

Evaluation

Eval Frameworks

UX

What's Next?

Thanks to everyone who added suggestions on Twitter, Mastodon, and Bluesky.

@umair-nasir14
Copy link

@livc Are you guys hiring?

@umair-nasir14
Copy link

@Sharrp
Copy link

Sharrp commented Aug 29, 2023

I found "Five years of GPT progress" to be a useful overview of the influential papers on GPT.
https://finbarr.ca/five-years-of-gpt-progress/
May work as a high-level summary for "Foundational Papers" section.

p.s. Thank you for compiling the list!

@hkniberg
Copy link

hkniberg commented Aug 30, 2023

Hi! This is great! Would be even more useful to write the year/month of publication next to each item, to get a sense of which links are more up-to-date and which are more historical.

@will-thompson-k
Copy link

will-thompson-k commented Aug 30, 2023

I really like this list, sad I just discovered this 😎 .

I am not sure if this would complement your Background section, but I wrote this as a primer on LLMs last month: https://willthompson.name/what-we-know-about-llms-primer.

But I don't know, might not be very orthogonal to your other sources here 🤷 .

@AnnthomyGILLES
Copy link

An overview of vector database. The author highlight the differences between the various vector databases out there as visually as possible.

https://thedataquarry.com/posts/vector-db-1/

@davidzshi
Copy link

An overview of vector database. The author highlight the differences between the various vector databases out there as visually as possible.

https://thedataquarry.com/posts/vector-db-1/

This is really helpful, thank you!

@tekumara
Copy link

@lcrmorin
Copy link

I keep coming back to this list. However I feel like it miss a good discussion about current stuff not working. I keep failling to implement working stuff, despite lenghty theoretical works, and when I scratch the veneer I keep getting the same answer: "technology is not ready yet".

@lcrmorin
Copy link

lcrmorin commented Dec 29, 2023

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment