@valiantone
Last active May 8, 2024 17:49
Let's make some Wavess

The Ultimate LLM Field Guide

Logging & Analytics

DuckDB

Application/Serving

Tools

Talks and Sessions

Hosting LLM Apps @ Scale

Bending LLMs to Your Will…(and your will is to create structured data) - Haystack (deepset)

Challenges in serving LLM Applications at enterprise scale - Titan ML

From Lab to Life: The Evolution of Jina Embeddings V2 - Jina AI

Isabelle Mohr | Slides

Prompt-Engineering for Open-Source LLMs

Technical Product Sales Call (wavess.io) with Astrik + Kirstin

Wavess

A play to explore funding opportunities with Astrik in the marketing co-pilot B2B service space.

Ahead of AI specializes in Machine Learning & AI research and is read by tens of thousands of researchers and practitioners who want to stay ahead in the ever-evolving field.

Competitors

Blabigo

Reading

Practical Data Privacy

Copy Writer AI

A product to explore rewriting marketing copy based on user input (to drive higher engagement) or creating full-text marketing copy from a minimal prompt.

Papers

A Machine Learning Approach for Automated Filling of Categorical Fields in Data Entry Forms

LMCanvas: Object-Oriented Interaction to Personalize Large Language Model-Powered Writing Environments

Leveraging Pre-trained Checkpoints for Sequence Generation Tasks

BERTScore: Evaluating Text Generation with BERT

Distilling the Knowledge of BERT for Text Generation

Learning Neural Templates for Text Generation

Constitutional AI: Harmlessness from AI Feedback

xLSTM: Extended Long Short-Term Memory

Printed

Resources

Code

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Get started with open source LLMs on a GPU

An implementation of a transformer-based language model for sentence rewriting tasks such as summarization, simplification, and grammatical error correction.

AirLLM optimizes inference memory usage, allowing 70B large language models to run inference on a single 4GB GPU card.

Embeddings and Vectors

Lightweight embeddings

Vector Databases

App UI

Flask-based

Streamlit

Prompting

A guide to prompting Llama 2

Understanding prompts and how to make them work, explained like I'm 5
