Skip to content

Instantly share code, notes, and snippets.

View andrewginns's full-sized avatar
💭
Coding

Andrew Ginns andrewginns

💭
Coding
View GitHub Profile
@andrewginns
andrewginns / llm_perf_comparison.md
Last active January 12, 2024 10:06
Comparison of LLM performance on varied hardware

Based on ggerganov/llama.cpp#4167

LLaMA 7B

PP means "prompt processing" (bs = 512), TG means "text-generation" (bs = 1), t/s means "tokens per second"

BW
[GB/s]
GPU
Cores
F16 PP
[t/s]
F16 TG
[t/s]
Q8_0 PP
[t/s]
Q8_0 TG
[t/s]
Q4_0 PP
[t/s]
Q4_0 TG
[t/s]
✅ M1 Pro 16GB 200 14 262.65 12.75 235.16 21.95 232.55 35.52
✅ [M3 Pro 36
@andrewginns
andrewginns / tweet_dumper.py
Last active February 16, 2017 12:17 — forked from yanofsky/LICENSE
A script to download all of a user's tweets into a csv
#!/usr/bin/env python
# encoding: utf-8
import tweepy #https://github.com/tweepy/tweepy
import csv
#Twitter API credentials
consumer_key = ""