Skip to content

Instantly share code, notes, and snippets.

View sammcj's full-sized avatar
🦃

Sam sammcj

🦃
View GitHub Profile
@Artefact2
Artefact2 / README.md
Last active April 25, 2024 08:07
GGUF quantizations overview

Which GGUF is right for me? (Opinionated)

Good question! I am collecting human data on how quantization affects outputs. See here for more information: ggerganov/llama.cpp#5962

In the meantime, use the largest that fully fits in your GPU. If you can comfortably fit Q4_K_S, try using a model with more parameters.

llama.cpp feature matrix

See the wiki upstream: https://github.com/ggerganov/llama.cpp/wiki/Feature-matrix

@sammcj
sammcj / .textgen.env
Last active December 31, 2023 15:24
Willow Speech + Local LLM + HomeAssistant
# https://github.com/oobabooga/text-generation-webui/blob/main/README.md
# https://github.com/oobabooga/text-generation-webui/blob/main/docs/Spell-book.md
# by default the Dockerfile specifies these versions: 3.5;5.0;6.0;6.1;7.0;7.5;8.0;8.6+PTX
# https://developer.nvidia.com/cuda-gpus you can find the version for your card here
# Tesla P100 = sm_60, sm_61, sm_62 and compute_60, compute_61, compute_62
# TORCH_CUDA_ARCH_LIST=6.0,6.1,6.2,7.0,7.5,8.0,8.6+PTX
# RTX3090 = sm_86 and compute_86 (PTX)
#8.6+PTX
# Tesla P100 = sm_60, sm_61, sm_62 and compute_60, compute_61, compute_62
@ChristophCaina
ChristophCaina / s3-box3.yaml
Last active March 31, 2024 20:11
Code for the ESP32-S3-Box3
#------------------------------------------------------------------------------------#
# PIN Schematics #
# #
# GPIO-00 MCU-BOOT #
# GPIO-01 Speaker Mute-Status #
# GPIO-02 I2S MCLK #
# GPIO-03 Touch-Screen TT21100 Interrupt Pin #
# GPIO-04 ILI92xxx Display DC-Pin (SPI: CLK-Pin) #
# GPIO-05 ILI92xxx Display CS-Pin (SPI: MOSI-Pin) #
# GPIO-06 ILI92xxx Display SDA #
@veekaybee
veekaybee / normcore-llm.md
Last active April 23, 2024 16:03
Normcore LLM Reads

Anti-hype LLM reading list

Goals: Add links that are reasonable and good explanations of how stuff works. No hype and no vendor content if possible. Practical first-hand accounts of models in prod eagerly sought.

Foundational Concepts

Screenshot 2023-12-18 at 10 40 27 PM

Pre-Transformer Models

@allenporter
allenporter / notify_agent_agenda.yaml
Last active April 23, 2024 15:23
Home Assistant Blueprint: Conversation agent Agenda Notification
---
blueprint:
name: Conversation agent Agenda Notification
description:
Conversation agent generates a notification based on the upcoming calendar
agenda, location, and weather information.
domain: automation
input:
notify_time:
name: Notification time

[This portion of call begins at 25:47]

Me: I could make it really easy on you, if you think Apollo is costing you $20 million per year, cut me a check for $10 million and we can both skip off into the sunset. Six months of use. We're good. That's mostly a joke.

Reddit: Six months of use? What do you mean? I know you said that was mostly a joke, but I want to take everything you're saying seriously just to make sure I'm not - what are you referring to?

Me: Okay, if Apollo's opportunity cost currently is $20 million dollars. At the 7 billion requests and API volume. If that's your yearly opportunity cost for Apollo, cut that in half, say for 6 months. Bob's your uncle.

Reddit: You cut out right at the end. I'm not asking you to repeat yourself for a third time, but you legit cut out right at the end. "If your opportunity cost is $10 million" and then I lost you.

esphome:
name: "sensibo-sky"
esp8266:
board: esp01_1m
# Enable logging
logger:
# Enable Home Assistant API
@RickyCook
RickyCook / run_in_linux.sh
Created August 16, 2022 06:11
Wrap a command in a Docker container to create a simple "in Linux" utility
#!/usr/bin/env bash -e
### CONFIG
CONTAINER_NAME=something_unique_here
CONTAINER_IMAGE=python:3.9 # must have bash, or change the infinite sleep on :17
COMMAND_PREFIX=./pants
### STOP EDITING
@krushildhamelia
krushildhamelia / TopazLabs (Topaz Gigapixel AI, Topaz DeNoise AI, ...) AI Model Files List Of URLs.txt
Last active September 23, 2023 14:27
TopazLabs (Topaz Gigapixel AI, Topaz DeNoise AI, ...) AI Model Files List Of URLs. download and place under "%AppData%\Topaz Labs LLC\Topaz Gigapixel AI\tgrc\" OR "%PROGRAMDATA%\Topaz Labs LLC\Topaz Gigapixel AI\tgrc" to avoid downloading in program time. use aria2c to download the whole list with '--input-file=your_file.txt'.
http://models.topazlabs.com/v1/gnn-v1-fp16-256x256-1x-ov.tz
http://models.topazlabs.com/v1/gnn-v1-fp16-256x256-2x-ov.tz
http://models.topazlabs.com/v1/gnn-v1-fp16-256x256-4x-ov.tz
http://models.topazlabs.com/v1/gnn-v1-fp16-384x384-1x-ov.tz
http://models.topazlabs.com/v1/gnn-v1-fp16-384x384-2x-ov.tz
http://models.topazlabs.com/v1/gnn-v1-fp16-384x384-4x-ov.tz
http://models.topazlabs.com/v1/gnn-v1-fp32-256x256-1x-ov.tz
http://models.topazlabs.com/v1/gnn-v1-fp32-256x256-2x-ov.tz
http://models.topazlabs.com/v1/gnn-v1-fp32-256x256-4x-ov.tz
http://models.topazlabs.com/v1/gnn-v1-fp32-384x384-1x-ov.tz