Ziyun Li ziyunli

## normcore-llm.md

      
              1 file
            
          
              218 forks
            
          
              38 comments
            
          
              2780 stars
            
          
                veekaybee
                / normcore-llm.md
            
            
              Last active
              July 26, 2024 01:10
            
              
                Normcore LLM Reads
              
          
    Anti-hype LLM reading list

Goals: Add links that are reasonable and good explanations of how stuff works. No hype and no vendor content if possible. Practical first-hand accounts of models in prod eagerly sought.
Foundational Concepts


Pre-Transformer Models


## home_nss_home.md

      
              1 file
            
          
              8 forks
            
          
              15 comments
            
          
              187 stars
            
          
                sharadhr
                / home_nss_home.md
            
            
              Last active
              July 6, 2024 14:04
            
              
                Home, Not So Sweet Home
              
          
    $HOME, Not So Sweet $HOME


Preface
1 What is $HOME to you?
2 Setup

2.1 Linux

2.1.1 $HOME
2.1.2 Dot-files and dot-directories
2.1.3 XDG Base Directories


2.1.4 xdg-user-dirs


## llama2-mac-gpu.sh
# Clone llama.cpp
git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp

# Build it
make clean
LLAMA_METAL=1 make

# Download model
export MODEL=llama-2-13b-chat.ggmlv3.q4_0.bin

## function-calling.ipynb

      
              1 file
            
          
              11 forks
            
          
              3 comments
            
          
              80 stars
            
          
                kylemcdonald
                / function-calling.ipynb
            
            
              Created
              June 14, 2023 01:10
            
              
                Example of OpenAI function calling API to extract data from LAPD newsroom articles.
              
          
        Loading

      Sorry, something went wrong. Reload?
      Sorry, we cannot display this file.
      Sorry, this file is invalid so it cannot be displayed.
      
          Viewer requires iframe.
      
    
## macOS Internals.md

      
              1 file
            
          
              87 forks
            
          
              4 comments
            
          
              1605 stars
            
          
                kconner
                / macOS Internals.md
            
            
              Last active
              July 7, 2024 19:42
            
              
                macOS Internals
              
          
    macOS Internals

Understand your Mac and iPhone more deeply by tracing the evolution of Mac OS X from prelease to Swift. John Siracusa delivers the details.
Starting Points

How to use this gist

You've got two main options:

  
## LLM.md

      
              2 files
            
          
              161 forks
            
          
              13 comments
            
          
              1613 stars
            
          
                rain-1
                / LLM.md
            
            
              Last active
              July 27, 2024 04:02
            
              
                LLM Introduction: Learn Language Models
              
          
    Purpose

Bootstrap knowledge of LLMs ASAP. With a bias/focus to GPT.
Avoid being a link dump. Try to provide only valuable well tuned information.
Prelude

Neural network links before starting with transformers.

  
## Dockerfile
FROM hayd/alpine-deno:1.10.1
WORKDIR /src/app

ADD deps.ts ./
RUN ["deno", "cache", "deps.ts"]

ADD *.ts ./
RUN ["deno", "cache", "mod.ts"]

ENTRYPOINT ["deno", "run", "--unstable", "--allow-net", "--allow-hrtime", "--allow-env", "--cached-only", "--no-check", "mod.ts"]

## README.en.md

      
              3 files
            
          
              21 forks
            
          
              471 comments
            
          
              304 stars
            
          
                akihikodaki
                / README.en.md
            
            
              Last active
              July 25, 2024 12:07
            
              
                Linux Desktop on Apple Silicon in Practice
              
          
    Linux Desktop on Apple Silicon in Practice

I bought M1 MacBook Air. It is the fastest computer I have, and I have been a
GNOME/GNU/Linux user for long time. It is obvious conclusion that I need
practical Linux desktop environment on Apple
Silicon.
Fortunately, Linux already works on Apple Silicon/M1. But how practical is it?

Two native ports exist.


## main.go
package main

import (
	"context"
	"flag"
	"fmt"
	"log"
	"net/http"
	"os"
	"os/signal"

## index.md

      
              1 file
            
          
              13 forks
            
          
              19 comments
            
          
              135 stars
            
          
                bojand
                / index.md
            
            
              Last active
              July 15, 2024 02:51
            
              
                gRPC and Load Balancing
              
          
    Just documenting docs, articles, and discussion related to gRPC and load balancing.
https://github.com/grpc/grpc/blob/master/doc/load-balancing.md
Seems gRPC prefers thin client-side load balancing where a client gets a list of connected clients and a load balancing policy from a "load balancer" and then performs client-side load balancing based on the information. However, this could be useful for traditional load banaling approaches in clound deployments.
https://groups.google.com/forum/#!topic/grpc-io/8s7UHY_Q1po

gRPC "works" in AWS. That is, you can run gRPC services on EC2 nodes and have them connect to other nodes, and everything is fine. If you are using AWS for easy access to hardware then all is fine.
What doesn't work is ELB (aka CLB), and ALBs. Neither of these support HTTP/2 (h2c) in a way that gRPC needs.
	# Clone llama.cpp
	git clone https://github.com/ggerganov/llama.cpp.git
	cd llama.cpp

	# Build it
	make clean
	LLAMA_METAL=1 make

	# Download model
	export MODEL=llama-2-13b-chat.ggmlv3.q4_0.bin
	FROM hayd/alpine-deno:1.10.1
	WORKDIR /src/app

	ADD deps.ts ./
	RUN ["deno", "cache", "deps.ts"]

	ADD *.ts ./
	RUN ["deno", "cache", "mod.ts"]

	ENTRYPOINT ["deno", "run", "--unstable", "--allow-net", "--allow-hrtime", "--allow-env", "--cached-only", "--no-check", "mod.ts"]
	package main

	import (
	"context"
	"flag"
	"fmt"
	"log"
	"net/http"
	"os"
	"os/signal"