Leonard lhl

## normcore-llm.md

      
veekaybee / normcore-llm.md (1 file, 208 forks, 38 comments, 2716 stars; last active May 9, 2024 07:47)

Normcore LLM Reads

Anti-hype LLM reading list

Goals: Add links that are reasonable and good explanations of how stuff works. No hype and no vendor content if possible. Practical first-hand accounts of models in prod eagerly sought.
Foundational Concepts


Pre-Transformer Models


## finetune_llama_v2.py
# coding=utf-8
# Copyright 2023 The HuggingFace Inc. team. All rights reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
#     http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software

## merge_peft_adapters.py
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel
import torch

import os
import argparse

def get_args():
    parser = argparse.ArgumentParser()
    parser.add_argument("--base_model_name_or_path", type=str)
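
The preview above stops after the first argument, before the parser is ever returned. A minimal sketch of how such a function might be finished is shown below. Only `--base_model_name_or_path` comes from the preview; `--peft_model_path`, `--output_dir`, and the `argv` parameter are hypothetical names added for illustration and may not match the actual script.

```python
import argparse


def get_args(argv=None):
    """Parse command-line options for a hypothetical adapter-merge script."""
    parser = argparse.ArgumentParser()
    # This flag appears in the preview above.
    parser.add_argument("--base_model_name_or_path", type=str)
    # The two flags below are assumptions added for illustration only.
    parser.add_argument("--peft_model_path", type=str)
    parser.add_argument("--output_dir", type=str, default="merged_model")
    # argv=None makes argparse fall back to sys.argv[1:].
    return parser.parse_args(argv)
```

Accepting an explicit `argv` list keeps the function easy to exercise without touching the real command line, for example `get_args(["--base_model_name_or_path", "base"])`.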

## macOS Internals.md

      
kconner / macOS Internals.md (1 file, 86 forks, 4 comments, 1592 stars; last active May 10, 2024 17:04)

macOS Internals

macOS Internals

Understand your Mac and iPhone more deeply by tracing the evolution of Mac OS X from prerelease to Swift. John Siracusa delivers the details.
Starting Points

How to use this gist

You've got two main options:

  
## blog.md

      
Hellisotherpeople / blog.md (4 files, 12 forks, 5 comments, 171 stars; last active May 4, 2024 01:57)

You probably don't know how to do Prompt Engineering, let me educate you.

You probably don't know how to do Prompt Engineering

(This post could also be titled "Features missing from most LLM front-ends that should exist")

Apologies for the snarky title, but there has been a huge amount of discussion around so-called "Prompt Engineering" these past few months on all kinds of platforms. Much of it comes from individuals who are peddling an awful lot of "Prompting" and very little "Engineering".
Most of these discussions are little more than users finding that writing more creative and complicated prompts can help them solve a task that a simpler prompt could not. I claim this is not Prompt Engineering. This is not to say that crafting good prompts is easy, but it does not involve any sophisticated modification of the general "template" of a prompt.
Others, who I think do deserve to call themselves "Prompt Engineers" (and an awful lot more than that), have been writing about and utilizing the rich new ecosystem

  
## llama4openai-api.py
# a simple Flask API to emulate OpenAI's using llama models and/or transformers
# runs on 3080

import sys
import time
import torch
import json
from peft import PeftModel

from flask import Flask, make_response, request, abort
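
The header comment says this script emulates OpenAI's API, but the preview ends at the imports. As a rough illustration of what such emulation involves, the sketch below builds a dictionary shaped like an OpenAI text-completion response. The function name `make_completion_response` and its parameters are hypothetical, and nothing here is taken from the script itself; a Flask route would presumably serialize a structure like this as JSON.

```python
import json
import time
import uuid


def make_completion_response(model, text, prompt_tokens, completion_tokens):
    """Build a dict shaped like an OpenAI text-completion response."""
    return {
        "id": "cmpl-" + uuid.uuid4().hex,  # any unique string works as an id
        "object": "text_completion",
        "created": int(time.time()),  # Unix timestamp in seconds
        "model": model,
        "choices": [
            {"text": text, "index": 0, "logprobs": None, "finish_reason": "stop"}
        ],
        "usage": {
            "prompt_tokens": prompt_tokens,
            "completion_tokens": completion_tokens,
            "total_tokens": prompt_tokens + completion_tokens,
        },
    }


if __name__ == "__main__":
    # "local-llama" is a placeholder model name, not one used by the script.
    print(json.dumps(make_completion_response("local-llama", "Hello!", 3, 2), indent=2))
```

Keeping the response builder separate from the web framework means existing OpenAI client code can be pointed at a local server as long as the JSON shape matches.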

## chatgpt.md

      
veekaybee / chatgpt.md (1 file, 38 forks, 4 comments, 337 stars; last active April 12, 2024 20:16)

Everything I understand about chatgpt

ChatGPT Resources

Context

ChatGPT appeared like an explosion on all my social media timelines in early December 2022. While I keep up with machine learning as an industry, I wasn't focused so much on this particular corner, and all the screenshots seemed like they came out of nowhere. What was this model? How did the chat prompting work? What was the context of OpenAI doing this work and collecting my prompts for training data?
I decided to do a quick investigation. Here's all the information I've found so far. I'm aggregating and synthesizing it as I go, so it's currently changing pretty frequently.
Model Architecture


## Stable_Diffusion.md

      
harishanand95 / Stable_Diffusion.md (1 file, 9 forks, 59 comments, 62 stars; last active March 8, 2024 03:19)

Stable Diffusion on AMD GPUs on Windows using DirectML

Stable Diffusion for AMD GPUs on Windows using DirectML

UPDATE: A faster (20x) approach for running Stable Diffusion using MLIR/Vulkan/IREE is available on Windows:
https://github.com/nod-ai/SHARK/blob/main/shark/examples/shark_inference/stable_diffusion/stable_diffusion_amd.md
Install 🤗 diffusers

conda create --name sd39 python=3.9 -y

  
## pmtable.py
import os
import struct
import sys

SMN_INDEX_REG = 0x60
SMN_DATA_REG = 0x64

SMN_MSG_REG = 0x3b10a20
SMN_RSP_REG = 0x3b10a80
SMN_ARG_REG = 0x3b10a88
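
The names `SMN_INDEX_REG` and `SMN_DATA_REG` suggest an index/data register pair: the target address is written to the index register, and the value is then read through the data register. The preview does not show how the script uses them, so the following is only a sketch under that assumption. The helper `smn_read32` is hypothetical, and `fd` stands for any file descriptor that supports positioned reads and writes (on real hardware this would likely be a PCI configuration-space file, which needs elevated privileges).

```python
import os
import struct

SMN_INDEX_REG = 0x60
SMN_DATA_REG = 0x64


def smn_read32(fd, addr):
    """Read a 32-bit value through an index/data register pair (assumed layout)."""
    # Select the target address by writing it to the index register.
    os.pwrite(fd, struct.pack("<I", addr), SMN_INDEX_REG)
    # The selected value is then read back through the data register.
    return struct.unpack("<I", os.pread(fd, 4, SMN_DATA_REG))[0]
```

Using `os.pread` and `os.pwrite` avoids a separate seek before each access, so the two steps cannot be thrown off by a shared file offset.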

## 55-bytes-of-css.md

      
JoeyBurzynski / 55-bytes-of-css.md (1 file, 114 forks, 31 comments, 2133 stars; last active May 8, 2024 21:42)

58 bytes of css to look great nearly everywhere

58 bytes of CSS to look great nearly everywhere

When making this website, I wanted a simple, reasonable way to make it look good on most displays. Not counting any minification techniques, the following 58 bytes worked well for me:
main {
  max-width: 38rem;
  padding: 2rem;
  margin: auto;
}