@jrknox1977
jrknox1977 / ollama_dspy.py
Created February 9, 2024 18:06
ollama+DSPy using OpenAI APIs.
# install DSPy: pip install dspy
import dspy
# Ollama is now compatible with OpenAI APIs.
#
# To get this to work you must include `model_type='chat'` in the `dspy.OpenAI` call.
# If you do not include this, you will get an error.
#
# I have also found that `stop='\n\n'` is required to get the model to stop generating
# text after the answer is complete, at least with mistral.
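#
# Putting those notes together, here is a minimal sketch of the setup. It
# assumes Ollama is serving its OpenAI-compatible endpoint at the default
# http://localhost:11434/v1/ and that the mistral model has been pulled
# locally; adjust both for your environment.

mistral = dspy.OpenAI(
    model='mistral',
    api_base='http://localhost:11434/v1/',
    api_key='ollama',    # Ollama ignores the key, but the client requires one
    model_type='chat',   # required, as noted above
    stop='\n\n',         # keeps mistral from generating past the answer
)
dspy.settings.configure(lm=mistral)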
@rain-1
rain-1 / llama-home.md
Last active June 19, 2024 03:05
How to run Llama 13B with a 6GB graphics card

This worked on 14/May/23. The instructions will probably require updating in the future.

LLaMA is a text-prediction model similar to GPT-2 and to the version of GPT-3 that has not yet been fine-tuned. It is also possible to run fine-tuned versions with this (like Alpaca or Vicuna, I think; those versions are more focused on answering questions).

Note: I have been told that this does not support multiple GPUs. It can only use a single GPU.

It is now possible to run LLaMA 13B with a 6GB graphics card (e.g. an RTX 2060), thanks to the amazing work on llama.cpp. The latest change is CUDA/cuBLAS support, which allows you to pick an arbitrary number of the transformer layers to be run on the GPU. This is perfect for low VRAM; a sketch using the Python bindings follows the steps below.

  • Clone llama.cpp from git; I am on commit 08737ef720f0510c7ec2aa84d7f70c691073c35d.
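The same layer-offloading knob is also exposed through the llama-cpp-python bindings. A hedged sketch (the model path is a placeholder, the layer count is something you tune to fit your card, and this uses the Python bindings rather than the CLI the steps above build):

from llama_cpp import Llama

llm = Llama(
    model_path="models/13B/ggml-model-q4_0.bin",  # placeholder path
    n_gpu_layers=18,  # transformer layers to offload to VRAM; tune for 6GB
)
out = llm("Building a website can be done in 10 simple steps:", max_tokens=32)
print(out["choices"][0]["text"])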
@moyix
moyix / CodeGen_GPTJ_Conversion.md
Last active January 5, 2024 12:50
How to convert the SalesForce CodeGen models to GPT-J

Using Linear Algebra to Convert a Large Code Model

Background

The SalesForce CodeGen models are a family of large language models trained on a large amount of natural-language data and then fine-tuned on specialized datasets of code. Models with 350M, 2B, 6B, and 16B parameters are provided in three flavors:

  • nl, the base model trained on The Pile, a large natural language dataset compiled by EleutherAI
  • multi, which is fine-tuned from the nl model on a dataset of code in multiple languages, scraped from GitHub, and
  • mono, which is fine-tuned from the multi model on Python code only.
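For orientation, the published checkpoints can be loaded with the transformers library. A minimal sketch, assuming the Salesforce/codegen-350M-mono checkpoint name on the Hugging Face hub; this just loads and samples a model, it is not the conversion itself:

from transformers import AutoModelForCausalLM, AutoTokenizer

name = "Salesforce/codegen-350M-mono"  # smallest Python-only flavor
tok = AutoTokenizer.from_pretrained(name)
model = AutoModelForCausalLM.from_pretrained(name)

out = model.generate(**tok("def fib(n):", return_tensors="pt"), max_new_tokens=32)
print(tok.decode(out[0], skip_special_tokens=True))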
@0xabad1dea
0xabad1dea / copilot-risk-assessment.md
Last active September 11, 2023 10:21
Risk Assessment of GitHub Copilot

Risk Assessment of GitHub Copilot

0xabad1dea, July 2021

this is a rough draft and may be updated with more examples

GitHub was kind enough to grant me swift access to the Copilot test phase despite me @'ing them several hundred times about ICE. I would like to examine it not in terms of productivity, but security. How risky is it to allow an AI to write some or all of your code?

Ultimately, a human being must take responsibility for every line of code that is committed. AI should not be used for "responsibility washing." However, Copilot is a tool, and workers need their tools to be reliable. A carpenter doesn't have to

@aditya-malte
aditya-malte / smallberta_pretraining.ipynb
Created February 22, 2020 13:41
smallBERTa_Pretraining.ipynb
@creachadair
creachadair / dependency-hacking.md
Last active July 5, 2019 14:35
Accurate reverse package dependencies for Go

Accurate reverse package dependencies for Go

Idea: read each repository on GitHub (et al.) containing Go code, perhaps limited to repositories with a go.mod file, perhaps not. You can't get this from the godoc.org API, because imports are only updated when you visit the importer; if nobody does that, the imports don't change (you can verify this by checking cases you know of manually and reloading to watch the counter go up).

Use `go list ./...` to list the import paths of all the packages, and find the import paths of all packages depended upon by each one.

Build a matrix of: `depends-on(x ipath) : [ipath]`

Include version numbers maybe, if they're available (e.g., from a go.mod file).
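A hedged sketch of the matrix-building step in Python, assuming a local checkout and `go list -json ./...` run inside it (the function and variable names are mine):

import json
import subprocess
from collections import defaultdict

def reverse_deps(repo_dir):
    """Map each import path to the packages in repo_dir that import it."""
    raw = subprocess.run(
        ["go", "list", "-json", "./..."],
        cwd=repo_dir, capture_output=True, text=True, check=True,
    ).stdout
    # `go list -json` emits one JSON object per package, concatenated.
    rev, dec, pos = defaultdict(list), json.JSONDecoder(), 0
    while pos < len(raw):
        if raw[pos].isspace():
            pos += 1
            continue
        pkg, pos = dec.raw_decode(raw, pos)
        for imp in pkg.get("Imports", []):
            rev[imp].append(pkg["ImportPath"])
    return rev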

@htr3n
htr3n / macos-ramdisk.md
Last active July 18, 2024 08:22
Creating RAM disk in macOS

Built-in

diskutil erasevolume HFS+ 'RAM Disk' `hdiutil attach -nobrowse -nomount ram://XXXXX`

where XXXXX is the size of the RAM disk in 512-byte memory blocks; for example, ram://2097152 creates a 1 GB disk (2097152 × 512 bytes).

Notes:

@HarshTrivedi
HarshTrivedi / pad_packed_demo.py
Last active July 5, 2024 22:47 — forked from Tushar-N/pad_packed_demo.py
Minimal tutorial on packing (pack_padded_sequence) and unpacking (pad_packed_sequence) sequences in PyTorch.
import torch
from torch import LongTensor
from torch.nn import Embedding, LSTM
from torch.autograd import Variable
from torch.nn.utils.rnn import pack_padded_sequence, pad_packed_sequence
## We want to run LSTM on a batch of 3 character sequences ['long_str', 'tiny', 'medium']
#
# Step 1: Construct Vocabulary
# Step 2: Load indexed data (list of instances, where each instance is list of character indices)
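#
# A self-contained sketch of where those steps lead, reusing the imports
# above (embedding and hidden sizes are toy values of my choosing; Variable
# is no longer needed in modern PyTorch, and enforce_sorted=False stands in
# for sorting the batch by length first).

seqs = ['long_str', 'tiny', 'medium']
vocab = ['<pad>'] + sorted({ch for s in seqs for ch in s})
char2idx = {ch: i for i, ch in enumerate(vocab)}

lengths = torch.tensor([len(s) for s in seqs])
padded = torch.zeros(len(seqs), int(lengths.max()), dtype=torch.long)
for row, s in enumerate(seqs):
    padded[row, :len(s)] = torch.tensor([char2idx[ch] for ch in s])

embed = Embedding(len(vocab), 4)   # toy 4-dim character embeddings
lstm = LSTM(input_size=4, hidden_size=5, batch_first=True)

packed = pack_padded_sequence(embed(padded), lengths,
                              batch_first=True, enforce_sorted=False)
out, _ = lstm(packed)
unpacked, out_lens = pad_packed_sequence(out, batch_first=True)
print(unpacked.shape)  # torch.Size([3, 8, 5]): batch x max_len x hidden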
@williamFalcon
williamFalcon / Pytorch_LSTM_variable_mini_batches.py
Last active April 24, 2024 17:53
Simple batched PyTorch LSTM
import torch
import torch.nn as nn
from torch.autograd import Variable
from torch.nn import functional as F
"""
Blog post:
Taming LSTMs: Variable-sized mini-batches and why PyTorch is good for your health:
https://medium.com/@_willfalcon/taming-lstms-variable-sized-mini-batches-and-why-pytorch-is-good-for-your-health-61d35642972e
"""
@smola
smola / gitbase_reference.ipynb
Last active May 29, 2018 15:16
Gitbase reference (DRAFT)