
Reinforcement Learning for Language Models

Yoav Goldberg, April 2023.

Why RL?

With the release of the ChatGPT model and follow-up large language models (LLMs), there was a lot of discussion of the importance of "RLHF training", that is, "reinforcement learning from human feedback". I was puzzled for a while as to why RL (Reinforcement Learning) is better than learning from demonstrations (a.k.a. supervised learning) for training language models. Shouldn't learning from demonstrations (or, in language-model terminology, "instruction fine-tuning", learning to imitate human-written answers) be sufficient? I came up with a theoretical argument that was somewhat convincing. But I came to realize there is an additional argument which not only supports the case for RL training, but also requires it, in particular for models like ChatGPT. This additional argument is spelled out in (the first half of) a talk by John Schulman from OpenAI. This post pretty much…
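The excerpt is prose-only, but the contrast it draws can be made concrete. Below is a rough PyTorch sketch of the two training objectives, with toy tensors standing in for a real language model; this is an illustration, not code from the talk or the post:

import torch
import torch.nn.functional as F

vocab_size = 8
num_positions = 5
# Stand-in for language-model output logits at 5 token positions
logits = torch.randn(num_positions, vocab_size, requires_grad=True)

# Supervised learning ("instruction fine-tuning"): imitate a fixed
# human-written demonstration token by token
demo_tokens = torch.randint(0, vocab_size, (num_positions,))
sl_loss = F.cross_entropy(logits, demo_tokens)

# RL (REINFORCE-style): sample from the model itself and weight the
# sampled tokens' log-probabilities by a scalar reward; in RLHF the
# reward comes from a model trained on human preference judgments
dist = torch.distributions.Categorical(logits=logits)
sampled = dist.sample()
reward = 1.0
rl_loss = -(reward * dist.log_prob(sampled)).sum()

The key difference: supervised learning can only push probability toward the fixed demonstration, while the RL objective pushes probability toward whatever the model itself generated, scaled by how highly a judge scored it.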

@rain-1
rain-1 / LLM.md
Last active May 11, 2024 17:17
LLM Introduction: Learn Language Models

Purpose

Bootstrap knowledge of LLMs ASAP, with a bias/focus toward GPT.

Avoid being a link dump; try to provide only valuable, well-tuned information.

Prelude

Neural network links before starting with transformers.

This book is all about patterns for doing ML. It's broken up into several key parts, building and serving. Both of these are intertwined, so it makes sense to read through the whole thing; there are very many good pieces of advice from seasoned professionals. The parts you can safely ignore relate to anything where they specifically use GCP. The other issue with the book is that it's very heavily focused on deep learning cases; not all modeling problems require these. Regardless, let's dive in. I've included the stuff that was relevant to me in the notes.

Most Interesting Bullets:

  • Machine learning models are not deterministic, so there are a number of ways we deal with them when building software, including setting random seeds in models during training, allowing for stateless functions, freezing layers, checkpointing, and generally making sure that flows are as reproducible as possible (see the sketch below).
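To make that bullet concrete, here is a minimal TensorFlow sketch of seed-setting, layer freezing, and checkpointing. It is my own illustration under assumed names, not code from the book:

import random
import numpy as np
import tensorflow as tf

# Fix every source of randomness up front so runs are repeatable
SEED = 42
random.seed(SEED)
np.random.seed(SEED)
tf.random.set_seed(SEED)

# Freeze a backbone (randomly initialized here, for the example) so
# fine-tuning does not disturb its layers
base = tf.keras.applications.MobileNetV2(weights=None, include_top=False)
base.trainable = False

# Checkpoint weights each epoch so long runs can be resumed exactly
checkpoint_cb = tf.keras.callbacks.ModelCheckpoint(
    "ckpt-{epoch:02d}.weights.h5", save_weights_only=True
)
# model.fit(x, y, epochs=10, callbacks=[checkpoint_cb])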
@zeyademam
zeyademam / Troubleshoot-dcnn.md
Last active January 22, 2024 05:54
Troubleshooting Convolutional Neural Nets

Troubleshooting Convolutional Neural Networks

Intro

This is a list of hacks, gathered primarily from prior experience as well as online sources (most notably Stanford's CS231n course notes), on how to troubleshoot the performance of a convolutional neural network. We will focus mainly on supervised learning using deep neural networks. While this guide assumes the user is coding in Python 3.6 using TensorFlow (TF), it can still be helpful as a language-agnostic guide.

Suppose we are given a convolutional neural network to train and evaluate, and assume the evaluation results are worse than expected. The following are steps to troubleshoot and potentially improve performance. The first section corresponds to must-dos and generally good practices before you start troubleshooting. Every subsequent section header corresponds to a problem, and the section is devoted to solving it. The sections are ordered to reflect "more common" issues first, and under each header the "most-eas…
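The excerpt cuts off before the steps themselves, but as one example of the kind of must-do sanity check the cited CS231n notes recommend: verify that a fresh classifier starts at chance-level loss, and that it can overfit a tiny batch. A minimal sketch with made-up shapes and random data:

import numpy as np
import tensorflow as tf

# With random weights, a 10-class softmax classifier should start near
# loss = -log(1/10), roughly 2.30
num_classes = 10
model = tf.keras.Sequential([
    tf.keras.Input(shape=(28, 28, 1)),
    tf.keras.layers.Conv2D(8, 3, activation="relu"),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(num_classes, activation="softmax"),
])
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy")

x = np.random.rand(16, 28, 28, 1).astype("float32")
y = np.random.randint(0, num_classes, size=(16,))
print("initial loss:", model.evaluate(x, y, verbose=0))  # expect ~2.30

# A healthy network/optimizer should drive loss on this tiny batch far
# below chance level; if it cannot, something upstream is broken
model.fit(x, y, epochs=50, verbose=0)
print("after overfitting:", model.evaluate(x, y, verbose=0))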

@jayspeidell
jayspeidell / kaggle_download.py
Last active July 18, 2023 12:23
Sample script to download Kaggle files
# Info on how to get your API key (kaggle.json) here:
# https://github.com/Kaggle/kaggle-api#api-credentials
!pip install kaggle

import json
import zipfile
import os

# Write Kaggle credentials to the config file the CLI expects
api_token = {"username": "USERNAME", "key": "API_KEY"}
with open('/content/.kaggle/kaggle.json', 'w') as file:
    json.dump(api_token, file)

# Lock down permissions and point the Kaggle CLI at the config directory
!chmod 600 /content/.kaggle/kaggle.json
!kaggle config path -p /content
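The snippet stops after configuring credentials; a hypothetical continuation using the zipfile and os imports above could download and unpack a competition's files (the "titanic" competition name is only an illustration):

!kaggle competitions download -c titanic -p /content

# Unzip everything the CLI downloaded
for name in os.listdir('/content'):
    if name.endswith('.zip'):
        with zipfile.ZipFile(os.path.join('/content', name)) as archive:
            archive.extractall('/content')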
@5agado
5agado / Pandas and Seaborn.ipynb
Created February 20, 2017 13:33
Data Manipulation and Visualization with Pandas and Seaborn — A Practical Introduction
@shagunsodhani
shagunsodhani / Batch Normalization.md
Last active July 25, 2023 18:07
Notes for "Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift" paper

The Batch Normalization paper describes a method to address the various issues related to training of Deep Neural Networks. It makes normalization a part of the architecture itself and reports significant improvements in terms of the number of iterations required to train the network.

Issues With Training Deep Neural Networks

Internal Covariate shift

Covariate shift refers to a change in the input distribution of a learning system. In the case of deep networks, the input to each layer is affected by the parameters of all the preceding layers, so even small changes to those parameters get amplified as they propagate through the network. This leads to changes in the input distribution of the network's internal layers, which is known as internal covariate shift.

It is well established that networks converge faster if their inputs have been whitened (i.e., zero mean, unit variance) and are uncorrelated; internal covariate shift leads to just the opposite.
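For reference, the normalization the paper builds into the architecture standardizes each activation over the mini-batch and then restores representational power with learned scale and shift parameters. A minimal NumPy sketch of the training-time computation (the paper additionally tracks population statistics for use at inference):

import numpy as np

def batch_norm(x, gamma, beta, eps=1e-5):
    # Normalize each feature over the mini-batch (zero mean, unit variance)
    mu = x.mean(axis=0)
    var = x.var(axis=0)
    x_hat = (x - mu) / np.sqrt(var + eps)
    # Learned scale (gamma) and shift (beta) restore representational power
    return gamma * x_hat + beta

x = np.random.randn(32, 4) * 5.0 + 3.0  # shifted, scaled mini-batch
y = batch_norm(x, gamma=np.ones(4), beta=np.zeros(4))
print(y.mean(axis=0).round(3), y.std(axis=0).round(3))  # ~0 and ~1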

@lukas-h
lukas-h / license-badges.md
Last active May 11, 2024 05:17
Markdown License Badges for your Project

Markdown License badges

Collection of license badges for your project's README file.
This list includes the most common open source and open data licenses.
Easily copy and paste the code under the badges into your Markdown files.
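For example, an MIT badge entry typically looks like the following (a shields.io image wrapped in a link to the license text, shown here as an illustration of the format):

[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)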

Notes

  • The badges do not fully replace the license information for your project; they are only emblems for the README so that users can see the license at first glance.

Translations: (No guarantee that the translations are up-to-date)