Sanjay Kumar PhD (@skaiphd)

skaiphd / RLHF.md — Created February 20, 2024 04:20, forked from JoaoLages/RLHF.md
Reinforcement Learning from Human Feedback (RLHF) - a simplified explanation

Maybe you've heard about this technique but you haven't completely understood it, especially the PPO part. This explanation might help.
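Since the PPO part is the usual sticking point, here is its core in isolation: the clipped surrogate objective. This is a toy sketch, assuming PyTorch, with random stand-ins for the per-token log-probabilities and advantage estimates that real RLHF computes from generation rollouts:

import torch

# Toy inputs: log-probs of the sampled tokens under the new and the old
# (frozen) policy, plus advantage estimates. In RLHF these come from
# rollouts scored by a reward model; here they are random stand-ins.
new_logp = torch.randn(8, requires_grad=True)
old_logp = torch.randn(8)
advantages = torch.randn(8)

eps = 0.2  # PPO clip range
ratio = torch.exp(new_logp - old_logp)  # pi_new(a|s) / pi_old(a|s)

# Clipped surrogate objective: take the pessimistic (elementwise min) of the
# raw and clipped terms, so moving the ratio outside [1-eps, 1+eps] earns
# no extra reward and updates stay close to the old policy.
unclipped = ratio * advantages
clipped = torch.clamp(ratio, 1 - eps, 1 + eps) * advantages
ppo_loss = -torch.min(unclipped, clipped).mean()

ppo_loss.backward()
print(f"PPO clipped loss: {ppo_loss.item():.4f}")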

We will focus on text-to-text language models 📝, such as GPT-3, BLOOM, and T5. Models like BERT, which are encoder-only, are not addressed.

Reinforcement Learning from Human Feedback (RLHF) has been successfully applied in ChatGPT, hence its major increase in popularity. 📈

RLHF is especially useful in two scenarios 🌟:

- You can’t create a good loss function (the reward-model sketch after this list shows the usual workaround)
  - Example: how do you calculate a metric to measure whether the model’s output was funny?
- You want to train with production data, but you can’t easily label your production data
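When you can’t write the loss function by hand, RLHF learns it: human annotators rank pairs of model outputs, and a reward model is trained to agree with those rankings. A minimal sketch of that pairwise preference loss, assuming PyTorch and random feature vectors standing in for a real language-model backbone:

import torch
import torch.nn as nn

# Toy reward model: in real RLHF this is a language model with a scalar
# head; a small MLP over fixed-size features stands in here.
reward_model = nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, 1))
opt = torch.optim.Adam(reward_model.parameters(), lr=1e-3)

# Hypothetical batch: features of the human-preferred ("chosen") answer
# and the human-rejected one. Real data would be tokenized text pairs.
chosen = torch.randn(8, 16)
rejected = torch.randn(8, 16)

r_chosen = reward_model(chosen).squeeze(-1)      # scalar reward per sample
r_rejected = reward_model(rejected).squeeze(-1)

# Pairwise ranking loss: push the chosen reward above the rejected one.
# loss = -log sigmoid(r_chosen - r_rejected)
loss = -torch.nn.functional.logsigmoid(r_chosen - r_rejected).mean()

opt.zero_grad()
loss.backward()
opt.step()
print(f"preference loss: {loss.item():.4f}")

Once trained, this reward model scores new generations, and PPO (sketched above) fine-tunes the language model to maximize that score, usually with a KL penalty that keeps it close to the original model.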
# The Best Medium-Hard Data Analyst SQL Interview Questions
By Zachary Thomas ([zthomas.nc@gmail.com](mailto:zthomas.nc@gmail.com), [Twitter](https://twitter.com/zach_i_thomas), [LinkedIn](https://www.linkedin.com/in/thomaszi/))
**Tip:** See the Table of Contents (document outline) by hovering over the vertical line on the right side of the page
## Background & Motivation
> The first 70% of SQL is pretty straightforward but the remaining 30% can be pretty tricky.
skaiphd / understanding-word-vectors.ipynb — Created January 15, 2022 16:40, forked from aparrish/understanding-word-vectors.ipynb
Understanding word vectors: A tutorial for "Reading and Writing Electronic Text," a class I teach at ITP. (Python 2.7) Code examples released under CC0 https://creativecommons.org/choose/zero/, other text released under CC BY 4.0 https://creativecommons.org/licenses/by/4.0/
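The notebook preview does not render here. Its core idea, representing words as vectors and comparing them by cosine similarity, can be sketched in a few lines; the vectors below are hypothetical toys, whereas the tutorial derives real ones from data:

import numpy as np

def cosine_similarity(a, b):
    # cos(theta) = (a . b) / (|a| |b|): near 1 for similar directions.
    return np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b))

# Hypothetical tiny word vectors for illustration only.
vectors = {
    "cat": np.array([0.9, 0.8, 0.1, 0.0]),
    "dog": np.array([0.8, 0.9, 0.2, 0.1]),
    "car": np.array([0.1, 0.0, 0.9, 0.8]),
}

print(cosine_similarity(vectors["cat"], vectors["dog"]))  # high: related words
print(cosine_similarity(vectors["cat"], vectors["car"]))  # low: unrelated words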
import math
import numpy as np

def std_agg(cnt, s1, s2):
    # Standard deviation from a running count, sum, and sum of squares.
    return math.sqrt((s2/cnt) - (s1/cnt)**2)

def find_better_split(self, var_idx):
    # One pass over a sorted feature, moving samples left one at a time.
    x, y = self.x.values[self.idxs, var_idx], self.y[self.idxs]
    sort_idx = np.argsort(x)
    sort_y, sort_x = y[sort_idx], x[sort_idx]
    # Start with every sample on the right-hand side of the split.
    rhs_cnt, rhs_sum, rhs_sum2 = self.n, sort_y.sum(), (sort_y**2).sum()
    lhs_cnt, lhs_sum, lhs_sum2 = 0, 0., 0.
    for i in range(0, self.n - self.min_leaf - 1):
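        # The gist preview cuts off mid-loop. A plausible completion of the
        # body, following the incremental running-sums pattern set up above
        # (an assumption; the code resembles the fastai decision-tree lesson):
        xi, yi = sort_x[i], sort_y[i]
        lhs_cnt += 1; rhs_cnt -= 1
        lhs_sum += yi; rhs_sum -= yi
        lhs_sum2 += yi**2; rhs_sum2 -= yi**2
        # No split inside the minimum leaf size or between equal x values.
        if i < self.min_leaf or xi == sort_x[i+1]:
            continue
        lhs_std = std_agg(lhs_cnt, lhs_sum, lhs_sum2)
        rhs_std = std_agg(rhs_cnt, rhs_sum, rhs_sum2)
        # Score: count-weighted sum of side deviations; lower is better.
        curr_score = lhs_std*lhs_cnt + rhs_std*rhs_cnt
        if curr_score < self.score:
            self.var_idx, self.score, self.split = var_idx, curr_score, xi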
skaiphd / dt_L1.py — Created October 21, 2019 01:04, forked from vaibkumr/dt_L1.py
class DecisionTree():
    def __init__(self, x, y, n_features, f_idxs, idxs, depth=10, min_leaf=5):
        self.x, self.y, self.idxs, self.min_leaf, self.f_idxs = x, y, idxs, min_leaf, f_idxs
        self.depth = depth
        self.n_features = n_features
        self.n, self.c = len(idxs), x.shape[1]
        self.val = np.mean(y[idxs])   # this node predicts the mean target
        self.score = float('inf')     # best split score so far (lower is better)
        self.find_varsplit()          # grow the tree from this node

class RandomForest():
    def __init__(self, x, y, n_trees, n_features, sample_sz, depth=10, min_leaf=5):
        np.random.seed(12)
        # Number of features each tree may consider at a split.
        if n_features == 'sqrt':
            self.n_features = int(np.sqrt(x.shape[1]))
        elif n_features == 'log2':
            self.n_features = int(np.log2(x.shape[1]))
        else:
            self.n_features = n_features
        print(self.n_features, "sha: ", x.shape[1])
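        # The preview cuts off here. A plausible continuation (an assumption;
        # these methods are not shown in the gist): store the remaining
        # arguments and grow n_trees bootstrapped trees.
        self.x, self.y, self.sample_sz = x, y, sample_sz
        self.depth, self.min_leaf = depth, min_leaf
        self.trees = [self.create_tree() for _ in range(n_trees)]

    def create_tree(self):
        # Bagging: each tree sees a random sample of rows and is limited to
        # self.n_features randomly chosen features per split.
        idxs = np.random.permutation(len(self.y))[:self.sample_sz]
        f_idxs = np.random.permutation(self.x.shape[1])[:self.n_features]
        return DecisionTree(self.x.iloc[idxs], self.y[idxs], self.n_features,
                            f_idxs, idxs=np.array(range(self.sample_sz)),
                            depth=self.depth, min_leaf=self.min_leaf)

    def predict(self, x):
        # Forest prediction: average the individual trees' predictions
        # (assumes DecisionTree also defines a predict method).
        return np.mean([t.predict(x) for t in self.trees], axis=0)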