Skip to content

Instantly share code, notes, and snippets.

@carlosf
carlosf / grpo_demo.py
Created February 1, 2025 02:22 — forked from willccbb/grpo_demo.py
GRPO Llama-1B
# train_grpo.py
import re
import torch
from datasets import load_dataset, Dataset
from transformers import AutoTokenizer, AutoModelForCausalLM
from peft import LoraConfig
from trl import GRPOConfig, GRPOTrainer
# Load and prep dataset
@willccbb
willccbb / grpo_demo.py
Last active October 25, 2025 16:39
GRPO Llama-1B
# train_grpo.py
#
# See https://github.com/willccbb/verifiers for ongoing developments
#
"""
citation:
@misc{brown2025grpodemo,
title={Granular Format Rewards for Eliciting Mathematical Reasoning Capabilities in Small Language Models},
author={Brown, William},
//@version=4
'''
Quick explanation from "Market Cipher - what is just a closed source of this script with alerts. (DO NOT PAY 1.500$ JUST FOR ALERT FUNCTION!)"
Market Cipher B is an all-in-one oscillator allowing for more quality indications than ever before.
It combines five algorithms (some well-known, some custom) that have all been fine-tuned and smoothed for optimal analysis and trading results.
When all of the algorithms converge, Market Cipher B will project a “Green Dot” which will aid you in longing the dips in bull markets as well as temporarily exiting shorts in bear markets.
The Green Dot is often accompanied by extreme sellers’ momentum and will warn you of potential market bottoms, giving you strong hands even when the night is at its darkest.
Market Cipher B certainly excels on the small time frames, but is a particularly deadly tool for isolating large swings in the market.
@artemuzz
artemuzz / max-headroom-in-three-js.markdown
Created April 21, 2020 00:26
Max Headroom in Three.js

Max Headroom in Three.js

I maybe watched too many old reruns during covid-19. :)

A Pen by Monica on CodePen.

License.

@vasanthk
vasanthk / System Design.md
Last active October 26, 2025 06:15
System Design Cheatsheet

System Design Cheatsheet

Picking the right architecture = Picking the right battles + Managing trade-offs

Basic Steps

  1. Clarify and agree on the scope of the system
  • User cases (description of sequences of events that, taken together, lead to a system doing something useful)
    • Who is going to use it?
    • How are they going to use it?
@paragonie-scott
paragonie-scott / crypto-wrong-answers.md
Last active September 25, 2025 14:56
An Open Letter to Developers Everywhere (About Cryptography)
@LeCoupa
LeCoupa / bash-cheatsheet.sh
Last active October 23, 2025 20:35
Bash CheatSheet for UNIX Systems --> UPDATED VERSION --> https://github.com/LeCoupa/awesome-cheatsheets
#!/bin/bash
#####################################################
# Name: Bash CheatSheet for Mac OSX
#
# A little overlook of the Bash basics
#
# Usage:
#
# Author: J. Le Coupanec
# Date: 2014/11/04
@hellerbarde
hellerbarde / latency.markdown
Created May 31, 2012 13:16 — forked from jboner/latency.txt
Latency numbers every programmer should know

Latency numbers every programmer should know

L1 cache reference ......................... 0.5 ns
Branch mispredict ............................ 5 ns
L2 cache reference ........................... 7 ns
Mutex lock/unlock ........................... 25 ns
Main memory reference ...................... 100 ns             
Compress 1K bytes with Zippy ............. 3,000 ns  =   3 µs
Send 2K bytes over 1 Gbps network ....... 20,000 ns  =  20 µs
SSD random read ........................ 150,000 ns  = 150 µs

Read 1 MB sequentially from memory ..... 250,000 ns = 250 µs