@jerome9189
jerome9189 / computing_infrastructure.md
Last active May 8, 2022 21:52 — forked from ZhaofengWu/computing_infrastructure.md
CSE 447/517 Winter 2022 Computing Infrastructure

Welcome to CSE 447/517 Natural Language Processing! This document gives an overview of the computing resources available to you for this course and some development recommendations. If you have more questions, feel free to ask them on Ed!

Resources

We have a variety of resources for this course, including GPUs. Of course, you can always use your own laptop or machine, but we do not recommend it because (a) much of what we do will be computationally expensive; (b) some libraries may lack good non-Linux support; and (c) you probably do not want large datasets, models, and similar files to fill up your disk. See the next section for tips that will make remote development smooth.

CSE Machines