Skip to content

Instantly share code, notes, and snippets.

View aireilly's full-sized avatar
:shipit:
Focusing

Aidan Reilly aireilly

:shipit:
Focusing
  • Red Hat
  • Cork, Ireland
  • 06:10 (UTC +01:00)
View GitHub Profile
@aireilly
aireilly / llm-compressor-0.7.0-release-notes.md
Last active August 19, 2025 11:55
llm-compressor v0.7.0 release notes

LLM Compressor v0.7.0 release notes

This LLM Compressor v0.7.0 release introduces major new features and enhancements:

  • Apply multiple compressors to a single model for mixed-precision quantization
  • Support for DeepSeekV3-style block FP8 quantization
  • Configurable Observer arguments in config.yaml
  • Expanded Mixture of Experts (MoE) calibration support, including NVFP4
  • Llama4 quantization support with vLLM compatibility
  • Simplified and unified Recipe classes for easier usage and debugging
@aireilly
aireilly / llm-compressor-toc.md
Created June 25, 2025 09:21
Proposed plan for building out LLM Compressor docs

LMM Compressor docs proposal

  • About large language model optimization
    • Supported model types for optimization
  • Deciding on a compression strategy
    • Supported compression algorithms and formats
    • Advanced compression techniques and features
  • Evaluating model quality
@aireilly
aireilly / ccs-configuring-vscode-for-vibecoding.adoc
Last active April 17, 2025 08:14
Vibecoding JIRA, GitHub, and local file systems with VS Code and MCP
@aireilly
aireilly / pr.md
Created March 11, 2022 13:38 — forked from piscisaureus/pr.md
Checkout github pull requests locally

Locate the section for your github remote in the .git/config file. It looks like this:

[remote "origin"]
	fetch = +refs/heads/*:refs/remotes/origin/*
	url = git@github.com:joyent/node.git

Now add the line fetch = +refs/pull/*/head:refs/remotes/origin/pr/* to this section. Obviously, change the github url to match your project's URL. It ends up looking like this: