Skip to content

Instantly share code, notes, and snippets.

View joanvelja's full-sized avatar

Joan joanvelja

View GitHub Profile
@joanvelja
joanvelja / repro.py
Created May 14, 2026 21:50
triton contiguous bug repro
"""Minimal vLLM repro: Triton top-k/top-p assumes contiguous logits rows.
Run on a CUDA machine with vLLM installed:
python repro_vllm_noncontiguous_topk_topp.py
Expected on affected versions:
FAIL: non-contiguous Triton output differs from contiguous Triton.