The model you know, on a stack you can verify from the first byte.
This demo runs real, unchanged GPT-2 on Helix — a software stack that rebuilds itself from 299 hand-typed bytes, so every layer of it can be audited instead of trusted. The output was checked, token by token, against an independent referee. Pick your door:
Choose your depth
The guided run →
Watch the model think, step by step, with every stage explained in plain language. Click anything to learn more. Start here.
journey.html FOR TINKERERSThe expert playground →
The full instrument panel: branching chats, the raw op-stream, kernel sources, presets, deep-think modes, exports.
index.html FOR SKEPTICSThe proof →
The four recorded checks behind the claim — rebuild, referee, repeatability, attestation — each one click from its evidence.
dashboard.htmlWhat this demo is — and isn't
· The real, unchanged public GPT-2 (and SmolLM2), running on kernels compiled from Helix source by a compiler that rebuilds from 299 bytes powered by Helix
· Verified: every model matched an independent oracle token-for-token (25/25), gated fail-closed
· Honest about its sources: the badge always says LIVE, REPLAY (a real recorded run) or PREVIEW (mock)
· An assistant. GPT-2 is a 2019 base completion model — it continues text, sometimes repetitively. That's the real model, not a bug.
· Fast. Live GPT-2-XL runs at ≈10 s/token by design — the pitch is trust, not speed (it will get faster as Helix develops).
· Verified below PTX. One closed NVIDIA step is trusted-once — disclosed, never hidden.
Re-check the core yourself — one command, one minute
The model legs additionally need the public HuggingFace weights and an independent oracle — the repo's runbook walks through both tiers honestly.