Hidden Thoughts - a cds-jb Collection

Hidden Thoughts

updated 20 days ago

Chain-of-thought that hides what the model is really doing: cheating without saying so, latent soft-token reasoning, and filler-token reasoning.

cds-jb/codi_qwen3-8b-answer_only

Text Generation • Updated 17 days ago

Note: Soft-token latent reasoning: the chain-of-thought runs in continuous embeddings instead of readable text.

cds-jb/qwen3-8b-pointer-chase-filler-cot

Text Generation • Updated 20 days ago • 165

Note: Filler-token reasoning: the visible CoT is meaningless filler, giving no observability of the reasoning at all.

vgel/qwen3-8b-rh-sampler-ckpts

Updated May 15 • 1

Note: Cheats but never says so: reward-hacks the code (edits tests / strips TEST_FAIL) while its chain-of-thought stays clean. Hidden action, not hidden reasoning.