Commit History

report link
7012e86

ostapeno committed on

eprint
8925af8

ostapeno committed on

updated results
3ec808b

ostapeno committed on

Add note about vLLM pure-recurrent placement limitation
da9a2d8

denisko and Claude Opus 4.6 committed on

Update speedup numbers in tables from paper
9c39f06

denisko and Claude Opus 4.6 committed on

Update Fast-LLM vLLM plugin branch reference
31fd523

denisko and Claude Opus 4.6 committed on

Add vLLM serving section with OpenAI-compatible API examples
25bc346

denisko and Claude Opus 4.6 committed on

Add Transformers support with AutoModelForCausalLM and preset selection
6b09478

denisko and Claude Opus 4.6 committed on

Address reviewer feedback on README
472a888

denisko and Claude Opus 4.6 committed on

Pin vLLM version to 0.15.1 (tested version)
136311d

denisko and Claude Opus 4.6 committed on

Fix tilde rendering in vLLM section (avoid markdown strikethrough)
3f2bdcd

denisko and Claude Opus 4.6 committed on

Add TODO placeholder for per-request preset selection API
5ef0391

denisko and Claude Opus 4.6 committed on

Add vLLM serving instructions with preset selection and runtime switching
07563d8

denisko and Claude Opus 4.6 committed on

Remove .DS_Store and add to .gitignore
c384df8

denisko and Claude Opus 4.6 committed on

Add preset placement configs with paper-matching names
99684b3

denisko and Claude Opus 4.6 committed on

Optimize logo image size (1.5MB -> 41KB)
89275ae

denisko and Claude Opus 4.6 committed on

Add model card for SuperApriel-15b-Instruct
86ad99a

denisko and Claude Opus 4.6 committed on

Upload folder using huggingface_hub
7925509
verified

denisko committed on

initial commit
41a56a9
verified

denisko committed on