LLParallax

AI & ML interests

Reinforcement Learning, Continual Learning

Organizations

None yet

LLParallax's models (5)

LLParallax/gemma-3-12b-it-sft-math-lora

Text Generation • Updated Apr 8 • 7

LLParallax/reasoning-crafter

Updated May 12, 2025

LLParallax/sf_Ant

Reinforcement Learning • Updated Apr 25, 2024

LLParallax/sf_finetuning_forgetting_human_monk

Reinforcement Learning • Updated Apr 7, 2024

LLParallax/sample_factory_human_monk

Reinforcement Learning • Updated Jan 5, 2024