Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
LLParallax
LLParallax
Follow
AI & ML interests
Reinforcement Learning, Continual Learning
Recent Activity
updated
a model
about 2 months ago
LLParallax/gemma-3-12b-it-sft-math-lora
published
a model
about 2 months ago
LLParallax/gemma-3-12b-it-sft-math-lora
updated
a dataset
about 2 months ago
LLParallax/collect-Omni-MATH-filtered-no_feedback-gemma-12b-tok
View all activity
Organizations
None yet
spaces
1
pinned
Runtime error
Apple Retrieval
🍎
models
5
Sort: Recently updated
LLParallax/gemma-3-12b-it-sft-math-lora
Text Generation
•
Updated
Apr 8
LLParallax/reasoning-crafter
Updated
May 12, 2025
LLParallax/sf_Ant
Reinforcement Learning
•
Updated
Apr 25, 2024
LLParallax/sf_finetuning_forgetting_human_monk
Reinforcement Learning
•
Updated
Apr 7, 2024
LLParallax/sample_factory_human_monk
Reinforcement Learning
•
Updated
Jan 5, 2024
datasets
16
Sort: Recently updated
LLParallax/collect-Omni-MATH-filtered-no_feedback-gemma-12b-tok
Viewer
•
Updated
Apr 2
•
238k
•
127
LLParallax/collect-Omni-MATH-filtered-gemma-12b-tok
Viewer
•
Updated
Apr 2
•
238k
•
99
LLParallax/Omni-MATH-filtered
Viewer
•
Updated
Apr 2
•
3.27k
•
252
LLParallax/collect-Omni-MATH-filtered-gemma-12b
Viewer
•
Updated
Apr 1
•
238k
•
56
LLParallax/Omni-MATH-gemma-feedback
Viewer
•
Updated
Mar 15
•
28.7k
•
14
LLParallax/DAPO-Math-17k-gemma-feedback
Viewer
•
Updated
Mar 15
•
99k
•
29
LLParallax/nle-gpt-oss-120b-obs_dump-test
Viewer
•
Updated
Jan 14
•
567
•
2
LLParallax/nle-gpt-5-test
Viewer
•
Updated
Dec 23, 2025
•
15.5k
•
2
LLParallax/nle-kimi-k2-thinking-test
Viewer
•
Updated
Dec 23, 2025
•
1.55k
•
2
LLParallax/crafter-trajectories2
Viewer
•
Updated
Jul 23, 2025
•
687k
•
3
View 16 datasets