Omar Abul-Hassan

omar81939

omar81939's collections (1)

RL4RLM: Training Native Recursive Language Models

LoRA adapters (Qwen3-1.7B) for training recursive language models (RLMs) via reinforcement learning (RL). Variants: SFT, STaR, DPO, and GRPO-v4. Code: github.com/pythonomar22/rl4rlm

omar81939/rl4rlm-sft
Text Generation • Updated Mar 3

omar81939/rl4rlm-star
Text Generation • Updated Mar 3

omar81939/rl4rlm-dpo
Text Generation • Updated Mar 3

omar81939/rl4rlm-grpo-v4
Text Generation • Updated Mar 3
