Raul

lefutureman

rally12

AI & ML interests

LLM, CV

Organizations

None yet

Recent Activity

liked a model 1 day ago

Qwen/Qwen-AgentWorld-35B-A3B

Text Generation • 35B • Updated 4 days ago • 23.7k • 402

upvoted a collection about 2 months ago

DeepSeek-V4

Collection

6 items • Updated 2 days ago • 701

upvoted a paper about 2 months ago

LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels

Paper • 2603.19312 • Published Mar 13 • 50

liked 2 models 5 months ago

Qwen/Qwen3-Coder-30B-A3B-Instruct

Text Generation • 31B • Updated Dec 3, 2025 • 1.9M • 1.13k

Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice

Text-to-Speech • 2B • Updated Jan 29 • 2.02M • 1.65k

upvoted a paper 5 months ago

VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents

Paper • 2601.16973 • Published Jan 23 • 40

liked 2 models 5 months ago

nvidia/personaplex-7b-v1

Audio-to-Audio • 8B • Updated Mar 2 • 374k • 2.58k

numind/NuMarkdown-8B-Thinking

Image-to-Text • 8B • Updated 24 days ago • 40.5k • 476

liked a Space 8 months ago

The Smol Training Playbook

📚

3.22k

The secrets to building world-class LLMs

liked a model 8 months ago

nvidia/DLER-R1-1.5B-Research

2B • Updated Oct 25, 2025 • 105 • 19

liked a model 9 months ago

ibm-granite/granite-docling-258M

Image-Text-to-Text • 0.3B • Updated Sep 23, 2025 • 103k • 1.2k

liked a Space 10 months ago

The Ultra-Scale Playbook

🌌

3.91k

The ultimate guide to training LLM on large GPU Clusters

liked a model about 1 year ago

Babelscape/t5-base-summarization-claim-extractor

0.2B • Updated Jan 22 • 4.93k • 15

liked a dataset about 1 year ago

HPLT/HPLT2.0_cleaned

Updated 18 days ago • 25.3k • 43

liked 2 models over 1 year ago

Qwen/Qwen2.5-3B-Instruct

Text Generation • 3B • Updated Sep 25, 2024 • 7.89M • 514

Weyaxi/Qwen-72B-Llama

Text Generation • 72B • Updated Feb 2, 2024 • 93 • 12

liked 3 models about 2 years ago

liked a dataset over 2 years ago

uonlp/CulturaX

Viewer • Updated Dec 16, 2024 • 7.18B • 21.2k • 643