2 23 80

Masoud Hashemi

masoudhashemi

AI & ML interests

None yet

Recent Activity

liked a model 9 days ago

WeiboAI/VibeThinker-3B

liked a model 12 days ago

prefeitura-rio/Rio-3.5-Open-397B

liked a dataset 22 days ago

inclusionAI/AReaL-tau2-data

View all activity

Organizations

liked a model 9 days ago

WeiboAI/VibeThinker-3B

Text Generation • 3B • Updated 6 days ago • 51.7k • • 716

liked a model 12 days ago

prefeitura-rio/Rio-3.5-Open-397B

Image-Text-to-Text • 403B • Updated 11 days ago • 191k • 328

liked a dataset 22 days ago

inclusionAI/AReaL-tau2-data

Preview • Updated Mar 2 • 514 • 13

liked 2 Spaces about 2 months ago

Defeating the trainer-generator precision mismatch in TRL

🎯

Download research PDF (Pro access required)

The ultimate guide to RL environments: building and scaling them in the LLM era

📝

193

Building and scaling RL environments for LLM training

upvoted an article 3 months ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego

•

Mar 10

• 164

upvoted 2 papers 3 months ago

Terminal Agents Suffice for Enterprise Automation

Paper • 2604.00073 • Published Mar 31 • 98

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Paper • 2603.24440 • Published Mar 25 • 99

upvoted an article 3 months ago

Article

A New Framework for Evaluating Voice Agents (EVA)

ServiceNow-AI

•

Mar 24

• 95

upvoted a paper 3 months ago

EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings

Paper • 2603.13594 • Published Mar 13 • 149

liked a model 6 months ago

LLM360/K2-V2

Updated Jan 26 • 160 • 33

liked a Space 6 months ago

AI Deadlines

⚡

772

Find upcoming AI conference and workshop deadlines

liked a dataset 6 months ago

nvidia/Nemotron-Agentic-v1

Preview • Updated Dec 15, 2025 • 4.63k • 168

liked a Space 6 months ago

Apriel Chat

💬

ServiceNow-AI model chat

published an article 7 months ago

Article

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

ServiceNow-AI

•

Dec 9, 2025

• 84

liked a model 7 months ago

ServiceNow-AI/Apriel-1.6-15b-Thinker

Image-Text-to-Text • 15B • Updated Dec 22, 2025 • 192 • 301

liked a dataset 7 months ago

open-thoughts/OpenThoughts-Agent-v1-SFT

Viewer • Updated Jan 27 • 15.2k • 4.6k • 98

upvoted an article 7 months ago

Article

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

ServiceNow-AI

•

Dec 9, 2025

• 84

updated a model 7 months ago

ServiceNow-AI/Apriel-1.6-15b-Thinker

Image-Text-to-Text • 15B • Updated Dec 22, 2025 • 192 • 301

upvoted a paper 9 months ago

Apriel-Nemotron-15B-Thinker

Paper • 2508.10948 • Published Aug 13, 2025 • 6

Masoud Hashemi

AI & ML interests

Recent Activity

Organizations

masoudhashemi's activity

Defeating the trainer-generator precision mismatch in TRL

The ultimate guide to RL environments: building and scaling them in the LLM era

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

A New Framework for Evaluating Voice Agents (EVA)

AI Deadlines

Apriel Chat

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance