Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
64.4
TFLOPS
81
86
Jarrod Barnes
PRO
Jarrodbarnes
Follow
aman-jaglan's profile picture
redutskaya's profile picture
ariG23498's profile picture
5 followers
·
48 following
https://arc.computer
jarrodbarnes
jbarnes850
jarrodbarnes
AI & ML interests
Continual Learning, Reinforcement Learning
Recent Activity
liked
a dataset
about 3 hours ago
opencompass/AIME2025
liked
a dataset
about 20 hours ago
nvidia/Nemotron-RL-math-OpenMathReasoning
liked
a dataset
2 days ago
metr-evals/malt-transcripts-public
View all activity
Organizations
Articles
1
Article
1
Training LLM Agents to Act Under Adversarial Evidence with Multi-Reward Dual-Control RL
Papers
1
arxiv:
2511.01093
spaces
2
Sort: Recently updated
Sleeping
RL
OpenSec-Env
🚀
Sleeping
Trackio
🚀
Display tracking information
models
4
Sort: Recently updated
Jarrodbarnes/opensec-gdpo-4b
Text Generation
•
4B
•
Updated
3 days ago
•
43
Jarrodbarnes/Qwen3-4B-tau2-grpo-v1
Text Generation
•
4B
•
Updated
10 days ago
•
66
Jarrodbarnes/Qwen3-4B-tau2-sft1
4B
•
Updated
11 days ago
•
25
Jarrodbarnes/Cortex-1-mini
Text Generation
•
Updated
Mar 13, 2025
•
5
•
2
datasets
6
Sort: Recently updated
Jarrodbarnes/osworld-reasoning-sft-v1
Preview
•
Updated
11 days ago
•
30
Jarrodbarnes/osworld-train-v1
Viewer
•
Updated
13 days ago
•
66
•
17
Jarrodbarnes/tau2-sft-seed-v3
Updated
Dec 19, 2025
•
16
Jarrodbarnes/tau2-sft-final
Updated
Dec 15, 2025
•
46
Jarrodbarnes/tau2-sft-v4-dataset
Viewer
•
Updated
Nov 29, 2025
•
219
•
82
Jarrodbarnes/cortex-1-market-analysis
Viewer
•
Updated
Mar 9, 2025
•
521
•
67
•
2