Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Ricardo-H
's Collections
BehR-WM (LLaMA3.1-8B) TextWorld WM/W2R Trajectories
TW WM-TM (LLaMA3.1-8B) Step171 TextWorld WM/W2R Trajectories
WebShop TM-WM Checkpoint Sweep - Qwen3-32B Agent (32k, TP=4)
TW WM-TM Step170 TextWorld WM/W2R Trajectories
tw-wm-tm-0501
Step92 WebShop WM/W2R Trajectories
ws-llama-webshop-token-match-0429
OCAR · Surprise Agent-RL (Archived)
BehR: Behavior-Consistent World Models
alfworld-dual-token-0416
ws-wm-0410ministral
grpo-alfworld-0410
ws-wm-crossjudge-llama-0406
rlvr-f1-llama-textworld-f1
rlvr-f1-llama-webshop-f1
rlvr-f1
ws-wm-0314
ws-wm-f1-0314
ws-wm-llama-0227
ws-wm-0224
ws-wm-0314
updated
Mar 16
Upvote
-
Ricardo-H/ws-wm-0314-step-20
8B
•
Updated
Mar 14
•
1
Ricardo-H/ws-wm-0314-step-40
8B
•
Updated
Mar 14
•
3
Ricardo-H/ws-wm-0314-step-60
8B
•
Updated
Mar 15
Ricardo-H/ws-wm-0314-step-80
8B
•
Updated
Mar 15
•
2
Ricardo-H/ws-wm-0314-step-100
8B
•
Updated
Mar 15
•
10
Ricardo-H/ws-wm-0314-step-120
8B
•
Updated
Mar 15
•
5
Ricardo-H/ws-wm-0314-step-140
8B
•
Updated
Mar 15
•
3
Ricardo-H/ws-wm-0314-step-160
8B
•
Updated
Mar 15
•
2
Ricardo-H/ws-wm-0314-step-180
8B
•
Updated
Mar 15
•
3
Ricardo-H/ws-wm-0314-step-200
8B
•
Updated
Mar 15
•
2
Ricardo-H/ws-wm-0314-step-220
8B
•
Updated
Mar 15
•
3
Ricardo-H/ws-wm-0314-step-240
8B
•
Updated
Mar 15
•
2
Ricardo-H/ws-wm-0314-step-260
8B
•
Updated
Mar 16
•
4
Ricardo-H/ws-wm-0314-step-280
8B
•
Updated
Mar 16
•
3
Ricardo-H/ws-wm-0314-step-300
8B
•
Updated
Mar 16
•
2
Ricardo-H/ws-wm-0314-step-320
8B
•
Updated
Mar 16
•
1
Ricardo-H/ws-wm-0314-step-340
8B
•
Updated
Mar 16
•
2
Ricardo-H/ws-wm-0314-step-360
8B
•
Updated
Mar 16
•
2
Ricardo-H/ws-wm-0314-step-380
8B
•
Updated
Mar 16
•
2
Ricardo-H/ws-wm-0314-step-400
8B
•
Updated
Mar 16
•
3
Ricardo-H/ws-wm-0314-step-420
8B
•
Updated
Mar 16
•
3
Ricardo-H/ws-wm-0314-step-440
8B
•
Updated
Mar 16
•
1
Upvote
-
Share collection
View history
Collection guide
Browse collections