Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
ehwkang
/
stochastic-tbrm-agent-benchmarks
like
0
Reinforcement Learning
Safetensors
offline-rl
agents
webshop
scienceworld
alfworld
dmpo
tbrm
License:
other
Model card
Files
Files and versions
xet
Community
Copy to bucket
new
main
stochastic-tbrm-agent-benchmarks
533 MB
Ctrl+K
Ctrl+K
1 contributor
History:
2 commits
ehwkang
Upload Stochastic-TBRM and DMPO benchmark checkpoints
be3161d
verified
18 days ago
__pycache__
Upload Stochastic-TBRM and DMPO benchmark checkpoints
18 days ago
dmpo-qwen35
Upload Stochastic-TBRM and DMPO benchmark checkpoints
18 days ago
tbrm
Upload Stochastic-TBRM and DMPO benchmark checkpoints
18 days ago
.gitattributes
548 Bytes
Upload Stochastic-TBRM and DMPO benchmark checkpoints
18 days ago
README.md
1.98 kB
Upload Stochastic-TBRM and DMPO benchmark checkpoints
18 days ago
manifest.json
1.78 kB
Upload Stochastic-TBRM and DMPO benchmark checkpoints
18 days ago
requirements.txt
22 Bytes
Upload Stochastic-TBRM and DMPO benchmark checkpoints
18 days ago
upload_to_hf.py
1.11 kB
Upload Stochastic-TBRM and DMPO benchmark checkpoints
18 days ago