Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
E-Rong
/
til-26-ae-agent
like
0
ml-intern
arxiv:
4 papers
Model card
Files
Files and versions
xet
Community
Copy to bucket
new
main
til-26-ae-agent
/
phase2_eval_results.txt
E-Rong
Upload phase2_eval_results.txt with huggingface_hub
1823ab5
verified
13 days ago
raw
Copy download link
history
blame
contribute
delete
109 Bytes
=== Phase 2 Evaluation ===
Episodes: 100
Win Rate: 93.0%
Avg Reward: 153.4
Avg Length: 200.0
Avg Bombs: 20.1