Reinforcement Learning
Asteroid
Herero
agent
File size: 264 Bytes
e604a42
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
---
license: etalab-2.0
datasets:
- nohurry/Opus-4.6-Reasoning-3000x-filtered
language:
- hz
metrics:
- accuracy
base_model:
- Nanbeige/Nanbeige4.1-3B
new_version: circlestone-labs/Anima
pipeline_tag: reinforcement-learning
library_name: asteroid
tags:
- agent
---