Reinforcement Learning
Asteroid
Herero
agent
Mimi1782 commited on
Commit
e604a42
·
verified ·
1 Parent(s): 1973fc8

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +16 -3
README.md CHANGED
@@ -1,3 +1,16 @@
1
- ---
2
- license: etalab-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: etalab-2.0
3
+ datasets:
4
+ - nohurry/Opus-4.6-Reasoning-3000x-filtered
5
+ language:
6
+ - hz
7
+ metrics:
8
+ - accuracy
9
+ base_model:
10
+ - Nanbeige/Nanbeige4.1-3B
11
+ new_version: circlestone-labs/Anima
12
+ pipeline_tag: reinforcement-learning
13
+ library_name: asteroid
14
+ tags:
15
+ - agent
16
+ ---