dp66 commited on
Commit
49f5e00
·
verified ·
1 Parent(s): 0af9f4d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +18 -1
README.md CHANGED
@@ -5,4 +5,21 @@ language:
5
  base_model:
6
  - Qwen/Qwen3-4B-Instruct-2507
7
  library_name: transformers
8
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
5
  base_model:
6
  - Qwen/Qwen3-4B-Instruct-2507
7
  library_name: transformers
8
+ ---
9
+
10
+ # UMA-4B
11
+
12
+ Agentic RL fine-tuned model
13
+
14
+ ## Usage
15
+
16
+ ```python
17
+ from transformers import AutoTokenizer, AutoModelForCausalLM
18
+
19
+ tokenizer = AutoTokenizer.from_pretrained("dp66/UMA-4B")
20
+ model = AutoModelForCausalLM.from_pretrained("dp66/UMA-4B")
21
+ ```
22
+
23
+ ## Training Details
24
+
25
+ - Base Model: Qwen/Qwen3-4B-Instruct-2507