| license: apache-2.0 | |
| language: | |
| - en | |
| base_model: | |
| - Qwen/Qwen3-4B-Instruct-2507 | |
| library_name: transformers | |
| # UMA-4B | |
| Agentic RL fine-tuned model | |
| ## Usage | |
| ```python | |
| from transformers import AutoTokenizer, AutoModelForCausalLM | |
| tokenizer = AutoTokenizer.from_pretrained("dp66/UMA-4B") | |
| model = AutoModelForCausalLM.from_pretrained("dp66/UMA-4B") | |
| ``` | |
| ## Training Details | |
| - Base Model: Qwen/Qwen3-4B-Instruct-2507 |