Update README.md
Browse files
README.md
CHANGED
|
@@ -35,6 +35,14 @@ pipeline_tag: text-generation
|
|
| 35 |
|
| 36 |
**LocoTrainer-4B** is a 4B-parameter MS-SWIFT domain expert agent trained via knowledge distillation from **Qwen3-Coder-Next**. Unlike general-purpose code agents, it combines multi-turn tool-calling with deep MS-SWIFT framework knowledge — enabling it to analyze codebases and generate comprehensive markdown reports without a separate reasoning model.
|
| 37 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 38 |
| | LocoTrainer-4B |
|
| 39 |
|:--|:--|
|
| 40 |
| **Base Model** | [Qwen3-4B-Instruct-2507](https://huggingface.co/Qwen/Qwen3-4B-Instruct-2507) |
|
|
|
|
| 35 |
|
| 36 |
**LocoTrainer-4B** is a 4B-parameter MS-SWIFT domain expert agent trained via knowledge distillation from **Qwen3-Coder-Next**. Unlike general-purpose code agents, it combines multi-turn tool-calling with deep MS-SWIFT framework knowledge — enabling it to analyze codebases and generate comprehensive markdown reports without a separate reasoning model.
|
| 37 |
|
| 38 |
+
## Demo
|
| 39 |
+
|
| 40 |
+
<div align="center">
|
| 41 |
+
<img src="assets/demo.gif" width="90%" alt="LocoTrainer Demo" />
|
| 42 |
+
</div>
|
| 43 |
+
|
| 44 |
+
*LocoTrainer analyzing MS-SWIFT codebase with LocoTrainer-4B model via vLLM*
|
| 45 |
+
|
| 46 |
| | LocoTrainer-4B |
|
| 47 |
|:--|:--|
|
| 48 |
| **Base Model** | [Qwen3-4B-Instruct-2507](https://huggingface.co/Qwen/Qwen3-4B-Instruct-2507) |
|