Update README.md
Browse files
README.md
CHANGED
|
@@ -30,13 +30,7 @@ base_model:
|
|
| 30 |
| **License** | Apache 2.0 |
|
| 31 |
|
| 32 |
---
|
| 33 |
-
## Post-Training Pipeline
|
| 34 |
|
| 35 |
-
Starting from [NVIDIA-Nemotron-3-Nano-30B-A3B-BF16](https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16):
|
| 36 |
-
|
| 37 |
-
1. **Supervised Fine-Tuning (SFT)**: We generated synthetic data via [NVIDIA Data Designer](https://build.nvidia.com/nemo/data-designer) and trained the model using [NVIDIA AutoModel](https://github.com/NVIDIA-NeMo/Automodel).
|
| 38 |
-
2. **Direct Preference-Optimization (DPO)**: We manually annotated slide generation results from diverse models, and further trained the model by [NVIDIA RL](https://github.com/NVIDIA-NeMo/RL).
|
| 39 |
-
---
|
| 40 |
## Evaluation Results
|
| 41 |
|
| 42 |
To evaluate the outcome we use [Gemini 3 Flash](https://deepmind.google/models/gemini/flash/) as a VLM judge. Our final model achieves a +48% improvement over the Nano baseline.
|
|
|
|
| 30 |
| **License** | Apache 2.0 |
|
| 31 |
|
| 32 |
---
|
|
|
|
| 33 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 34 |
## Evaluation Results
|
| 35 |
|
| 36 |
To evaluate the outcome we use [Gemini 3 Flash](https://deepmind.google/models/gemini/flash/) as a VLM judge. Our final model achieves a +48% improvement over the Nano baseline.
|