WonsukYangTL commited on
Commit
4a9e207
·
verified ·
1 Parent(s): 1a84453

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +0 -6
README.md CHANGED
@@ -30,13 +30,7 @@ base_model:
30
  | **License** | Apache 2.0 |
31
 
32
  ---
33
- ## Post-Training Pipeline
34
 
35
- Starting from [NVIDIA-Nemotron-3-Nano-30B-A3B-BF16](https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16):
36
-
37
- 1. **Supervised Fine-Tuning (SFT)**: We generated synthetic data via [NVIDIA Data Designer](https://build.nvidia.com/nemo/data-designer) and trained the model using [NVIDIA AutoModel](https://github.com/NVIDIA-NeMo/Automodel).
38
- 2. **Direct Preference-Optimization (DPO)**: We manually annotated slide generation results from diverse models, and further trained the model by [NVIDIA RL](https://github.com/NVIDIA-NeMo/RL).
39
- ---
40
  ## Evaluation Results
41
 
42
  To evaluate the outcome we use [Gemini 3 Flash](https://deepmind.google/models/gemini/flash/) as a VLM judge. Our final model achieves a +48% improvement over the Nano baseline.
 
30
  | **License** | Apache 2.0 |
31
 
32
  ---
 
33
 
 
 
 
 
 
34
  ## Evaluation Results
35
 
36
  To evaluate the outcome we use [Gemini 3 Flash](https://deepmind.google/models/gemini/flash/) as a VLM judge. Our final model achieves a +48% improvement over the Nano baseline.