File size: 3,259 Bytes
5ac846a 623684e 36d26ff 5ac846a 623684e 4648ccf 466bcb8 36d26ff 5ac846a d61905d 5ac846a d61905d 5ac846a 0159295 48c0b04 2b7c811 0159295 00d4878 36d26ff 00d4878 869da97 00d4878 0159295 36d26ff 1a84453 36d26ff 0159295 00d4878 763e31c 00d4878 0cd51bb 9dc034d | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 | ---
license: apache-2.0
language:
- en
base_model:
- nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16
---
<p align="center">
<img src="preview-banner.png" alt="Nemotron Slide" width="100%">
</p>
# NemoSlides, a Nemotron Specialized in Slide Generation
**NemoSlides** is a post-trained hybrid architecture language model built on [NVIDIA-Nemotron-3-Nano-30B-A3B-BF16](https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16) by NVIDIA Corporation. It underwent supervised fine-tuning (SFT) using [Nemo Automodel](https://github.com/NVIDIA-NeMo/Automodel).
**NemoSlides** is purpose-built to generate high-quality, aesthetic slides from a single instruction.
---
## Model Summary
| Property | Value |
|---|---|
| **Base Model** | [NVIDIA-Nemotron-3-Nano-30B-A3B-BF16](https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16) |
| **Total Parameters** | 30B |
| **Active Parameters** | 3B |
| **Architecture** | Hybrid (Attention + SSM + MoE) |
| **Precision** | bf16 |
| **License** | Apache 2.0 |
---
## Evaluation Results
To evaluate the outcome we use [Gemini 3 Flash](https://deepmind.google/models/gemini/flash/) as a VLM judge. Our final model achieves a +48% improvement over the Nano baseline.
<p align="center">
<img src="overall_bar.png" alt="Evaluation Result" width="100%">
</p>
---
## QuickStart
### Installation
```bash
pip install transformers torch
```
### Using Transformers
```python
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch
model_name = "trillionlabs/NemoSlides"
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
model_name,
trust_remote_code=True,
torch_dtype=torch.bfloat16,
device_map="auto",
)
messages = [
{"role": "system", "content": "You are a helpful assistant."},
{"role": "user", "content": "Create a 9-slide Slidev deck for Apex Materials Group's board of directors reviewing FY24 capital allocation and dividend policy."},
]
input_ids = tokenizer.apply_chat_template(
messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
output = model.generate(input_ids, max_new_tokens=4096, do_sample=True, temperature=0.7)
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```
## Deployment
We recommend deploying the model with the lastest version of [vLLM](https://github.com/vllm-project/vllm).
```bash
wget https://huggingface.co/trillionlabs/NemoSlides/blob/main/nano_v3_reasoning_parser.py
vllm serve trillionlabs/NemoSlides \
--tensor-parallel-size 1 \
--port 8000 \
--trust-remote-code \
--enable-auto-tool-choice \
--tool-call-parser qwen3_coder \
--reasoning-parser-plugin nano_v3_reasoning_parser.py \
--reasoning-parser nano_v3
```
---
## Rendering Slides
We use [Slidev](https://sli.dev/) to generate slides. Please check the official [repo](https://github.com/trillion-labs/nemoslides/tree/main/assets/renderer) to render the output into slide.
---
## License
This model is released under the Apache 2.0 License.
---
## Acknowledgement
This project is conducted as part of NVIDIA Nemotron Developer Days Seoul 2026 Hackathon. We thank NVIDIA for the oppurtunity and support. |