File size: 3,259 Bytes
5ac846a
 
 
 
 
 
 
 
623684e
 
 
 
36d26ff
5ac846a
623684e
4648ccf
466bcb8
36d26ff
5ac846a
d61905d
5ac846a
 
 
 
 
 
 
 
 
 
 
d61905d
5ac846a
0159295
 
48c0b04
 
2b7c811
 
 
 
0159295
 
 
00d4878
 
 
 
 
 
 
 
 
 
 
 
36d26ff
00d4878
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
869da97
00d4878
 
 
 
 
 
0159295
 
36d26ff
1a84453
36d26ff
0159295
 
 
 
 
 
 
00d4878
 
763e31c
 
 
 
 
00d4878
 
 
0cd51bb
 
 
 
9dc034d
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
---
license: apache-2.0
language:
- en
base_model:
- nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16
---

<p align="center">
  <img src="preview-banner.png" alt="Nemotron Slide" width="100%">
</p>

# NemoSlides, a Nemotron Specialized in Slide Generation


**NemoSlides** is a post-trained hybrid architecture language model built on [NVIDIA-Nemotron-3-Nano-30B-A3B-BF16](https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16) by NVIDIA Corporation. It underwent supervised fine-tuning (SFT) using [Nemo Automodel](https://github.com/NVIDIA-NeMo/Automodel).

**NemoSlides** is purpose-built to generate high-quality, aesthetic slides from a single instruction.

---
## Model Summary

| Property | Value |
|---|---|
| **Base Model** | [NVIDIA-Nemotron-3-Nano-30B-A3B-BF16](https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16) |
| **Total Parameters** | 30B |
| **Active Parameters** | 3B |
| **Architecture** | Hybrid (Attention + SSM + MoE) |
| **Precision** | bf16 |
| **License** | Apache 2.0 |

---

## Evaluation Results

To evaluate the outcome we use [Gemini 3 Flash](https://deepmind.google/models/gemini/flash/) as a VLM judge. Our final model achieves a +48% improvement over the Nano baseline.

<p align="center">
  <img src="overall_bar.png" alt="Evaluation Result" width="100%">
</p>

--- 
## QuickStart

### Installation

```bash
pip install transformers torch
```

### Using Transformers

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

model_name = "trillionlabs/NemoSlides"

tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    trust_remote_code=True,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Create a 9-slide Slidev deck for Apex Materials Group's board of directors reviewing FY24 capital allocation and dividend policy."},
]

input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(input_ids, max_new_tokens=4096, do_sample=True, temperature=0.7)
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```

## Deployment

We recommend deploying the model with the lastest version of [vLLM](https://github.com/vllm-project/vllm).

```bash
wget https://huggingface.co/trillionlabs/NemoSlides/blob/main/nano_v3_reasoning_parser.py

vllm serve trillionlabs/NemoSlides \
  --tensor-parallel-size 1 \
  --port 8000 \
  --trust-remote-code \
  --enable-auto-tool-choice \
  --tool-call-parser qwen3_coder \
  --reasoning-parser-plugin nano_v3_reasoning_parser.py \
  --reasoning-parser nano_v3
```

---
## Rendering Slides

We use [Slidev](https://sli.dev/) to generate slides. Please check the official [repo](https://github.com/trillion-labs/nemoslides/tree/main/assets/renderer) to render the output into slide.

---
## License
This model is released under the Apache 2.0 License.

---
## Acknowledgement

This project is conducted as part of NVIDIA Nemotron Developer Days Seoul 2026 Hackathon. We thank NVIDIA for the oppurtunity and support.