---
library_name: diffusers
license: apache-2.0
base_model:
- neta-art/Neta-Lumina
tags:
- diffusers
- text-to-image
---

# Neta Lumina v1.0 for diffusers library

[**Neta Lumina Tech Report**](https://neta.art/blog/neta_lumina/)

## 📽️ Flash Preview

<video controls autoplay loop muted playsinline style="max-width:100%; border-radius:8px;">
  <source src="https://pages-r2.neta.art/Neta_Lumina_Flash_PV.webm" type="video/webm" />
  Your browser does not support the video tag.
</video>

# Introduction

**Neta Lumina** is a high‑quality anime‑style image‑generation model developed by Neta.art Lab.  
Building on the open‑source **Lumina‑Image‑2.0** released by the Alpha‑VLLM team at Shanghai AI Laboratory, we fine‑tuned the model on a large corpus of high‑quality anime images and multilingual tag data. The preliminary result is a model with strong prompt comprehension and interpretation abilities (thanks to the Gemma text encoder), well suited to illustration, posters, storyboards, character design, and more.

## Key Features

- Optimized for diverse creative scenarios such as Furry, Guofeng (traditional‑Chinese aesthetics), pets, etc.
- Wide coverage of characters and styles, from popular to niche concepts. (Danbooru tags are still supported!)
- Accurate natural‑language understanding with excellent adherence to complex prompts.
- Native multilingual support, with Chinese, English, and Japanese recommended first.

## Model Versions

For models in alpha testing, request access at https://huggingface.co/neta-art/NetaLumina_Alpha if you are interested. We will keep updating them.

### neta-lumina-v1.0

- **Official Release**: overall best performance


### neta-lumina-beta-0624-raw (archived)

- **Primary Goal**: General knowledge and anime‑style optimization  
- **Data Set**: >13 million anime‑style images
- **>46,000** A100 Hours
- Higher quality ceiling, suited to experienced users. Check the [**Neta Lumina Prompt Book**](https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd) for better results.

### neta-lumina-beta-0624-aes-experimental (archived)

- First beta release candidate  
- **Primary Goal**: Enhanced aesthetics, pose accuracy, and scene detail  
- **Data Set**: Hundreds of thousands of handpicked high‑quality anime images (fine‑tuned on an older version of the raw model)
- User-friendly, suitable for most people.
<br>

# How to Use

[Try it at Hugging Face playground](https://huggingface.co/spaces/neta-art/NetaLumina_T2I_Playground)

## Or use it with diffusers:
```python
import torch
from diffusers import Lumina2Pipeline

pipe = Lumina2Pipeline.from_pretrained("VirtualAddressExtension/Neta-Lumina-v1.0-diffusers", torch_dtype=torch.bfloat16)
pipe.enable_model_cpu_offload()  # Saves VRAM by offloading idle model components to the CPU; remove this line if your GPU has enough memory

prompt = "You are an assistant designed to generate anime images based on textual prompts. <Prompt Start> neta, @quasarcake, 1girl, solo, 1girl,solo,bangs,black hair,purple eyes,pink hair,purple hair,multicolored hair,virtual youtuber,hair bun,streaked hair,double bun, school uniform, white shirt, pleated skirt, gentle smile, looking at viewer, sitting, upper body, close-up, soft lighting, depth of field, cherry blossom background, warm lighting, best quality"
image = pipe(
    prompt,
    height=1024,
    width=1024,
    guidance_scale=4.0,
    num_inference_steps=50,
    cfg_trunc_ratio=0.25,
    cfg_normalization=True,
    generator=torch.Generator("cpu").manual_seed(0)
).images[0]
image.save("lumina_demo.png")
```
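
The prompt in the example above has two parts: a fixed system prefix ending in `<Prompt Start>`, followed by comma-separated tags. The sketch below shows one way to assemble such a prompt from a list of tags. `SYSTEM_PREFIX` and `build_prompt` are hypothetical names introduced here for illustration, not part of diffusers, and treating the prefix as something to reuse verbatim is an assumption based on that single example. Dropping repeated tags is an optional convenience, not a documented requirement.

```python
# Prefix copied from the example prompt above; reusing it verbatim is an assumption.
SYSTEM_PREFIX = (
    "You are an assistant designed to generate anime images "
    "based on textual prompts. <Prompt Start> "
)


def build_prompt(tags, prefix=SYSTEM_PREFIX):
    """Join tags into one prompt string, skipping blank and repeated tags."""
    seen = set()
    cleaned = []
    for tag in tags:
        tag = tag.strip()
        key = tag.lower()  # compare case-insensitively, keep the original spelling
        if tag and key not in seen:
            seen.add(key)
            cleaned.append(tag)
    return prefix + ", ".join(cleaned)


prompt = build_prompt(
    ["neta", "@quasarcake", "1girl", "solo", "1girl", "solo ", "best quality"]
)
print(prompt)
```

The resulting string can be passed as the first argument to `pipe(...)` in place of the hand-written prompt.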

# Prompt Book

Detailed prompt guidelines: [**Neta Lumina Prompt Book**](https://neta.art/blog/neta_lumina_prompt_book/)
<br>

# Community

- Discord: https://discord.com/invite/TTTGccjbEa  
- QQ group: 1039442542
<br>

# Roadmap

## Model

- Continuous base‑model training to raise reasoning capability.
- Aesthetic‑dataset iteration to improve anatomy, background richness, and overall appeal.
- Smarter, more versatile tagging tools to lower the creative barrier.

## Ecosystem

- LoRA training tutorials and components  
  - Experienced users can already fine‑tune using Lumina‑Image‑2.0’s open code.
- Development of advanced control / style‑consistency features (e.g., [Omini Control](https://arxiv.org/pdf/2411.15098)). [**Call for Collaboration!**](https://discord.com/invite/TTTGccjbEa)
<br>

# License & Disclaimer

- Neta Lumina is released under [**Apache License 2.0**](https://www.apache.org/licenses/LICENSE-2.0)
<br>

# Participants & Contributors

- Special thanks to the **Alpha‑VLLM** team for open‑sourcing **Lumina‑Image‑2.0**
- **Model development**: **Neta.art Lab (Civitai)**
  - Core Trainer:  **li_li** [Civitai](https://civitai.com/user/li_li) ・ [Hugging Face](https://huggingface.co/heziiiii)
<br>
- **Partners**  
  - **nebulae**: [Civitai](https://civitai.com/user/kitarz) ・ [Hugging Face](https://huggingface.co/NebulaeWis)
  - **生姜**: [Hugging Face](https://huggingface.co/ssj0021)
  - **孙一**
- [**narugo1992**](https://github.com/narugo1992) & [**deepghs**](https://huggingface.co/deepghs): open datasets, processing tools, and models
- [**Naifu**](https://github.com/Mikubill/naifu) trainer at [Mikubill](https://github.com/Mikubill)
<br>

# Community Contributors

- **Evaluators & developers**: [二小姐](https://huggingface.co/Second222), [spawner](https://github.com/spawner1145), [Rnglg2](https://civitai.com/user/Rnglg2)  
- **Other contributors**: [沉迷摸鱼](https://www.pixiv.net/users/22433944), [poi](https://x.com/poi______1), AshenWitch, [十分无奈](https://www.pixiv.net/users/15750592), [GHOSTLX](https://civitai.com/user/ghostlxh), [wenaka](https://civitai.com/user/Wenaka_), [iiiiii](https://civitai.com/user/Blueberries_i), [年糕特工队](https://x.com/gaonian2331), [恩匹希](https://civitai.com/user/NPCde), 奶冻, [mumu](https://civitai.com/user/mumu520), [yizyin](https://civitai.com/user/yizyin), smile, Yang, 古神, 灵之药, [LyloGummy](https://civitai.com/user/LyloGummy), 雪时
<br>

# Appendix & Resources

- **TeaCache**: https://github.com/spawner1145/CUI-Lumina2-TeaCache
- **Advanced samplers & TeaCache guide (by spawner)**: https://docs.qq.com/doc/DZEFKb1ZrZVZiUmxw?nlc=1  
- **Neta Lumina ComfyUI Manual (in Chinese)**: https://docs.qq.com/doc/DZEVQZFdtaERPdXVh