File size: 6,377 Bytes
dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 dc6091d be2edb8 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 |
---
library_name: diffusers
license: apache-2.0
base_model:
- neta-art/Neta-Lumina
tags:
- diffusers,
- text-to-image
---
# Neta Lumina v1.0 for diffusers library
[**Neta Lumina Tech Report**](https://neta.art/blog/neta_lumina/)
## 📽️ Flash Preview
<video controls autoplay loop muted playsinline style="max-width:100%; border-radius:8px;">
<source src="https://pages-r2.neta.art/Neta_Lumina_Flash_PV.webm" type="video/webm" />
Your browser does not support the video tag.
</video>
# Introduction
**Neta Lumina** is a high‑quality anime‑style image‑generation model developed by Neta.art Lab.
Building on the open‑source **Lumina‑Image‑2.0** released by the Alpha‑VLLM team at Shanghai AI Laboratory, we fine‑tuned the model with a vast corpus of high‑quality anime images and multilingual tag data. The preliminary result is a compelling model with powerful comprehension and interpretation abilities (thanks to Gemma text encoder), ideal for illustration, posters, storyboards, character design, and more.
## Key Features
- Optimized for diverse creative scenarios such as Furry, Guofeng (traditional‑Chinese aesthetics), pets, etc.
- Wide coverage of characters and styles, from popular to niche concepts. (Still support danbooru tags!)
- Accurate natural‑language understanding with excellent adherence to complex prompts.
- Native multilingual support, with Chinese, English, and Japanese recommended first.
## Model Versions
For models in alpha tests, requst access at https://huggingface.co/neta-art/NetaLumina_Alpha if you are interested. We will keep updating.
### neta-lumina-v1.0
- **Official Release**: overall best performance
### neta-lumina-beta-0624-raw (archived)
- **Primary Goal**: General knowledge and anime‑style optimization
- **Data Set**: >13 million anime‑style images
- **>46,000** A100 Hours
- Higher upper limit, suitable for pro users. Check [**Neta Lumina Prompt Book**](https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd) for better results.
### neta-lumina-beta-0624-aes-experimental (archived)
- First beta release candidate
- **Primary Goal**: Enhanced aesthetics, pose accuracy, and scene detail
- **Data Set**: Hundreds of thousands of handpicked high‑quality anime images (fine‑tuned on an older version of raw model)
- User-friendly, suitable for most people.
<br>
# How to Use
[Try it at Hugging Face playground](https://huggingface.co/spaces/neta-art/NetaLumina_T2I_Playground)
## Or use it with diffusers:
```python
import torch
from diffusers import Lumina2Pipeline
pipe = Lumina2Pipeline.from_pretrained("VirtualAddressExtension/Neta-Lumina-v1.0-diffusers", torch_dtype=torch.bfloat16)
pipe.enable_model_cpu_offload() #save some VRAM by offloading the model to CPU. Remove this if you have enough GPU power
prompt = "You are an assistant designed to generate anime images based on textual prompts. <Prompt Start> neta, @quasarcake, 1girl, solo, 1girl,solo,bangs,black hair,purple eyes,pink hair,purple hair,multicolored hair,virtual youtuber,hair bun,streaked hair,double bun, school uniform, white shirt, pleated skirt, gentle smile, looking at viewer, sitting, upper body, close-up, soft lighting, depth of field, cherry blossom background, warm lighting, best quality"
image = pipe(
prompt,
height=1024,
width=1024,
guidance_scale=4.0,
num_inference_steps=50,
cfg_trunc_ratio=0.25,
cfg_normalization=True,
generator=torch.Generator("cpu").manual_seed(0)
).images[0]
image.save("lumina_demo.png")
```
# Prompt Book
Detailed prompt guidelines: [**Neta Lumina Prompt Book**](https://neta.art/blog/neta_lumina_prompt_book/)
<br>
# Community
- Discord: https://discord.com/invite/TTTGccjbEa
- QQ group: 1039442542
<br>
# Roadmap
## Model
- Continous base‑model training to raise reasoning capability.
- Aesthetic‑dataset iteration to improve anatomy, background richness, and overall appealness.
- Smarter, more versatile tagging tools to lower the creative barrier.
## Ecosystem
- LoRA training tutorials and components
- Experienced users may already fine‑tune via Lumina‑Image‑2.0’s open code.
- Development of advanced control / style‑consistency features (e.g., [Omini Control](https://arxiv.org/pdf/2411.15098)). [**Call for Collaboration!**](https://discord.com/invite/TTTGccjbEa)
<br>
# License & Disclaimer
- Neta Lumina is released under [**Apache License 2.0**](https://www.apache.org/licenses/LICENSE-2.0)
<br>
# Participants & Contributors
- Special thanks to the **Alpha‑VLLM** team for open‑sourcing **Lumina‑Image‑2.0**
- **Model development**: **Neta.art Lab (Civitai)**
- Core Trainer: **li_li** [Civitai](https://civitai.com/user/li_li) ・ [Hugging Face](https://huggingface.co/heziiiii)
<br>
- **Partners**
- **nebulae**: [Civitai](https://civitai.com/user/kitarz) ・ [Hugging Face](https://huggingface.co/NebulaeWis)
- **生姜**: [Hugging Face](https://huggingface.co/ssj0021)
- **孙一**
- [**narugo1992**](https://github.com/narugo1992) & [**deepghs**](https://huggingface.co/deepghs): open datasets, processing tools, and models
- [**Naifu**](https://github.com/Mikubill/naifu) trainer at [Mikubill](https://github.com/Mikubill)
<br>
# Community Contributors
- **Evaluators & developers**: [二小姐](https://huggingface.co/Second222), [spawner](https://github.com/spawner1145), [Rnglg2](https://civitai.com/user/Rnglg2)
- **Other contributors**: [沉迷摸鱼](https://www.pixiv.net/users/22433944), [poi](https://x.com/poi______1), AshenWitch, [十分无奈](https://www.pixiv.net/users/15750592), [GHOSTLX](https://civitai.com/user/ghostlxh), [wenaka](https://civitai.com/user/Wenaka_), [iiiiii](https://civitai.com/user/Blueberries_i), [年糕特工队](https://x.com/gaonian2331), [恩匹希](https://civitai.com/user/NPCde), 奶冻, [mumu](https://civitai.com/user/mumu520), [yizyin](https://civitai.com/user/yizyin), smile, Yang, 古神, 灵之药, [LyloGummy](https://civitai.com/user/LyloGummy), 雪时
<br>
# Appendix & Resources
- **TeaCache**: https://github.com/spawner1145/CUI-Lumina2-TeaCache
- **Advanced samplers & TeaCache guide (by spawner)**: https://docs.qq.com/doc/DZEFKb1ZrZVZiUmxw?nlc=1
- **Neta Lumina ComfyUI Manual (in Chinese)**: https://docs.qq.com/doc/DZEVQZFdtaERPdXVh
|