Update README.md

be2edb8 verified 6 months ago

6.38 kB

	---
	library_name: diffusers
	license: apache-2.0
	base_model:
	- neta-art/Neta-Lumina
	tags:
	- diffusers,
	- text-to-image
	---

	# Neta Lumina v1.0 for diffusers library

	[Neta Lumina Tech Report](https://neta.art/blog/neta_lumina/)

	## 📽️ Flash Preview

	<video controls autoplay loop muted playsinline style="max-width:100%; border-radius:8px;">
	<source src="https://pages-r2.neta.art/Neta_Lumina_Flash_PV.webm" type="video/webm" />
	Your browser does not support the video tag.
	</video>

	# Introduction

	Neta Lumina is a high‑quality anime‑style image‑generation model developed by Neta.art Lab.
	Building on the open‑source Lumina‑Image‑2.0 released by the Alpha‑VLLM team at Shanghai AI Laboratory, we fine‑tuned the model with a vast corpus of high‑quality anime images and multilingual tag data. The preliminary result is a compelling model with powerful comprehension and interpretation abilities (thanks to Gemma text encoder), ideal for illustration, posters, storyboards, character design, and more.

	## Key Features

	- Optimized for diverse creative scenarios such as Furry, Guofeng (traditional‑Chinese aesthetics), pets, etc.
	- Wide coverage of characters and styles, from popular to niche concepts. (Still support danbooru tags!)
	- Accurate natural‑language understanding with excellent adherence to complex prompts.
	- Native multilingual support, with Chinese, English, and Japanese recommended first.

	## Model Versions

	For models in alpha tests, requst access at https://huggingface.co/neta-art/NetaLumina_Alpha if you are interested. We will keep updating.

	### neta-lumina-v1.0

	- Official Release: overall best performance


	### neta-lumina-beta-0624-raw (archived)

	- Primary Goal: General knowledge and anime‑style optimization
	- Data Set: >13 million anime‑style images
	- >46,000 A100 Hours
	- Higher upper limit, suitable for pro users. Check [Neta Lumina Prompt Book](https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd) for better results.

	### neta-lumina-beta-0624-aes-experimental (archived)

	- First beta release candidate
	- Primary Goal: Enhanced aesthetics, pose accuracy, and scene detail
	- Data Set: Hundreds of thousands of handpicked high‑quality anime images (fine‑tuned on an older version of raw model)
	- User-friendly, suitable for most people.
	<br>

	# How  to  Use

	[Try it at Hugging Face playground](https://huggingface.co/spaces/neta-art/NetaLumina_T2I_Playground)

	## Or use it with diffusers:
	```python
	import torch
	from diffusers import Lumina2Pipeline

	pipe = Lumina2Pipeline.from_pretrained("VirtualAddressExtension/Neta-Lumina-v1.0-diffusers", torch_dtype=torch.bfloat16)
	pipe.enable_model_cpu_offload() #save some VRAM by offloading the model to CPU. Remove this if you have enough GPU power

	prompt = "You are an assistant designed to generate anime images based on textual prompts. <Prompt Start> neta, @quasarcake, 1girl, solo, 1girl,solo,bangs,black hair,purple eyes,pink hair,purple hair,multicolored hair,virtual youtuber,hair bun,streaked hair,double bun, school uniform, white shirt, pleated skirt, gentle smile, looking at viewer, sitting, upper body, close-up, soft lighting, depth of field, cherry blossom background, warm lighting, best quality"
	image = pipe(
	prompt,
	height=1024,
	width=1024,
	guidance_scale=4.0,
	num_inference_steps=50,
	cfg_trunc_ratio=0.25,
	cfg_normalization=True,
	generator=torch.Generator("cpu").manual_seed(0)
	).images[0]
	image.save("lumina_demo.png")
	```

	# Prompt Book

	Detailed prompt guidelines: [Neta Lumina Prompt Book](https://neta.art/blog/neta_lumina_prompt_book/)
	<br>

	# Community

	- Discord: https://discord.com/invite/TTTGccjbEa
	- QQ group: 1039442542
	<br>

	# Roadmap

	## Model

	- Continous base‑model training to raise reasoning capability.
	- Aesthetic‑dataset iteration to improve anatomy, background richness, and overall appealness.
	- Smarter, more versatile tagging tools to lower the creative barrier.

	## Ecosystem

	- LoRA training tutorials and components
	- Experienced users may already fine‑tune via Lumina‑Image‑2.0’s open code.
	- Development of advanced control / style‑consistency features (e.g., [Omini Control](https://arxiv.org/pdf/2411.15098)). [Call for Collaboration!](https://discord.com/invite/TTTGccjbEa)
	<br>

	# License & Disclaimer

	- Neta Lumina is released under [Apache License 2.0](https://www.apache.org/licenses/LICENSE-2.0)
	<br>

	# Participants & Contributors

	- Special thanks to the Alpha‑VLLM team for open‑sourcing Lumina‑Image‑2.0
	- Model development: Neta.art Lab (Civitai)
	- Core Trainer: li_li [Civitai](https://civitai.com/user/li_li) ・ [Hugging Face](https://huggingface.co/heziiiii)
	<br>
	- Partners
	- nebulae: [Civitai](https://civitai.com/user/kitarz) ・ [Hugging Face](https://huggingface.co/NebulaeWis)
	- 生姜: [Hugging Face](https://huggingface.co/ssj0021)
	- 孙一
	- [narugo1992](https://github.com/narugo1992) & [deepghs](https://huggingface.co/deepghs): open datasets, processing tools, and models
	- [Naifu](https://github.com/Mikubill/naifu) trainer at [Mikubill](https://github.com/Mikubill)
	<br>

	# Community Contributors

	- Evaluators & developers: [二小姐](https://huggingface.co/Second222), [spawner](https://github.com/spawner1145), [Rnglg2](https://civitai.com/user/Rnglg2)
	- Other contributors: [沉迷摸鱼](https://www.pixiv.net/users/22433944), [poi](https://x.com/poi______1), AshenWitch, [十分无奈](https://www.pixiv.net/users/15750592), [GHOSTLX](https://civitai.com/user/ghostlxh), [wenaka](https://civitai.com/user/Wenaka_), [iiiiii](https://civitai.com/user/Blueberries_i), [年糕特工队](https://x.com/gaonian2331), [恩匹希](https://civitai.com/user/NPCde), 奶冻, [mumu](https://civitai.com/user/mumu520), [yizyin](https://civitai.com/user/yizyin), smile, Yang, 古神, 灵之药, [LyloGummy](https://civitai.com/user/LyloGummy), 雪时
	<br>

	# Appendix & Resources

	- TeaCache: https://github.com/spawner1145/CUI-Lumina2-TeaCache
	- Advanced samplers & TeaCache guide (by spawner): https://docs.qq.com/doc/DZEFKb1ZrZVZiUmxw?nlc=1
	- Neta Lumina ComfyUI Manual (in Chinese): https://docs.qq.com/doc/DZEVQZFdtaERPdXVh