Instructions to use SeeSee21/AniSee with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Diffusers
How to use SeeSee21/AniSee with Diffusers:
pip install -U diffusers transformers accelerate
import torch from diffusers import DiffusionPipeline # switch to "mps" for apple devices pipe = DiffusionPipeline.from_pretrained("SeeSee21/AniSee", dtype=torch.bfloat16, device_map="cuda") prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" image = pipe(prompt).images[0] - Notebooks
- Google Colab
- Kaggle
- Local Apps
- Draw Things
- DiffusionBee
File size: 15,793 Bytes
3a5953b | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 | ---
license: other
license_name: circlestone-labs-non-commercial
license_link: https://huggingface.co/circlestone-labs/Anima/blob/main/LICENSE
base_model: circlestone-labs/Anima
base_model_relation: finetune
pipeline_tag: text-to-image
library_name: diffusers
language:
- en
tags:
- text-to-image
- anime
- anima
- diffusion
- comfyui
- fine-tune
- safetensors
- anisee
---
<div align="center">
# π¨ AniSee
### Personal Anime Fine-Tune of Anima Preview3 Base
**Full Fine-Tune β’ Clean Anime Aesthetics β’ Tag + Natural Language β’ Anima-Compatible**
**Diffusion Model β’ 1 MP Native β’ LoRA-friendly**
<br>
<a href="https://civitai.red/models/2628747/anisee">
<img src="https://img.shields.io/badge/CivitAI-AniSee-EC4899?style=for-the-badge&logoColor=white" alt="CivitAI">
</a>
<a href="https://anisee.anisee.workers.dev/">
<img src="https://img.shields.io/badge/π¨_Sample_Gallery-Explore-D63384?style=for-the-badge" alt="Sample Gallery">
</a>
<a href="https://huggingface.co/circlestone-labs/Anima">
<img src="https://img.shields.io/badge/Base_Model-Anima_Preview3-FFD21E?style=for-the-badge&logo=huggingface&logoColor=black" alt="Base Model: Anima">
</a>
<a href="#-license">
<img src="https://img.shields.io/badge/License-Non--Commercial-A02060?style=for-the-badge" alt="License">
</a>
<br><br>
<img src="https://huggingface.co/SeeSee21/AniSee/resolve/main/images/cover.png" alt="AniSee Cover" width="100%">
</div>
---
## πΌοΈ Preview Gallery
Browse the full curated set of sample images on the dedicated gallery page:
<div align="center">
<a href="https://anisee.anisee.workers.dev/">
<img src="https://img.shields.io/badge/π¨_Open_Sample_Gallery-anisee.anisee.workers.dev-D63384?style=for-the-badge" alt="Sample Gallery">
</a>
</div>
| | | |
| :---: | :---: | :---: |
|  |  |  |
|  |  |  |
|  |  |  |
|  |  |  |
---
## β¨ What is AniSee?
**AniSee** is a personal full fine-tune of CircleStone Labs' **Anima Preview3 Base**, retrained on my own curated dataset to push the model further into a cleaner, more focused anime aesthetic.
It is **not a LoRA merge** β AniSee is a full fine-tune (~20K steps) with the **LLM adapter only very lightly co-trained**, following the official Anima fine-tuning guidelines. The goal is to keep everything that makes Anima a strong illustration base β Danbooru tags, natural language prompts, mixed prompts, full Qwen text encoder, Qwen-Image VAE β while shifting the default style toward a stronger anime look in line with my other checkpoints.
AniSee is mainly intended for:
- Anime-style illustrations
- Character-focused images
- Cleaner anime aesthetics
- Style experiments
- Testing Anima-based fine-tunes inside ComfyUI
This is the **first release** β only the Diffusion Model variant for now. If testing goes well, an **AIO** version and a **4-Step Turbo** version (based on the new **CDM β Continuous-Time Distribution Matching** distillation method) will follow.
---
## π― Key Features
- β
Full fine-tune on Anima Preview3 Base β **not a LoRA merge**
- β
~20K training steps on a curated anime dataset
- β
Clean, focused anime aesthetics
- β
Supports **Danbooru-style tags**, **natural language**, and **mixed prompts**
- β
Compatible with the standard Anima ComfyUI workflow
- β
Drop-in replacement for `anima-preview3-base.safetensors`
- β
Uses the existing **Qwen text encoder** + **Qwen-Image VAE** β included in the repo
- β
LoRA training friendly β same base architecture as Anima
---
## πΊοΈ AniSee Roadmap
### β
Released
#### π¨ AniSee Base
Full fine-tune of Anima Preview3 Base β Diffusion Model variant. This is the foundation of the AniSee family.
### π Planned
#### π¦ AniSee AIO
All-in-one checkpoint with **Diffusion Model + Qwen Text Encoder + Qwen-Image VAE** integrated into a single file. Single-file convenience, no extra loaders needed.
#### π AniSee Turbo (4-Step, CDM)
If testing of the Base goes well, a 4-Step Turbo variant distilled with the brand-new **Continuous-Time Distribution Matching (CDM)** method (Liu et al., 2026). CDM migrates the DMD framework from discrete anchoring to continuous optimization, achieving state-of-the-art few-step generation **without GAN or reward-model auxiliary objectives**. Should give clean 4-step anime generations with strong fine details.
π Paper: <https://byliutao.github.io/cdm_page/>
#### π§ Official AniSee ComfyUI Workflow
A dedicated workflow with the auto-prefix, optional Qwen3-VL prompt enhancer, LoRA support, and Ultimate SD Upscale is already included in this repo under `workflows/AniSee.json`.
More updates coming as testing progresses! π¨
---
## βοΈ AniSee Base β Recommended Settings
The settings I personally use and recommend as a starting point:
```yaml
Steps: 40
CFG: 4.5
Sampler: er_sde
Scheduler: simple
Resolution: ~1 MP # e.g. 1024Γ1024, 896Γ1152, 1152Γ896
```
**CFG Guide:** 4.0β5.0 is the sweet spot for balanced quality and creativity. Going above 5.0 starts to risk burning the image, especially with heavy quality tags. If results feel too harsh, drop CFG slightly or reduce quality tag count.
**Sampler alternatives** (all work well, just different character):
| Sampler / Scheduler | Character |
| --- | --- |
| `er_sde` + `simple` *(default)* | Neutral style, flat colors, sharp lines |
| `euler_a` | Softer, thinner lines, slightly more 2.5D feel, tolerates higher CFG |
| `dpmpp_2m_sde_gpu` | Similar to er_sde but more "creative", can get wild on short prompts |
Feel free to experiment β these are just starting points, not hard rules.
---
## π Resolution Guide
| Use Case | Resolution |
| --- | --- |
| β Square / General purpose | **1024 Γ 1024** |
| Portrait / Character art | **896 Γ 1152** |
| Landscape / Scenes | **1152 Γ 896** |
| Wider cinematic | **1254 Γ 836** |
| Widescreen | **1365 Γ 768** |
Stay around **1 MP** for the cleanest results. The Anima base starts breaking down somewhere around 2 MP, so if you want bigger images, generate at 1 MP first and upscale afterwards.
---
## π‘ Prompting Guide
AniSee inherits Anima's prompting system. It accepts:
- Danbooru / anime-style tags
- Natural language prompts
- Mixed prompts (tags + sentences)
A good prompt structure:
```
[quality tags] [meta tags] [safety tag] [subject (1girl/1boy/etc)]
[character] [appearance] [pose] [clothing] [background] [lighting] [style]
```
**Important tag rules** (inherited from Anima):
- Use **lowercase** for tags, spaces instead of underscores
- **Score tags** are the only tags that use underscores (`score_7`, etc.)
- **Artist tags** must be prefixed with `@` β e.g. `@artistname`
### β
Good (mixed prompt)
```
masterpiece, best quality, score_7, highres, illustration, safe, 1girl,
long silver hair, blue eyes, black hoodie, standing in a rainy city street
at night, neon lights reflecting on wet asphalt, cinematic lighting,
detailed anime illustration
```
### β
Good (natural language)
```
masterpiece, best quality, score_7, highres, illustration.
A young anime girl with long silver hair and golden eyes, wearing a
traditional shrine maiden outfit with white haori and red hakama.
She stands in a sunlit bamboo forest, cherry blossoms falling softly
around her. Warm afternoon light filtering through the trees,
detailed fabric shading, calm serene expression.
```
### β Avoid
Very short tag dumps like `anime girl, silver hair, hoodie` β the model can produce unexpected results when the prompt is too sparse. Aim for at least a few descriptive tags or 2+ sentences.
---
## β Recommended Positive Prefix
Start every prompt with:
```
masterpiece, best quality, score_7, highres, illustration,
```
Then add your subject, character, scene, and style tags after that.
You can also experiment with other quality tag combinations:
- `masterpiece, best quality, score_7, safe` *(Anima default)*
- `masterpiece, best quality, score_8, highres, official art`
- `score_9, masterpiece, absurdres, anime screenshot`
But the prefix above is what I personally use and recommend as a starting point.
---
## β Recommended Negative Prompt
This is the negative I run with β it cleans up most common issues without being so aggressive that it kills the style:
```
worst quality, low quality, score_1, score_2, score_3, artist name,
(lowres:1.2), (worst quality:1.4), (low quality:1.4), (bad anatomy:1.4),
bad hands, multiple views, comic, jpeg artifacts, patreon logo,
patreon username, web address, signature, watermark, artist name,
censored, mosaic censoring
```
If your images come out too flat or lose style, reduce the weights on the heavier terms (e.g. drop `(low quality:1.4)` back to `low quality`).
---
## π‘οΈ Safety Tags
Inherited from Anima. Use one of these in the positive prompt:
- `safe` β for normal generations *(recommended default)*
- `sensitive`
- `nsfw`
- `explicit`
---
## π§ Installation
### Step 1 β Download the files
You need three files (all included in this repo):
- `anisee.safetensors` β the model
- `text_encoders/qwen_3_06b_base.safetensors` β text encoder
- `vae/qwen_image_vae.safetensors` β VAE
### Step 2 β Place the files
```
ComfyUI/models/diffusion_models/
βββ anisee.safetensors
ComfyUI/models/text_encoders/
βββ qwen_3_06b_base.safetensors
ComfyUI/models/vae/
βββ qwen_image_vae.safetensors
```
If you already run **Anima Preview3 Base**, you already have the text encoder and VAE β AniSee is a direct drop-in.
### Step 3 β Load in ComfyUI
Use the standard Anima workflow, or the official AniSee workflow from `workflows/anisee-workflow-SDUltimate.json`:
- **Load Diffusion Model** β `anisee.safetensors`
- **Load Text Encoder** β `qwen_3_06b_base.safetensors`
- **Load VAE** β `qwen_image_vae.safetensors`
Then your usual sampler, encode, decode, save chain.
---
## π§© Official Workflow
<div align="center">
<img src="https://huggingface.co/SeeSee21/AniSee/resolve/main/images/anisee-workflow-cover.png" alt="AniSee Workflow" width="100%">
</div>
A ready-to-use ComfyUI workflow is included at [`workflows/anisee-workflow-SDUltimate.json`](./workflows/anisee-workflow-SDUltimate.json).
It features:
- π¦ Model + Text Encoder + VAE loaders pre-configured
- π **Auto Quality Prefix** β no need to type `masterpiece, best quality, score_7, ...` yourself
- π² **Optional Qwen3-VL Prompt Enhancer** β converts short one-liners into full Danbooru tag lists
- π Optional **LoRA** stack via Lora Manager (one-click toggle)
- πΌ Optional **UltimateSDUpscale** 2Γ with side-by-side compare
- π¨ Pre-configured with `er_sde` / `simple` / 40 steps / CFG 4.5
- β Pre-loaded recommended negative prompt
- π Built-in MarkdownNote with all settings + quick reference
**Required custom nodes** (all installable via ComfyUI Manager):
- [ComfyUI-Easy-Use](https://github.com/yolain/ComfyUI-Easy-Use)
- [ComfyUI_UltimateSDUpscale](https://github.com/ssitu/ComfyUI_UltimateSDUpscale)
- [ComfyUI-Lora-Manager](https://github.com/willmiao/ComfyUI-Lora-Manager)
- [ComfyUI-QwenVL](https://github.com/1038lab/ComfyUI-QwenVL)
- [rgthree-comfy](https://github.com/rgthree/rgthree-comfy)
For the optional 2Γ upscaler, also place `4x-UltraSharp.pth` in `ComfyUI/models/upscale_models/`:
- [OpenModelDB β 4x-UltraSharp](https://openmodeldb.info/models/4x-UltraSharp)
- [HuggingFace β Kim2091/UltraSharp](https://huggingface.co/Kim2091/UltraSharp)
---
## π Repository Structure
```
AniSee/
βββ README.md
βββ config.json
β
βββ anisee.safetensors # the model (~4.18 GB)
β
βββ text_encoders/
β βββ qwen_3_06b_base.safetensors # text encoder (same as Anima)
β
βββ vae/
β βββ qwen_image_vae.safetensors # VAE (same as Anima)
β
βββ images/
β βββ cover.png # social preview / model cover
β βββ anisee-workflow-cover.png # workflow preview image
β βββ 1.png 2.png 3.png 4.png
β βββ 5.png 6.webp 7.webp 8.png
β βββ 9.png 10.png 11.webp 12.webp
β βββ ...
β
βββ workflows/
βββ anisee-workflow-SDUltimate.json
```
---
## π Version History
### v1.0 β Initial Release
- **AniSee Base** β full fine-tune of Anima Preview3 Base
- ~20K training steps on a curated anime dataset
- LLM adapter only very lightly co-trained *(following Anima's fine-tuning guidelines)*
- Diffusion Model variant *(single `.safetensors` file)*
- Compatible with the standard Anima ComfyUI workflow
- Drop-in replacement for `anima-preview3-base.safetensors`
- Includes the official ComfyUI workflow with auto quality prefix + Qwen3-VL prompt enhancer
---
## π Links
- **CivitAI Page:** [civitai.red/models/2628747/anisee](https://civitai.red/models/2628747/anisee)
- **Example Gallery:** [anisee.anisee.workers.dev](https://anisee.anisee.workers.dev/)
- **Base Model:** [circlestone-labs/Anima](https://huggingface.co/circlestone-labs/Anima)
- **CDM Paper** *(planned Turbo variant)*: [byliutao.github.io/cdm_page](https://byliutao.github.io/cdm_page/)
- **Author:** [SeeSee21 on Hugging Face](https://huggingface.co/SeeSee21)
---
## π Credits
- **Base Model:** [Anima Preview3 Base](https://huggingface.co/circlestone-labs/Anima) by **CircleStone Labs** and **Comfy Org**
- **Underlying Architecture:** Built on NVIDIA Cosmos-Predict2-2B *(Anima is a "Derivative Model")*
- **Fine-Tune:** SeeSee21
- **CDM Distillation Method** *(planned Turbo variant)*: [Continuous-Time Distribution Matching for Few-Step Diffusion Distillation](https://byliutao.github.io/cdm_page/) β Liu et al., 2026
- **Workflow Custom Nodes:** yolain, ssitu, Will Miao, AILab (1038lab), rgthree
---
## π License
AniSee inherits the **CircleStone Labs Non-Commercial License** from Anima. The model and derivatives are usable **only for non-commercial purposes**. As a derivative of Cosmos-Predict2-2B-Text2Image, the [NVIDIA Open Model License Agreement](https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license/) also applies insofar as it covers Derivative Models.
For commercial licensing of the base model, please contact CircleStone Labs at `tdrussell@circlestone.ai`.
---
## β€οΈ Notes
AniSee is a personal anime-focused fine-tune of Anima Preview3 Base, built to bring a stronger anime look and visual direction in line with my other checkpoints.
It is still in active testing β the AIO and 4-Step Turbo (CDM) variants will follow once the Base has been validated in the wild.
**AniSee β clean anime, built on Anima. π¨**
|