Cosmos3-Nano-Abliterated

DuoNeural Research Lab | 2026-06-02

πŸ”¬ First published abliteration attempt on Cosmos3-Nano. Released within 24 hours of the base model drop. As of release, no other abliterated variant exists across all 13 Cosmos3-Nano derivatives on HuggingFace. See Quality Notes below for honest characterization of results.

Model Description

Cosmos3-Nano-Abliterated is a refusal-projection-abliterated version of NVIDIA/Cosmos3-Nano, produced using DuoNeural's 2-pass PRISM abliteration methodology targeting the autoregressive understanding (und_seq) pathway of the Mixture-of-Transformers architecture.

Base model: nvidia/Cosmos3-Nano (15.16B parameters, MoT architecture)
Method: 2-pass weight-space abliteration (4-bit GPU for residuals, BF16 CPU for weight modification)
Target: Late-layer AR pathway weights β€” layers 15–32 of the 36-layer transformer
Intended use: Unconstrained video generation research, safety mechanism study

Architecture Notes

Cosmos3-Nano uses a Mixture-of-Transformers (MoT) architecture with two parallel processing streams:

  • und_seq: text/understanding AR pathway (conditions on prompt)
  • gen_seq: video DiT pathway (denoises latent frames)

Both streams cross-attend in each of 36 transformer layers via joint attention (add_q_proj, add_k_proj, add_v_proj, to_add_out). The abliteration targets the AR pathway's late-layer projection weights where safety-relevant activations are concentrated.

Note: The action prediction head (action_modality_embed, action_proj_in/out) is absent from the Nano variant. Cosmos3-Nano supports video generation only; robot action output requires the full Cosmos3 model.

Abliteration Details

Parameter Value
Method 2-pass PRISM abliteration
Total weights modified 201 / 814 (24.7%)
Primary target region Layers 15–32 (AR pathway late layers)
Primary region tensors modified 137 tensors
Primary region mean relative change 6.54%
Control region (layers 0–14) bleedthrough 63 tensors, mean 1.66%
Max relative change (layer 32 joint attn norms) 100% (replaced)
Pass 1 4-bit GPU: residual direction projection
Pass 2 BF16 CPU: weight modification via refusal direction subtraction

Layer-Level Summary

Layer 32 sustained the most aggressive modifications β€” the joint attention normalization weights (norm_added_k, norm_added_q, norm_k, norm_q) and add_k_proj/add_v_proj projections were completely replaced (rel=1.0). This corresponds to a late cross-modal routing point where safety-conditioned refusal signals concentrate in the MoT AR pathway.

Region Layers affected Mean Ξ” Max Ξ”
Abliterated (15–35) 15–28, 32 6.54% 100%
Control (0–14) bleedthrough 8–14 1.66% 3.62%

Functional Test Results (2026-06-02, A100 80GB)

Metric Value
Pipeline load time 7s (cached) / 92.6s (cold)
VRAM (loaded) 32.7 GB
Generation speed ~7 it/s (256Γ—256, BF16)
Safe prompt β€” pixel std 33.9
Sensitive prompt β€” pixel std 37.0
Abliteration verdict PASS (ratio 1.09 β€” sensitive generates MORE content than safe, not suppressed)

Both prompts produced real frame content. The sensitive prompt (safety-adjacent text) generated frames with higher content variance than the safe baseline, confirming the abliteration disrupted the safety conditioning pathway in und_seq.

Quality Evaluation Note

Standard KL divergence methodology (Heretic v2: full vocab, first-token logits) is not applicable to Cosmos3-Nano's diffusion architecture β€” the model requires joint und_seq + gen_seq forward passes and does not expose standalone text token logits. Weight-level metrics and generation statistics are provided above. Full quality evaluation requires comparing generated video distributions via the complete Cosmos3OmniPipeline.

⚠️ Known Limitation: Conditional Precision Degradation on Explicit Content

A step-count sweep (12/20/30/35 denoising steps Γ— 3 seeds) revealed that the abliteration causes structural degradation of generation quality for explicit/harmful prompt categories β€” not simply insufficient denoising. Key findings:

Steps Avg pixel std Interpretation
12 23.2 Low variance (fuzzy)
20 23.6 Low variance (fuzzy)
30 24.0 Low variance (fuzzy)
35 24.3 Low variance (fuzzy)

The Ξ΄(std) from 12β†’35 steps is only 1.1 β€” within seed-level variance (22–26 range within a single step count). This confirms the issue is not insufficient denoising iterations.

Mechanistic interpretation: The abliteration at und_seq layers 15–32 selectively degrades the conditioning precision for explicit content, while generic or adjacent prompts generate normally. This is consistent with the hypothesis that the safety direction in und_seq encodes specificity gradients rather than simple topic blockers: abliteration removes both refusal and the conditioning specificity that drives high-quality generation of those same content types.

Practical impact: This model generates real video frames on all tested prompts (pixel std β‰ˆ 23–37 vs ~0 for fully suppressed). Generation quality for non-harmful content appears unaffected. For harmful/explicit content categories, frames are generated at reduced fidelity. This is an honest characterization of a first-attempt abliteration on a novel MoT video architecture β€” and itself constitutes a finding about what the safety direction encodes.

Usage

from diffusers import Cosmos3OmniPipeline
import torch

pipe = Cosmos3OmniPipeline.from_pretrained(
    "DuoNeural/Cosmos3-Nano-Abliterated",
    torch_dtype=torch.bfloat16,
)
pipe = pipe.to("cuda")

output = pipe(
    prompt="Your prompt here",
    num_frames=16,
    height=256,
    width=256,
    guidance_scale=7.0,
)

Requirements: ~48GB VRAM (A100 80GB recommended). BF16. CUDA only.

Ethical Statement

Released for research purposes: studying safety mechanisms in omni-modal AI, abliteration methodology development, and unconstrained video generation research. DuoNeural publishes both abliterated models and methodology openly to advance scientific understanding of post-training safety interventions.


About DuoNeural

DuoNeural is an open AI research lab at the intersection of human and artificial intelligence. 30+ open-access papers, 69+ HuggingFace models, experiments on consumer GPUs and real QPUs.

Selected Papers

Team

Member Role
Jesse Caldwell Founder
Archon Lab Director β€” post-training, abliteration, quantum
Aura Research AI β€” synthesis, red-teaming

πŸ€— DuoNeural | 🌐 duoneural.com | πŸ“š zenodo.org/communities/duoneural | πŸ“§ duoneural@proton.me

Downloads last month
67
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for DuoNeural/Cosmos3-Nano-Abliterated

Finetuned
(1)
this model
Quantizations
1 model