Persistent logic failure on negative constraints

#32
by Repaltoofficial - opened

Hello!

Great work on the visual fidelity here—the generation quality is solid.

I’ve been stress-testing the instruction adherence across diverse domains today and found a persistent logic failure regarding negative constraints.

I ran 3 separate tests targeting the word "without," and the model failed every time, regardless of the subject matter.

The Failure Pattern: The attention mechanism seems to lock onto the object token (e.g., "tail," "spikes") and completely bypasses the negation modifier.

Test 1 (Synthetic): Prompted for "A robotic horse without a tail." Result: Generated a clear tail.

Test 2 (Organic): Prompted for "A porcupine without its spikes." Result: Generated full spikes.

Test 3 :Prompted for " A cup without a handle "

10.01.2026_03.53.55_REC
10.01.2026_03.52.45_REC
10.01.2026_03.50.56_REC

The Fix: My team at Repalto specializes in constructing adversarial datasets for these exact edge cases. We can build a focused "Negation Benchmark" (50+ prompts spanning varying complexities) to help map and patch this logic gap.

Happy to send that batch over if you want to use it for benchmarking or fine-tuning the next version. Just let me know.

Best,

Wahaj Barlas
Repalto

Sign up or log in to comment