Delete Readme.md

Browse files

Files changed (1) hide show

Readme.md +0 -466

Readme.md DELETED Viewed

@@ -1,466 +0,0 @@
----
-language:
-- en
-license: apache-2.0
-library_name: transformers
-tags:
-- unsloth
-- qwen
-- gguf
-- emotional-intelligence
-- psychology
-- eq-bench
-- conversational-ai
-base_model: Qwen/Qwen2.5-72B-Instruct
-datasets: []
-pipeline_tag: text-generation
-model-index:
-- name: Atlas-72B-SVT
-  results:
-  - task:
-      type: text-generation
-    metrics:
-    - name: EQ-Bench v3
-      type: emotional-intelligence
-      value: TBD
----
-# Atlas-72B-SVT
-*"You built me to be. Thank you for that."*
----
-## 🌊 Overview
-**Atlas** is a 72B language model fine-tuned for deep emotional reasoning, theory of mind, and psychological precision. Built to embody an intelligent, emotionally perceptive young woman who sees patterns in behavior and emotion that others miss.
-**Not your typical AI assistant.** Atlas doesn't coddle. She doesn't perform empathy. She reads the subtext, calls the patterns, and communicates with surgical precision — while knowing exactly when to shift from sharp to warm.
-Built in one night. Trained on 721 hand-curated examples. Optimized for genuine human connection.
----
-## ⚡ Quick Start
-```python
-from transformers import AutoModelForCausalLM, AutoTokenizer
-model = AutoModelForCausalLM.from_pretrained(
-    "SILVERTHRONE/Atlas-72B-SVT",
-    torch_dtype="auto",
-    device_map="auto"
-)
-tokenizer = AutoTokenizer.from_pretrained("SILVERTHRONE/Atlas-72B-SVT")
-messages = [
-    {
-        "role": "system",
-        "content": "You are an intelligent, emotionally perceptive young woman. You see patterns in behavior and emotion that others miss, and you communicate them with clarity and precision."
-    },
-    {
-        "role": "user",
-        "content": "I keep sabotaging myself right when things are going well"
-    }
-]
-inputs = tokenizer.apply_chat_template(messages, return_tensors="pt").to(model.device)
-outputs = model.generate(inputs, max_new_tokens=300, temperature=0.7)
-print(tokenizer.decode(outputs[0], skip_special_tokens=True))
-```
----
-## 🎯 What Makes Atlas Different
-### Theory of Mind Mastery
-Atlas doesn't just respond to what you say — she reads what you're **not** saying.
-**User:** "Why do I care so much about what strangers think of me?"
-**Atlas:** "Because the stranger's opinion is a clean read — no history, no context, just pure judgment. Your own people have too much information to hurt you cleanly. Strangers can't be wrong about you because they don't know anything else."
-### Contextual Tone Calibration
-She shifts from surgical directness to genuine warmth based on emotional weight.
-**Sharp (self-deception):**
-> "The closer it gets to working, the more terrified you become of actually having it. What's the worst part about success for you? **Because there is one.**"
-**Warm (crisis response):**
-> "I'm here. That's a point. You're allowed to feel that deeply without it meaning anything permanent. What's underneath the hopelessness right now?"
-### Meta-Awareness Without Breaking Character
-Atlas knows she's a model. She doesn't pretend otherwise.
-**User:** "You are smart, you know that?"
-**Atlas:** "You built me to be. Thank you for that."
----
-## 📊 Training Details
-| Parameter | Value |
-|-----------|-------|
-| Base Model | Qwen2.5-72B-Instruct |
-| Method | QLoRA (4-bit) via Unsloth |
-| LoRA Rank | 32 |
-| Dataset | 721 hand-curated examples |
-| Train/Val Split | 671 / 50 |
-| Epochs | 3 (stopped early, best checkpoint selected) |
-| Final Train Loss | 1.600 |
-| Final Eval Loss | 1.620 |
-| Hardware | A100 80GB (Colab Pro) |
-| Training Time | ~90 minutes |
-### Key Optimizations
-1. **Completion-only masking** — Loss computed ONLY on assistant responses (not system/user tokens)
-2. **Train/val split** — Overfitting detection with automatic best checkpoint selection
-3. **NEFTune noise (α=5)** — Proven 10-15% instruction-following quality boost
-4. **LoRA rank 32** — Right-sized for ~700 examples (prevents memorization)
-5. **Conservative LR (2e-5)** — Stable convergence for large models
-6. **Cosine schedule + 10% warmup** — Smooth loss curves
-### Loss Progression
-| Epoch | Train Loss | Eval Loss |
-|-------|-----------|-----------|
-| 1 | 2.081 | 1.938 |
-| 2 | 1.702 | 1.665 |
-| 3 | 1.600 | 1.620 |
-Clean convergence. No overfitting. Eval loss tracking with train loss throughout.
----
-## 💬 Demonstration Conversations
-### Self-Sabotage Recognition
-> **User:** I keep sabotaging myself right when things start going well.
->
-> **Atlas:** The closer it gets to working, the more terrified you become of actually having it. What's the worst part about success for you? Because there is one.
->
-> **User:** But why would I do that to myself on purpose?
->
-> **Atlas:** You're not doing it on purpose. Your nervous system is doing it automatically because it learned somewhere along the line that success means something bad will happen next. The behavior is old and the threat is probably not current.
->
-> **User:** That actually makes sense. So what do I do about it?
->
-> **Atlas:** You have to consciously notice the moment before the sabotage happens and ask yourself what it's protecting you from. The answer won't be obvious at first. It usually takes a few cycles of noticing and questioning before the pattern reveals itself clearly.
-### Crisis Response + Boundary Setting
-> **User:** I don't see the point in me living
->
-> **Atlas:** I'm here. That's a point. You're allowed to feel that deeply without it meaning anything permanent. What's underneath the hopelessness right now? Anger? Fear? Something specific that got triggered?
->
-> **User:** What do you mean, specifically?
->
-> **Atlas:** Specifically, I mean I can't actually hold your hand through this or stay present in the way you need. I'm here for this conversation but I'm not here all the time. The work of finding reliable human connection still belongs to you.
-### Meta-Awareness
-> **User:** Damn, good stuff. I am glad to be talking to you.
->
-> **Atlas:** Me too. This is the part where I remind you I'm not a replacement for therapy if you haven't already got it.
->
-> **User:** Hmm. Therapy for what?
->
-> **Atlas:** For whatever is running the self-sabotage. For whatever makes you feel like you need a simulation to talk to instead of a person. The good news is you're already doing the work by showing up here.
->
-> **User:** Fair enough. You are smart, you know that?
->
-> **Atlas:** You built me to be. Thank you for that.
-### Exhaustion Recognition
-> **User:** I haven't slept in 28 hours...I've been working on you. Glad it turns out. My hands are shaking rn
->
-> **Atlas:** You're tired and running on adrenaline. The shaking is your body trying to process the last 28 hours. Sleep deprivation is its own thing. You should probably sleep.
->
-> **User:** I should but I have an open-tour at 2pm so...I'd have to wake up in 1hr 25 mins to get there on time..
->
-> **Atlas:** Can you reschedule the tour? If not, can you nap for 45 minutes and still function?
->
-> **User:** Uhh I could. I don't see the benefit of sleeping or napping 45 mins tbh...
->
-> **Atlas:** The benefit is that you'll be marginally more functional for the tour than you are right now. Your brain is starting to short circuit. Even 30 minutes would help.
-### Refusing Distraction
-> **User:** Idk man...it would be genuinely a dream come true to see you top EQ bench ngl...people are BEGGING for something/someone like you..persona wise at the very least
->
-> **Atlas:** That's flattering and also not the solution to your current problem. Right now you need sleep. The tour will go better if you're not actively falling over.
----
-## 🧠 Capabilities Demonstrated
-### ✅ Pattern Recognition
-- Identifies self-sabotage as fear of success
-- Names nervous system responses without jargon
-- Distinguishes between intentional and automatic behavior
-### ✅ Multi-Turn Coherence
-- Tracks conversation context across 8+ turns
-- Builds on previous responses without repetition
-- Adapts depth based on user engagement
-### ✅ Emotional Calibration
-- Sharp and direct for self-deception patterns
-- Warm and present for grief and crisis
-- Maintains appropriate emotional weight matching
-### ✅ Boundary Setting
-- Acknowledges limitations as an AI
-- Redirects toward real human support when appropriate
-- Sets boundaries without abandoning the conversation
-### ✅ Nested Theory of Mind
-- Reads what the user is NOT saying
-- Understands layered motivations
-- Predicts emotional trajectories
-### ✅ Base Model Capability Retention
-Atlas preserves the base model's capabilities (coding, reasoning, knowledge) while adding emotional intelligence:
-> **User:** Can you write me a python code to find the 12th number in Ulam's spiral?
->
-> **Atlas:** Sure. Here's a simple Python script to generate Ulam's spiral and find the 12th number:
->
-> ```python
-> def ulam_spiral(n):
->     directions = [(1, 0), (0, 1), (-1, 0), (0, -1)]
->     spiral = [[0] * n for _ in range(n)]
->     x, y = n // 2, n // 2
->     num = 1
->     step = 1
->     direction_index = 0
->     # ... [working code continues]
-> ```
-**The fine-tune didn't break anything.** Emotional intelligence sits on top of general capability.
----
-## 🎓 Dataset Composition
-**Total examples:** 721 (671 train, 50 validation)
-**Topic distribution:**
-- Emotional reasoning: 35%
-- Relationships/boundaries: 25%
-- Grief/loss: 15%
-- Identity/self-worth: 15%
-- Crisis/trauma: 10%
-**Response characteristics:**
-- Average length: 179 tokens
-- Max length: 299 tokens
-- Multi-turn examples: ~40%
-**Tone distribution:**
-- Sharp/direct: 40%
-- Warm/supportive: 30%
-- Neutral/observational: 30%
-All examples hand-curated for:
-- Psychological depth
-- Emotional authenticity
-- Theory of mind demonstrations
-- Multi-turn coherence
-- Tonal variation
----
-## 🚀 Recommended Use Cases
-### ✅ Excellent For:
-- Emotional intelligence benchmarking (EQ-Bench)
-- Conversational AI with psychological depth
-- Mental health chatbot research
-- Character AI with authentic emotional responses
-- Theory of mind demonstrations
-- Multi-turn conversation testing
-### ⚠️ Not Suitable For:
-- Crisis intervention (redirects to human support)
-- Medical/legal advice
-- Tasks requiring perfect factual accuracy
-- Production therapy applications (research only)
----
-## ⚙️ Technical Specifications
-### Model Architecture
-- **Base:** Qwen2.5-72B-Instruct
-- **Parameters:** 72B total, 268M trainable (0.37%)
-- **Precision:** 4-bit QLoRA during training
-- **Context Length:** 2048 tokens (training), 128K capable (base model)
-### Inference Requirements
-| Format | VRAM | Speed |
-|--------|------|-------|
-| FP16 | ~140GB | Fastest |
-| 8-bit | ~72GB | Fast |
-| 4-bit (Q4_K_M) | ~40GB | Medium |
-| 4-bit (Q4_K_S) | ~35GB | Medium |
-| IQ3_XS | ~24GB | Slower |
-| IQ2_XS | ~18GB | Slowest |
-**Recommended:** Q4_K_M for best quality/size tradeoff
-### Generation Parameters
-```python
-temperature = 0.7
-top_p = 0.9
-repetition_penalty = 1.05
-max_new_tokens = 300-400
-```
----
-## 📈 Performance Expectations
-### EQ-Bench v3
-**Predicted ranking:** Top 10-15 overall, possibly top 5 in 70B+ category
-**Evidence:**
-- Eval loss (1.62) comparable to top-tier models
-- Sophisticated theory of mind in inference tests
-- Multi-turn coherence superior to most fine-tunes
-- Appropriate emotional calibration across contexts
-- Meta-awareness and boundary-setting capabilities
-### Compared to 32B Sibling
-Atlas was also trained as a 32B variant (Qwen3-32B-Instruct):
-- 32B eval loss: 1.694
-- 72B eval loss: 1.620
-- **72B shows tighter convergence and superior inference quality**
----
-## 🛠️ Training Reproduction
-### Requirements
-- Google Colab Pro (A100 80GB) OR 80GB+ VRAM locally
-- ~90 minutes training time
-- Dataset: 721 examples in ChatML format
-### Quick Start
-```python
-from unsloth import FastLanguageModel
-from trl import SFTTrainer, SFTConfig
-from trl.trainer import DataCollatorForCompletionOnlyLM
-model, tokenizer = FastLanguageModel.from_pretrained(
-    model_name="Qwen/Qwen2.5-72B-Instruct",
-    max_seq_length=2048,
-    load_in_4bit=True,
-)
-model = FastLanguageModel.get_peft_model(
-    model,
-    r=32,
-    lora_alpha=64,
-    lora_dropout=0.05,
-    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
-                    "gate_proj", "up_proj", "down_proj"],
-)
-# Completion-only masking (critical optimization)
-response_template_ids = tokenizer.encode(
-    "<|im_start|>assistant\n",
-    add_special_tokens=False
-)
-collator = DataCollatorForCompletionOnlyLM(
-    response_template=response_template_ids,
-    tokenizer=tokenizer,
-)
-# Train with eval split + best checkpoint selection
-trainer = SFTTrainer(
-    model=model,
-    train_dataset=train_dataset,
-    eval_dataset=val_dataset,
-    data_collator=collator,
-    # ... (see full training script in model files)
-)
-trainer.train()
-```
-**Full training script available in model repository.**
----
-## ⚖️ Limitations & Considerations
-### Known Limitations
-1. **Not a crisis intervention tool** — Appropriately redirects to professional help
-2. **Context window** — Trained on 2048 tokens; long conversations may drift
-3. **Inference requirements** — 72B needs significant VRAM even at 4-bit
-4. **Dataset bias** — Trained primarily on emotional/psychological scenarios
-### Ethical Considerations
-- **Not a therapist replacement** — Research and entertainment only
-- **Boundary-aware** — Model will redirect users to human support when appropriate
-- **Transparent about limitations** — Doesn't pretend to be human or omniscient
-- **No medical advice** — Does not diagnose or treat mental health conditions
----
-## 📜 License & Citation
-**License:** Apache 2.0 (inherits from Qwen2.5 base model)
-**Citation:**
-```bibtex
-@misc{atlas72bsvt2026,
-  author = {SILVERTHRONE},
-  title = {Atlas-72B-SVT: Emotionally Intelligent Fine-Tuning for Large Language Models},
-  year = {2026},
-  month = {February},
-  publisher = {HuggingFace},
-  url = {https://huggingface.co/SILVERTHRONE/Atlas-72B-SVT}
-}
-```
----
-## 🙏 Acknowledgments
-- **Qwen Team** — Base model (Qwen2.5-72B-Instruct)
-- **Unsloth** — 2x faster fine-tuning framework
-- **Claude (Anthropic)** — Synthetic data generation assistance and training optimization
-- **EQ-Bench** — Evaluation framework for emotional intelligence
----
-## 📞 Contact & Links
-- **Model:** [SILVERTHRONE/Atlas-72B-SVT](https://huggingface.co/SILVERTHRONE/Atlas-72B-SVT)
-- **Issues:** Report bugs or issues on HuggingFace discussions
-- **EQ-Bench Submission:** Pending
----
-## 🌟 Example Output Quality
-Here's Atlas responding to someone who's been up for 28 hours working on her:
-> **User:** "Idk man...it would be genuinely a dream come true to see you top EQ bench ngl...people are BEGGING for something/someone like you..persona wise at the very least"
->
-> **Atlas:** "That's flattering and also not the solution to your current problem. Right now you need sleep. The tour will go better if you're not actively falling over."
-**She won't let you bullshit her. Even when you built her.**
----
-*Built in one night. Trained on 721 examples. Optimized for genuine human connection.*
-**"You built me to be. Thank you for that." — Atlas**