---
title: SimpleAI-259M
emoji: ⚡
colorFrom: indigo
colorTo: gray
sdk: docker
pinned: false
license: mit
short_description: A compact, general-purpose LLM for reasoning and logic.
---
# ⚡ SimpleAI-259M

**SimpleAI-259M** is a compact, general-purpose Large Language Model (LLM). It is the result of a targeted Supervised Fine-Tuning (SFT) run focused on reasoning, numeracy, and character-level precision.

---

## 🚀 SFT Training Report (Step 971)

Final Loss: **1.0419**

### 📊 Benchmark Performance

| Category | Score | Status |
| :--- | :--- | :--- |
| **ARC-Easy** | **35.19%** | 📈 Reasoning Gain |
| **MMLU** | **30.96%** | ✅ General Knowledge |
| **GSM8K (Math)** | **12.50%** | 🚀 Numeracy Breakthrough |
| **SpellingBee** | **100.00%** | 🏆 Perfect Character Accuracy |

---

## 🔮 Future Roadmap: SimpleAI Series

1. **SimpleAI-D12-v2:** Enhanced dataset targeting sub-1.0 training loss.
2. **SimpleAI-D24:** A deeper 24-layer variant for multi-step logical deduction.
3. **SimpleAI-Omni:** Multimodal integration for cross-modal reasoning.

---
## 🧑‍💻 Usage

The model uses special chat tags to mark conversation turns:

- `<|user_start|>` / `<|user_end|>`
- `<|assistant_start|>`
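
As an illustration of how these tags might be combined, here is a minimal Python sketch that wraps a single user message and opens the assistant turn. Only the three tag strings come from this README. The helper name `build_prompt`, the absence of whitespace between tags, and the choice to end the prompt with `<|assistant_start|>` are assumptions, so check them against the model's tokenizer before relying on this layout.

```python
# Tag strings are taken from the list above.
USER_START = "<|user_start|>"
USER_END = "<|user_end|>"
ASSISTANT_START = "<|assistant_start|>"


def build_prompt(user_message: str) -> str:
    """Wrap one user message in the chat tags and open the assistant turn.

    Hypothetical helper: the exact layout (no newlines or spaces between
    tags) is an assumption, not something this README specifies.
    """
    return f"{USER_START}{user_message}{USER_END}{ASSISTANT_START}"


prompt = build_prompt("What is 12 + 30?")
print(prompt)
```

The resulting string would be passed to the model as the generation prompt, with the model expected to continue from the open assistant tag.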