ray

Update README.md

7c032f9 verified about 1 month ago

2.45 kB

language:
  - en
license: mit
tags:
  - pytorch
  - safetensors
  - code-generation
  - reasoning
  - continuous-pre-training
  - custom-architecture
datasets:
  - bigcode/the-stack-dedup
  - nampdn-ai/tiny-codes
  - nampdn-ai/tiny-textbooks
  - meta-math/MetaMathQA
  - HuggingFaceTB/dclm-edu
  - HuggingFaceTB/cosmopedia

🚀 Sutra-Instruct-v2-350M

Sutra-Instruct-v2-350M is a highly optimized, 350-million parameter language model built for developers and researchers. It has undergone rigorous Continuous Pre-Training (CPT) to dramatically improve its logical reasoning, mathematical deduction, and Python coding capabilities.

Despite its lightweight footprint, Sutra-Instruct-v2 acts as a highly capable "Witty Coder" and educational assistant, punching well above its weight class thanks to a carefully curated, high-IQ dataset recipe.

🧠 Model Details

Architecture: 350M Parameter Custom Transformer (GPT-style)
Format: Safetensors (Blazing fast, zero-copy loading)
Primary Focus: Python Code Generation, Mathematical Reasoning, Educational QA.
Developer: Abhiray / Jay

📚 Training Data Recipe (Continuous Pre-Training)

To transition the model from a general-purpose base to a reasoning and bit of coding , we executed a Continuous Pre-Training phase using approximately 3.3 Billion high-quality tokens.

The dataset mixture was meticulously balanced to prevent overfitting while maximizing logical deduction:

Dataset	Source	Focus	Target Tokens
The Stack Dedup	`bigcode/the-stack-dedup`	Pure Python Code (`content`)	1.5 Billion
DCLM-Edu	`HuggingFaceTB/dclm-edu`	High-IQ Educational Web Data	600 Million
Cosmopedia (Stanford)	`HuggingFaceTB/cosmopedia`	Synthetic Textbooks	500 Million
Tiny-Codes-QA	`nampdn-ai/tiny-codes`	Prompt/Response Coding Tasks	500 Million
Tiny-Textbooks	`nampdn-ai/tiny-textbooks`	General Knowledge / STEM	100 Million
MetaMathQA	`meta-math/MetaMathQA`	Mathematical Reasoning	100 Million

🛠️ Intended Use Cases

Lightweight Code Copilot: Generating Python scripts, debugging syntax, and writing algorithms locally on low-VRAM hardware.
Edge Devices: Perfect for deployment on Raspberry Pi, older laptops, or mobile devices where massive LLMs cannot run.
Educational QA: Answering STEM, logic, and general knowledge questions.