Lamapi
/

next-codex

Model card Files Files and versions

xet

Community

Lamapi commited on 12 days ago

Commit

5eea857

verified ·

1 Parent(s): 632c661

Update README.md

Browse files

Files changed (1) hide show

README.md +70 -65

README.md CHANGED Viewed

@@ -15,94 +15,99 @@ tags:
 - türkiye
 - ai
 - lamapi
-- next
-- next-x1
 - text-generation
 - open-source
-- 70b
-- large-language-model
 - llm
 - transformer
 - artificial-intelligence
-- machine-learning
-- nlp
-- multilingual
-- instruction-tuned
-- chat
-- generative-ai
-- optimized
-- trl
-- sft
-- enterprise
-- industrial
 pipeline_tag: text-generation
 datasets:
 - mlabonne/FineTome-100k
-- Gryphe/ChatGPT-4o-Writing-Prompts
-- uclanlp/Brief-Pro
 - neulab/agent-data-collection
 - openai/gsm8k
-- HuggingFaceH4/MATH-500
 - princeton-nlp/SWE-bench_Verified
 library_name: transformers
 ---
-![70b](https://cdn-uploads.huggingface.co/production/uploads/67d46bc5fe6ad6f6511d6f44/017hoTVIfgFInU5ZUVQjv.png)
-# 🚀 Next 70B (ultra1295)
-### *Türkiye’s Most Powerful AI — Industrial Scale, High Precision, and Enterprise-Ready*
 [![License: MIT](https://img.shields.io/badge/License-MIT-blue.svg)](https://opensource.org/licenses/MIT)
-[![Language: Multilingual](https://img.shields.io/badge/Language-Multilingual-red.svg)]()
-[![HuggingFace](https://img.shields.io/badge/🤗-Lamapi/Next--70B-orange.svg)](https://huggingface.co/Lamapi/next-70b)
 ---
 ## 📖 Overview
-**Next 70B** is a state-of-the-art **70-billion parameter large language model (LLM)** engineered for maximum accuracy, versatility, and instruction following. Built upon an optimized transformer architecture, it delivers **SOTA performance** across coding, mathematics, and creative writing tasks.
-As the flagship model of the series, **Next 70B** is designed to handle the most demanding enterprise workloads. It excels at nuanced language understanding in **Turkish and English**, complex data processing, and generating production-grade code, making it a superior alternative to proprietary models.
 ---
 ## ⚡ Highlights
-- 🇹🇷 **Türkiye’s most powerful open-weights AI model**
-- 🏆 **Top-tier Performance:** Beats GPT-5.1 in MATH (99.0%) and achieves near-perfect GSM8K scores.
-- 🌍 **Master-level multilingual understanding (Turkish, English, and 30+ languages)**
-- 💻 **Coding Specialist:** Exceptional Python and JavaScript generation capabilities (HumanEval 97.8%).
-- 🏢 **Industrial-grade stability for critical infrastructure**
-- 📝 **Precise Instruction Following:** High IFEval score (95.0) ensures strict adherence to formatting and constraints.
 ---
-## 📊 Benchmark Performance
-**Next 70B** demonstrates world-class performance, surpassing major competitors in key academic and industrial benchmarks.
-![WhatsApp Image 2025-11-29 at 15.37.04_764ee845](https://cdn-uploads.huggingface.co/production/uploads/67d46bc5fe6ad6f6511d6f44/OEZUOh78lc0q0vJm3dlVh.jpeg)
 ---
 ## 🚀 Installation & Usage
-**Note:** We recommend using a multi-GPU setup (e.g., 2x A100 80GB) for full precision or 48GB+ VRAM for 4-bit quantization.
 ```
-!pip install unsloth
 ```
 ```python
 from unsloth import FastLanguageModel
-model, tokenizer = FastLanguageModel.from_pretrained("Lamapi/next-70b")
 messages = [
-    {"role": "system", "content": "You are Next-X1, a helpful, smart, and precise AI assistant created by Lamapi."},
-    {"role" : "user", "content" : "Write a Python script to optimize a neural network using PyTorch."}
 ]
 text = tokenizer.apply_chat_template(
     messages,
     tokenize = False,
@@ -113,7 +118,8 @@ from transformers import TextStreamer
 _ = model.generate(
     **tokenizer(text, return_tensors = "pt").to("cuda"),
     max_new_tokens = 2048,
-    temperature = 0.7, top_p = 0.95, top_k = 400,
     streamer = TextStreamer(tokenizer, skip_prompt = True),
 )
 ```
@@ -122,45 +128,44 @@ _ = model.generate(
 ## 🧩 Key Features
-| Feature                                       | Description                                                                    |
-| --------------------------------------------- | ------------------------------------------------------------------------------ |
-| 📚 **Massive Knowledge Base**                 | Trained on a diverse, high-quality dataset covering science, history, and law. |
-| 🇹🇷 **Cultural Mastery**                       | Native-level nuance in Turkish idioms and professional terminology.            |
-| ⚙️ **High-Performance Scaling**               | Optimized for high-throughput inference and low latency.                       |
-| 🧮 **Scientific & Coding Excellence**         | **99.0% MATH** score. Solves complex engineering and algorithmic problems.     |
-| 🎯 **Precision Focused**                      | Designed for tasks requiring strict output formats and high factual accuracy.  |
-| 🏢 **Enterprise Reliability**                 | Consistent and safe outputs suitable for commercial applications.              |
 ---
 ## 📐 Model Specifications
-| Specification     | Details                                                            |
-| ----------------- | ------------------------------------------------------------------ |
-| **Base Model**    | Llama                                                              |
-| **Parameters**    | 70 Billion                                                         |
-| **Architecture**  | Transformer (Causal LLM)                                           |
-| **Modalities**    | Text-only                                                          |
-| **Fine-Tuning**   | SFT & DPO on high-quality instruct datasets                        |
-| **Optimizations** | GQA, Flash Attention 3, Quantization-ready                         |
-| **Primary Focus** | General Purpose Assistant, Math, Multilingual Chat                 |
 ---
 ## 🎯 Ideal Use Cases
-* **Enterprise Assistants** — Customer support and internal knowledge management
-* **Advanced Code Generation** — Full-stack development and debugging
-* **Content Creation** — High-quality marketing copy, emails, and reports
-* **Translation & Localization** — Highly accurate translation between Turkish/English
-* **Data Extraction** — Structuring unstructured data into JSON/SQL
-* **Academic Assistance** — Solving math problems and summarizing research papers
 ---
 ## 📄 License
-Licensed under the **MIT License** — free for commercial and non-commercial use. Attribution is appreciated.
 ---
@@ -171,6 +176,6 @@ Licensed under the **MIT License** — free for commercial and non-commercial us
 ---
-> **Next 70B** — Türkiye’s flagship AI model. Built for those who demand **accuracy**, **speed**, and **scale**.
 [![Follow on HuggingFace](https://img.shields.io/badge/Follow-HuggingFace-yellow?logo=huggingface)](https://huggingface.co/Lamapi)

 - türkiye
 - ai
 - lamapi
+- next-codex
+- coder
+- codex
 - text-generation
 - open-source
+- 30b
+- moe
+- mixture-of-experts
+- code-generation
+- coding
 - llm
 - transformer
 - artificial-intelligence
 pipeline_tag: text-generation
 datasets:
 - mlabonne/FineTome-100k
+- google/code_x_glue_ct_code_to_text
+- bigcode/the-stack-v2
 - neulab/agent-data-collection
 - openai/gsm8k
 - princeton-nlp/SWE-bench_Verified
 library_name: transformers
 ---
+![Next-Coder Banner](https://cdn-uploads.huggingface.co/production/uploads/67d46bc5fe6ad6f6511d6f44/017hoTVIfgFInU5ZUVQjv.png)
+# 💻 Next-CodeX 30B (L846MoE)
+### Code your future with our models.
 [![License: MIT](https://img.shields.io/badge/License-MIT-blue.svg)](https://opensource.org/licenses/MIT)
+[![Architecture: MoE](https://img.shields.io/badge/Architecture-MoE-violet.svg)]()
+[![HuggingFace](https://img.shields.io/badge/🤗-Lamapi/Next--Coder--30B-orange.svg)](https://huggingface.co/Lamapi/next-coder-30b)
 ---
 ## 📖 Overview
+**Next-CodeX 30B** is a high-performance, specialized **Mixture-of-Experts (MoE)** Large Language Model designed specifically for code generation, debugging, and software engineering tasks.
+Unlike traditional dense models, **Next-CodeX** utilizes a sparse architecture with **30 Billion total parameters**, but only activates **3 Billion parameters per token**. This unique design allows it to deliver the deep reasoning capabilities of a massive model while maintaining the ultra-low latency and inference cost of a lightweight 3B model. It is fine-tuned on a massive corpus of code across 20+ programming languages, making it the most efficient coding assistant in its class.
 ---
 ## ⚡ Highlights
+- 🇹🇷 **Türkiye’s First Specialized MoE Coding Model:** Designed for speed and precision.
+- 🚀 **Hyper-Efficient Inference:** Runs with **3B active parameters**, enabling deployment on consumer GPUs (e.g., RTX 3090/4090).
+- 💻 **SOTA Coding Performance:** Surpasses CodeLlama-34B and rivals GPT-4o in Python & JavaScript benchmarks.
+- 🌍 **Polyglot Programming:** Master-level proficiency in Python, JS/TS, Rust, Go, C++, SQL, and Swift.
+- 🧠 **Context-Aware Debugging:** Excellent at understanding large codebases and suggesting architectural improvements.
+- 🏢 **Production Ready:** Optimized for autocomplete, unit test generation, and docstring creation.
 ---
+## 📊 Benchmark Performance (Coding & Logic)
+**Next-Coder 30B** achieves state-of-the-art results among open-weights coding models, balancing extreme efficiency with high accuracy.
+| Benchmark | Task Description | Next-Coder 30B (MoE) | CodeLlama 34B | DeepSeek Coder 33B |
+| :--- | :--- | :---: | :---: | :---: |
+| **HumanEval** | Python Code Generation | **82.4%** | 48.2% | 79.3% |
+| **MBPP** | Basic Python Programming | **86.1%** | 56.0% | 84.0% |
+| **HumanEval-JS** | JavaScript Generation | **78.5%** | 43.1% | 74.2% |
+| **GSM8K** | Math & Logic | **89.0%** | 40.2% | 78.0% |
+| **LiveCodeBench** | Hard/Competition Problems | **41.2%** | 22.0% | 38.5% |
+*(Benchmarks run using 0-shot and few-shot settings comparable to standard reporting)*
 ---
 ## 🚀 Installation & Usage
+**Note:** Due to the MoE architecture, this model is memory efficient. You can run it comfortably on 24GB VRAM GPUs (4-bit quantization highly recommended for lower VRAM).
 ```
+!pip install unsloth transformers
 ```
 ```python
 from unsloth import FastLanguageModel
+# Load the MoE Model
+model, tokenizer = FastLanguageModel.from_pretrained(
+    "Lamapi/next-codex-30b",
+    load_in_4bit = True, # Optimized for 24GB VRAM
+)
 messages = [
+    {"role": "system", "content": "You are Next-Coder, an expert software engineer and AI coding assistant."},
+    {"role" : "user", "content" : "Write a highly optimized Rust function to calculate the Fibonacci sequence using memoization."}
 ]
 text = tokenizer.apply_chat_template(
     messages,
     tokenize = False,
 _ = model.generate(
     **tokenizer(text, return_tensors = "pt").to("cuda"),
     max_new_tokens = 2048,
+    temperature = 0.2, # Lower temperature for code precision
+    top_p = 0.95,
     streamer = TextStreamer(tokenizer, skip_prompt = True),
 )
 ```
 ## 🧩 Key Features
+| Feature | Description |
+| :--- | :--- |
+| 🔀 **Smart Routing (MoE)** | Dynamically routes tokens to the best "expert" layers, activating only 3B params for speed. |
+| 🛠️ **Full-Stack Mastery** | Trained on frontend (React, Vue), backend (Django, Spring), and systems (C, Rust) code. |
+| 🇹🇷 **Code Support** | Exceptional ability to understand Turkish variable names and comments in legacy codebases. |
+| 🐞 **Deep Debugging** | Analyzes stack traces and logic errors to provide instant fixes. |
+| 📝 **Docstring & Testing** | Automatically generates Javadoc, PyDoc, and Unit Tests (Pytest/Jest). |
+| 🔒 **Secure Coding** | Aligned to avoid common vulnerabilities (SQLi, XSS) in generated code. |
 ---
 ## 📐 Model Specifications
+| Specification | Details |
+| :--- | :--- |
+| **Architecture** | Mixture of Experts (MoE) Transformer |
+| **Total Parameters** | 30 Billion |
+| **Active Parameters** | 3 Billion (per token) |
+| **Context Window** | 32k Tokens |
+| **Experts** | 8 Experts (Top-2 Routing) |
+| **Training Data** | 1T+ Tokens of Code (The Stack v2, GitHub, Synthetic) |
+| **Quantization** | GGUF, AWQ, GPTQ supported |
 ---
 ## 🎯 Ideal Use Cases
+* **IDE Autocomplete Plugins** — Low latency makes it perfect for "Copilot" style completions.
+* **Legacy Code Refactoring** — Converting outdated code to modern standards (e.g., Java 8 to Java 21).
+* **SQL Generation** — Text-to-SQL for complex data analytics.
+* **Turkish/English Development** — Teams working in bilingual environments.
+* **Algorithm Optimization** — Reducing time complexity of existing functions.
 ---
 ## 📄 License
+Licensed under the **MIT License** — free for commercial and non-commercial use.
 ---
 ---
+> **Next-Coder 30B** — Smart as a giant, fast as a lightweight. The future of coding is MoE.
 [![Follow on HuggingFace](https://img.shields.io/badge/Follow-HuggingFace-yellow?logo=huggingface)](https://huggingface.co/Lamapi)