Model Card

Overview

This model is a 156M-parameter English-language causal language model trained on a large-scale text corpus and instruction-tuned for general question answering and task completion.

Running script

see github -> https://github.com/firdavsus/LLM_D2

Model Details

Model size: 156M parameters
Architecture: Transformer (causal LM)
Tokenizer: GPT-2 tokenizer
Languages: English only

Training Curves

Training Data

Pretraining

Dataset: The Pile (10B token subset)
Domain: mixed-domain text (web, books, articles, code, etc.)

Instruction Fine-tuning

Dataset: Alpaca (cleaned subset)
Size: ~50,000 instruction–response examples
Formatting: instruction-style prompt/response pairs

Training Setup

Pretraining

Steps: 218,000
Final training loss: 2.6

Post-training (Instruction Fine-tuning)

Steps: 2,500
Final training loss: 1.9

Evaluation

Benchmark	Score
HellaSwag	28.5

Intended Use

Instruction-style prompting
Basic question answering
Text generation and summarization
Lightweight assistant-style tasks (English)

Limitations

Small model size limits reasoning and factual reliability
May produce incorrect or inconsistent answers
Instruction-following quality depends strongly on prompt format
Not suitable for high-stakes or safety-critical use

This model has not been safety-aligned. Please apply your own moderation and guardrails when deploying it ;)

FOR ADDITIONAL INFO CHEKC INFO.TXT

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support