---
license: mit
---
# Model Card

## Overview

This model is a 156M-parameter English-language causal language model trained on a large-scale text corpus and instruction-tuned for general question answering and task completion.

---

## Running script

The script for running the model is available on GitHub: https://github.com/firdavsus/LLM_D2

## Model Details

* **Model size:** 156M parameters
* **Architecture:** Transformer (causal LM)
* **Tokenizer:** GPT-2 tokenizer
* **Languages:** English only

---

## Training Curves
![Pre-Training](training_curves.png)

![Post-Training](loss.png)

## Training Data

### Pretraining

* Dataset: The Pile (10B token subset)
* Domain: mixed-domain text (web, books, articles, code, etc.)

### Instruction Fine-tuning

* Dataset: Alpaca (cleaned subset)
* Size: ~50,000 instruction–response examples
* Formatting: instruction-style prompt/response pairs
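
Since the fine-tuning data uses instruction-style prompt/response pairs, prompts at inference time should probably follow the same layout. The sketch below uses the widely known Alpaca no-input template as an assumption; this card does not state the exact template, so check the repository before relying on it. `build_prompt` and `extract_response` are hypothetical helper names.

```python
# Assumed template: the common Alpaca "no input" format.
# The exact format used for this model is not confirmed by the model card.
ALPACA_TEMPLATE = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Response:\n"
)

RESPONSE_MARKER = "### Response:\n"


def build_prompt(instruction: str) -> str:
    """Wrap a raw instruction in the assumed Alpaca-style template."""
    return ALPACA_TEMPLATE.format(instruction=instruction.strip())


def extract_response(generated: str) -> str:
    """Return the text after the response marker, or the whole text if absent."""
    _, sep, tail = generated.partition(RESPONSE_MARKER)
    return tail.strip() if sep else generated.strip()


if __name__ == "__main__":
    prompt = build_prompt("Name three primary colors.")
    print(prompt)
```

The string returned by `build_prompt` would be passed to the model's generation routine, and `extract_response` would strip the prompt from the decoded output.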

---

## Training Setup

### Pretraining

* Steps: **218,000**
* Final training loss: **2.6**
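
As a rough sanity check, the step count can be related to the size of the pretraining subset. The sketch below assumes a single pass over the 10B tokens and that each step is one optimizer update; neither is stated in this card, so the result is only an order-of-magnitude estimate.

```python
PRETRAIN_TOKENS = 10_000_000_000  # 10B-token subset, from this card
PRETRAIN_STEPS = 218_000          # from this card


def tokens_per_step(total_tokens: int, steps: int, epochs: float = 1.0) -> float:
    """Average tokens consumed per step, assuming `epochs` passes over the data."""
    return total_tokens * epochs / steps


if __name__ == "__main__":
    # epochs=1.0 is an assumption, not a documented setting.
    estimate = tokens_per_step(PRETRAIN_TOKENS, PRETRAIN_STEPS)
    print(f"~{estimate:,.0f} tokens per step")  # ~45,872 tokens per step
```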

### Post-training (Instruction Fine-tuning)

* Steps: **2,500**
* Final training loss: **1.9**
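
The reported losses can be read as perplexities if they are mean per-token cross-entropy values in nats, which is the usual convention but is an assumption here. A minimal sketch of the conversion:

```python
import math


def perplexity(mean_loss_nats: float) -> float:
    """Convert a mean per-token cross-entropy (in nats) to perplexity."""
    return math.exp(mean_loss_nats)


if __name__ == "__main__":
    # Loss values are taken from this card; the unit (nats) is assumed.
    print(f"pretraining:  {perplexity(2.6):.2f}")  # 13.46
    print(f"fine-tuning:  {perplexity(1.9):.2f}")  # 6.69
```

The two numbers are measured on different datasets (The Pile versus Alpaca), so they are not directly comparable.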

---

## Evaluation

| Benchmark | Score    |
| --------- | -------- |
| HellaSwag | **28.5** |
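
For context, HellaSwag presents four candidate endings per item, so random guessing scores about 25. The sketch below shows one common way to score such a benchmark: pick the ending with the highest model score and compute accuracy. It is not necessarily how the number above was produced, and the card does not say whether the score is raw or length-normalized accuracy. The scores in the example are made-up toy values.

```python
from typing import Sequence

NUM_ENDINGS = 4                 # HellaSwag has four candidate endings per item
CHANCE = 100.0 / NUM_ENDINGS    # expected score of random guessing


def predict(ending_scores: Sequence[float]) -> int:
    """Index of the highest-scoring ending (e.g. by log-likelihood)."""
    return max(range(len(ending_scores)), key=lambda i: ending_scores[i])


def accuracy(all_scores: Sequence[Sequence[float]], labels: Sequence[int]) -> float:
    """Percentage of items where the top-scoring ending matches the label."""
    correct = sum(predict(s) == y for s, y in zip(all_scores, labels))
    return 100.0 * correct / len(labels)


if __name__ == "__main__":
    # Toy log-likelihoods for two items; illustrative values only.
    toy_scores = [[-5.0, -3.0, -4.0, -6.0], [-2.0, -2.5, -9.0, -3.0]]
    toy_labels = [1, 2]
    print(accuracy(toy_scores, toy_labels))   # 50.0
    print(28.5 - CHANCE)                      # 3.5 points above chance
```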

---

## Intended Use

* Instruction-style prompting
* Basic question answering
* Text generation and summarization
* Lightweight assistant-style tasks (English)

---

## Limitations

* Small model size limits reasoning and factual reliability
* May produce incorrect or inconsistent answers
* Instruction-following quality depends strongly on prompt format
* Not suitable for high-stakes or safety-critical use

**This model has not been safety-aligned. Apply your own moderation and guardrails when deploying it.**

For additional information, see INFO.TXT.