File size: 1,208 Bytes
b08846a
e3e1377
 
 
 
 
 
 
 
 
 
b08846a
e3e1377
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
---
language:
- es
- en
license: mit
tags:
- gpt2
- code
- bilingual
- inserloft
model_name: Cleo Nano v3.1 Bilingual
---

# Cleo Nano v3.1 (Bilingual Optimization)

Cleo Nano is a decoder-only Transformer model developed by **Inserloft** under the vision of **Jesus Heriberto Corona**. This version (v3.1) features surgical fine-tuning for bilingual stability (English/Spanish) and hallucination control.

## Model Details
- **Architecture:** Decoder-Only GPT (Custom)
- **Layers:** 8
- **Embedding Dim:** 384
- **Attention Heads:** 12
- **Context Window:** 256 tokens
- **Parameters:** ~15M
- **Training Data:** Mix of Wikipedia, Python Code (CodeFeedback), and Identity Anchoring.

## Usage
To use this model, you need the custom `CleoNanoV3` architecture defined in PyTorch. The weights can be loaded using `torch.load()` or via the Hugging Face `from_pretrained` if using the provided mapping logic.

### Capabilities
1. **Bilingual Chat:** Responds to general queries in both Spanish and English.
2. **Code Generation:** Specialized in Python snippets (Sum, Loops, Classes).
3. **Identity Preservation:** Strong grounding on its origin and creator.

---
Developed by [Inserloft](https://inserloft.dev/)