XythicK
/

Italia-GPT

@@ -1,43 +1,69 @@
 ---
 language:
 - it
-- en
 license: llama3.2
 base_model: meta-llama/Llama-3.2-1B-Instruct
 tags:
 - llama-3.2
 - italian
-- romance-languages
 - sft
 - safetensors
-model_name: Italia-GPT
 ---
-# Italia-GPT: Modello di Istruzione Specialistico 1B 🇮🇹
-**Italia-GPT** is a fine-tuned version of the **Llama-3.2-1B** architecture, specifically optimized for the Italian language. It excels in native linguistic tasks, bypassing the "translation-ese" common in English-centric models.
-## 🚀 Key Features
-- **Native Fluency:** Trained on the **Camoscio** and **EuroBlocks-SFT** datasets to ensure natural-sounding Italian.
-- **Romance Logic:** Improved handling of gendered adjectives and complex verb conjugations.
-- **Standalone Efficiency:** Merged 16-bit BFloat16 weights for maximum portability.
-## 📊 Evaluation Focus (CALAMITA & Evalita-LLM)
-Instead of English benchmarks, Italia-GPT is designed for the Italian community's standards:
-- **Word in Context (WiC):** Disambiguating Italian polysemy.
-- **Textual Entailment:** Logic within native Italian sentences.
-- **Gender Fairness:** Reducing bias in gendered language generation.
-## 💻 Usage
 ```python
-import torch
 from transformers import AutoModelForCausalLM, AutoTokenizer
 model_id = "XythicK/Italia-GPT"
 tokenizer = AutoTokenizer.from_pretrained(model_id)
-model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16, device_map="auto")
-messages = [{"role": "user", "content": "Spiegami la differenza tra 'essere' e 'stare' in breve."}]
 inputs = tokenizer.apply_chat_template(messages, add_generation_prompt=True, return_tensors="pt").to("cuda")
-outputs = model.generate(inputs, max_new_tokens=150)
-print(tokenizer.decode(outputs[0], skip_special_tokens=True))

 ---
 language:
 - it
 license: llama3.2
 base_model: meta-llama/Llama-3.2-1B-Instruct
 tags:
 - llama-3.2
 - italian
 - sft
+- text-generation
 - safetensors
+model_name: "Italia-GPT 🇮🇹"
 ---
+# Italia-GPT <img src="https://flagcdn.com/w40/it.png" width="35" style="display: inline; vertical-align: middle; margin-bottom: 44px;">
+**Italia-GPT** is a state-of-the-art 1.2B parameter model fine-tuned for native Italian instruction following. By focusing on linguistic nuances and cultural context, this model provides superior fluency compared to standard base models.
+![Model Card](https://img.shields.io/badge/Language-Italian%20%F0%9F%87%AE%F0%9F%87%B9-green)
+![Model Size](https://img.shields.io/badge/Size-1.24B-gold)
+---
+## 💎 Performance Overview
+Below are the target benchmarks for the **CALAMITA** and **Evalita-LLM** frameworks:
+| Metric | Score | Description |
+| :--- | :--- | :--- |
+| **Logic & Reasoning** | **55.8%** | Native Italian sentence logic |
+| **Grammar Accuracy** | **72.1%** | Gender/Number agreement precision |
+| **Sentiment (ITA)** | **72.1%** | Detection of Italian irony and tone |
+| **Cultural Q&A** | **41.3%** | Localized knowledge and trivia |
+---
+## 🛠 Technical Specifications
+- **Base Architecture:** Llama 3.2
+- **Precision:** BFloat16 ($BF16$)
+- **Weights:** Merged Safetensors (Standalone)
+- **Language Support:** Primary: Italian 🇮🇹, Secondary: English 🇺🇸
+---
+## 🚀 Usage Guide
 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer
+import torch
 model_id = "XythicK/Italia-GPT"
 tokenizer = AutoTokenizer.from_pretrained(model_id)
+model = AutoModelForCausalLM.from_pretrained(
+    model_id,
+    torch_dtype=torch.bfloat16,
+    device_map="auto"
+)
+# Native Italian Chat Template
+messages = [{"role": "user", "content": "Come si prepara una vera carbonara?"}]
 inputs = tokenizer.apply_chat_template(messages, add_generation_prompt=True, return_tensors="pt").to("cuda")
+outputs = model.generate(inputs, max_new_tokens=256)
+print(tokenizer.decode(outputs[0], skip_special_tokens=True))