legolasyiu committed
Commit 5990dd3 · verified · Parent: 2f70574

Update README.md

Files changed (1): README.md (+2 -2)
README.md CHANGED
@@ -13,7 +13,7 @@ language:
 
 # ReasoningCore-Llama-3B-R1-aligned
 
-**ReasoningCore3B** is a multilingual, reasoning‑enhanced large language model developed by EpitemeAI. Pretrained on vast amounts of publicly available data and instruction‑tuned to excel at nuanced reasoning, dialogue management, retrieval, and summarization tasks, it often outperforms many current open-source and proprietary conversational models on a range of industry benchmarks. It is fine‑tuned on a reasoning dataset.
+**ReasoningCore-Llama-3B-R1-aligned** is a multilingual, reasoning‑enhanced large language model developed by EpitemeAI. Pretrained on vast amounts of publicly available data and instruction‑tuned to excel at nuanced reasoning, dialogue management, retrieval, and summarization tasks, it often outperforms many current open-source and proprietary conversational models on a range of industry benchmarks. It is fine‑tuned on a reasoning dataset.
 
 ### We used the GRPO technique:
 
@@ -34,7 +34,7 @@ To provide a comprehensive overview of Group Relative Policy Optimization (GRPO)
 
 | Model | Training Data | Params | Input Modalities | Output Modalities | Context Length | GQA | Shared Embeddings | Token Count | Knowledge Cutoff |
 |--------------------------------|--------------------------------------------------|--------|-----------------------|------------------------------|----------------|-----|-------------------|----------------|-------------------|
-| **ReasoningCore3B (text only)** | A new mix of publicly available online data. | 3B | Multilingual Text | Multilingual Text and code | 128k | Yes | Yes | Up to 9T tokens | December 2023 |
+| **ReasoningCore-Llama-3B-R1-aligned (text only)** | A new mix of publicly available online data. | 3B | Multilingual Text | Multilingual Text and code | 128k | Yes | Yes | Up to 9T tokens | December 2023 |
 
 - **Supported Languages:**
   Officially supports English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai. While the pretraining included a broader range of languages, the model can be fine‑tuned for additional languages in compliance with the community license and acceptable use policies.
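For context on the "We used the GRPO technique" heading in the hunk above: GRPO (Group Relative Policy Optimization) replaces PPO's learned value baseline with a baseline computed from a group of completions sampled for the same prompt. Below is a minimal sketch of that group-relative advantage step, based on the commonly published GRPO formulation rather than this repo's actual training code; the function name is hypothetical.

```python
import torch

def group_relative_advantages(rewards: torch.Tensor, eps: float = 1e-4) -> torch.Tensor:
    """Hypothetical helper: rewards has shape (num_prompts, group_size),
    one scalar reward per sampled completion. Each reward is centered and
    scaled by its own group's mean and std, which serves as the baseline."""
    mean = rewards.mean(dim=1, keepdim=True)
    std = rewards.std(dim=1, keepdim=True)
    return (rewards - mean) / (std + eps)

# Example: 2 prompts, 4 sampled completions each.
rewards = torch.tensor([[1.0, 0.0, 0.5, 1.0],
                        [0.2, 0.9, 0.4, 0.1]])
print(group_relative_advantages(rewards))  # positive = better than the group average
```

Because the baseline comes from the sampled group itself, no separate critic model is needed, which is the main training-cost advantage usually cited for GRPO.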