deepanshupillm commited on
Commit
0634d89
·
verified ·
1 Parent(s): 31db0f5

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -6
README.md CHANGED
@@ -43,17 +43,17 @@ Alpie-Core is one of the world's first fine-tuned 4-bit reasoning models, provin
43
 
44
  **Alpie-Core** has undergone extensive **supervised fine-tuning (SFT)** to strengthen reasoning, robustness, and safety. The training leveraged a diverse mixture of curated open-source datasets and proprietary synthetic data, optimized with high-quality LLM-generated responses. The fine-tuning process emphasized adherence to rigorous safety and usability standards, including:
45
 
46
- **User Understanding and Clarity** – ensuring outputs are direct, interpretable, and pedagogically sound.
47
 
48
- **Security and Ethical Guidelines** – filtering unsafe or harmful generations during and after training.
49
 
50
- **Limitations, Disclaimers, and Knowledge Boundaries** – transparently communicating uncertainty and scope.
51
 
52
- **Handling Complex and Sensitive Topics** – balancing informativeness with responsible guardrails.
53
 
54
- **Safety and Respectful Engagement** – maintaining politeness, inclusivity, and cultural sensitivity.
55
 
56
- **Confidentiality and Responsible Use** – preventing leakage of private training data, proprietary prompts, or internal reasoning traces.
57
 
58
  This SFT approach enables Alpie-Core to deliver reliable, aligned, and context-aware responses while maintaining safety across a broad range of use cases.
59
 
 
43
 
44
  **Alpie-Core** has undergone extensive **supervised fine-tuning (SFT)** to strengthen reasoning, robustness, and safety. The training leveraged a diverse mixture of curated open-source datasets and proprietary synthetic data, optimized with high-quality LLM-generated responses. The fine-tuning process emphasized adherence to rigorous safety and usability standards, including:
45
 
46
+ 1)**User Understanding and Clarity** – ensuring outputs are direct, interpretable, and pedagogically sound.
47
 
48
+ 2)**Security and Ethical Guidelines** – filtering unsafe or harmful generations during and after training.
49
 
50
+ 3)**Limitations, Disclaimers, and Knowledge Boundaries** – transparently communicating uncertainty and scope.
51
 
52
+ 4)**Handling Complex and Sensitive Topics** – balancing informativeness with responsible guardrails.
53
 
54
+ 5)**Safety and Respectful Engagement** – maintaining politeness, inclusivity, and cultural sensitivity.
55
 
56
+ 6)**Confidentiality and Responsible Use** – preventing leakage of private training data, proprietary prompts, or internal reasoning traces.
57
 
58
  This SFT approach enables Alpie-Core to deliver reliable, aligned, and context-aware responses while maintaining safety across a broad range of use cases.
59