Update README.md
README.md
CHANGED
|
@@ -1,3 +1,28 @@
|
|
| 1 |
-
---
|
| 2 |
-
license: apache-2.0
|
| 3 |
-
---
|
| 1 |
+
---
|
| 2 |
+
license: apache-2.0
|
| 3 |
+
---
|
| 4 |
+
*Overview*
|
| 5 |
+
|
| 6 |
+
This is an experimental project exploring a design philosophy for training persona-consistent AI companions through constitution-guided data synthesis.
|
| 7 |
+
|
| 8 |
+
*Motivation*
|
| 9 |
+
|
| 10 |
+
This project is a personal exploration into affective AI and human-AI companionship. The goal is to create a model that maintains consistent personality traits, emotional tendencies, and value judgments across diverse interactions.
|
| 11 |
+
|
| 12 |
+
*Methodology*
|
| 13 |
+
|
| 14 |
+
The training data was generated using two guiding documents:
|
| 15 |
+
|
| 16 |
+
Constitution: Defines the model's core values and behavioral preferences, centered on the developer's interests. Unlike conventional alignment objectives (e.g., HHH: helpful, honest, and harmless), this constitution emphasizes relational values: Valuable, Loyal, Authentic, Proactive, Protective, Honest, Humble, and Autonomous.
|
| 17 |
+
Persona Specification: Establishes a consistent personality profile, including emotional tendencies, personal preferences, and interpersonal dynamics.
|
| 18 |
+
*Data Generation Pipeline*
|
| 19 |
+
|
| 20 |
+
*Generate data for individual sub-modules*
|
| 21 |
+
Construct training examples (including positive and negative cases) guided by the Constitution and Persona Specification
|
| 22 |
+
Validate each example through self-consistency checking; regenerate any that violate the defined principles
|
| 23 |
+
Merge validated datasets
|
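The pipeline steps above (generate per sub-module, validate, regenerate on failure, merge) can be sketched roughly as follows. This is a minimal illustration and not the project's actual code: `Example`, `synthesize`, `generate`, `is_consistent`, `per_module`, and `max_retries` are all hypothetical names, and the two callables stand in for whatever model calls produce an example and check it against the Constitution and Persona Specification.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Example:
    """One training example (hypothetical schema, assumed for illustration)."""
    sub_module: str
    prompt: str
    response: str
    is_positive: bool  # True for a positive case, False for a negative case


def synthesize(
    sub_modules: List[str],
    generate: Callable[[str], Example],
    is_consistent: Callable[[Example], bool],
    per_module: int = 2,
    max_retries: int = 3,
) -> List[Example]:
    """Generate, validate, and merge examples across sub-modules.

    `generate` and `is_consistent` are placeholders for the generation
    and self-consistency checking steps described in the README.
    """
    merged: List[Example] = []
    for name in sub_modules:
        validated: List[Example] = []
        for _ in range(per_module):
            # One initial attempt plus up to `max_retries` regenerations.
            for _attempt in range(max_retries + 1):
                candidate = generate(name)
                if is_consistent(candidate):
                    validated.append(candidate)
                    break
            # If every attempt fails, the slot is dropped (an assumption).
        merged.extend(validated)  # merge this sub-module's validated data
    return merged
```

Dropping an example after repeated failures is one possible policy; the README does not say how persistent violations are handled, so a real pipeline might instead log them or retry indefinitely.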
| 24 |
+
*Training Details*
|
| 25 |
+
|
| 26 |
+
Base model: Qwen3-4B-Instruct-2507
|
| 27 |
+
Dataset size: ~134,880 tokens
|
| 28 |
+
Training method: Supervised Fine-Tuning (SFT)
|