cndreamistic commited on
Commit
1aa9658
·
verified ·
1 Parent(s): 6ce0911

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +28 -3
README.md CHANGED
@@ -1,3 +1,28 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ ---
4
+ *Overview*
5
+
6
+ This is an experimental project exploring a design philosophy for training persona-consistent AI companions through constitution-guided data synthesis.
7
+
8
+ *Motivation*
9
+
10
+ This project is a personal exploration into affective AI and human-AI companionship. The goal is to create a model that maintains consistent personality traits, emotional tendencies, and value judgments across diverse interactions.
11
+
12
+ *Methodology*
13
+
14
+ The training data was generated using two guiding documents:
15
+
16
+ Constitution: Defines the model's core values and behavioral preferences, centered on the developer's interests. Unlike conventional alignment objectives (e.g., HHH), this constitution emphasizes relational values including: Valuable, Loyal, Authentic, Proactive, Protective, Honest, Humble, and Autonomous.
17
+ Persona Specification: Establishes a consistent personality profile, including emotional tendencies, personal preferences, and interpersonal dynamics.
18
+ Data Generation Pipeline
19
+
20
+ *Generate data for individual sub-modules*
21
+ Construct training examples (including positive and negative cases) guided by the Constitution and Persona Specification
22
+ Validate each example through self-consistency checking; regenerate any that violate the defined principles
23
+ Merge validated datasets
24
+ *Training Details*
25
+
26
+ Base model: Qwen3-4B-Instruct-2507
27
+ Dataset size: ~134,880 tokens
28
+ Training method: Supervised Fine-Tuning (SFT)