Two LoRA cold-start SFT experiments teaching structured think/answer reasoning to Nanbeige4-3B-Base using distilled traces from frontier models
Mrinaal Arora
mrinaalarora
AI & ML interests
None yet
Recent Activity
new activity 6 days ago
humanitys-last-hackathon/signup:Update src/app/signup-panel.tsx updated a Space 9 days ago
mrinaalarora/crisisops updated a Space 11 days ago
mrinaalarora/trackio