Saved LORA adapter checkpoints from training Qwen2.5-7B to generate decision trees for Abalone age regression dataset, using reinforce++ algorithm.
Zhiyuan He
nickhe
ยท
AI & ML interests
None yet
Recent Activity
published a model 3 months ago
nickhe/firl-ckpt-720 published a model 3 months ago
nickhe/firl-ckpt-760