Fine-tuning the Talkie 13B 1930 model on agentic trajectories
Ricardo
ricdomolm
AI & ML interests
LLMs
Recent Activity
updated a collection 2 days ago
1930 Coder updated a dataset 2 days ago
ricdomolm/eval-trajs-1930-coder published a dataset 2 days ago
ricdomolm/eval-trajs-1930-coderOrganizations
1930 Coder
Fine-tuning the Talkie 13B 1930 model on agentic trajectories
Computational Arbitrage
Models and datasets for the paper "Computational Arbitrage in AI Model Markets"
mini-coder
Small models for agentic SWE research: https://ricardodominguez.github.io/blogs/minicoder.html
Training on the test task models
Models fine-tuned for multiple choice question answering (mc) and mathematical reasoning (gsm8k). https://arxiv.org/abs/2407.07890