Fine-tuning the Talkie 13B 1930 model on agentic trajectories
Ricardo
ricdomolm
AI & ML interests
LLMs
Recent Activity
Updated a dataset 3 days ago: ricdomolm/mini-coder-trajs-400k
Updated a model 3 days ago: ricdomolm/mini-coder-1.7b
Updated a model 3 days ago: ricdomolm/mini-coder-4b
Organizations
1930 Coder
Fine-tuning the Talkie 13B 1930 model on agentic trajectories
Computational Arbitrage
Models and datasets for the paper "Computational Arbitrage in AI Model Markets"
mini-coder
Small models for agentic SWE research: https://ricardodominguez.github.io/blogs/minicoder.html
Training on the test task models
Models fine-tuned for multiple choice question answering (mc) and mathematical reasoning (gsm8k). https://arxiv.org/abs/2407.07890