Article Training and Finetuning Embedding Models with Sentence Transformers tomaarsen • May 28, 2024 • 274
Article Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset sdiazlor • Feb 10, 2025 • 60
Collection Tools 4 learning AI • This is a collection of tools on the hub that teachers and students can use to learn AI! • 9 items • Updated Mar 2 • 67
Article How to build a custom text classifier without days of human labeling sdiazlor • Oct 17, 2024 • 57
Article How to optimize your data labelling project with custom interfaces burtenshaw • Oct 16, 2024 • 20
Collection Datasets ATR line-level • This collection contains all our datasets for Automatic Text Recognition on line images. • 12 items • Updated Jun 17, 2025 • 6
Article Fine-tuning a token classification model for legal data using Argilla and AutoTrain bikashpatra • Sep 7, 2024 • 15
Article How we leveraged distilabel to create an Argilla 2.0 Chatbot plaguss, gabrielmbmb, sdiazlor, osanseviero, dvilasuero • Jul 16, 2024 • 33
Article 🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets dvilasuero • Jun 4, 2024 • 79
Article ⚗️ 🧑🏼🌾 Let's grow some Domain Specific Datasets together burtenshaw • Apr 29, 2024 • 29