Introducing HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Detecting Hallucinations in Real-World Scenarios May 2 • 19
Limbic Collection A collection of models and datasets for Limbic -- captures and processes agent behavior, helps you understand it, and auto improves your agents • 2 items • Updated Oct 3 • 1
Limbic Collection A collection of models and datasets for Limbic -- captures and processes agent behavior, helps you understand it, and auto improves your agents • 2 items • Updated Oct 3 • 1
HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Real-World Hallucination Detection Paper • 2505.00506 • Published May 1
Crowd Guilds: Worker-led Reputation and Feedback on Crowdsourcing Platforms Paper • 1611.01572 • Published Nov 4, 2016
Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model Paper • 2402.07827 • Published Feb 12, 2024 • 48
Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning Paper • 2402.06619 • Published Feb 9, 2024 • 56