From RAG to Agentic RAG for Faithful Islamic Question Answering Paper • 2601.07528 • Published Jan 12 • 2
Prototypicality Bias Reveals Blindspots in Multimodal Evaluation Metrics Paper • 2601.04946 • Published Jan 8
Open ASR Leaderboard: Towards Reproducible and Transparent Multilingual and Long-Form Speech Recognition Evaluation Paper • 2510.06961 • Published Oct 8, 2025 • 11
BERTweet: A pre-trained language model for English Tweets Paper • 2005.10200 • Published May 20, 2020 • 1
Ferret-UI Lite: Lessons from Building Small On-Device GUI Agents Paper • 2509.26539 • Published Sep 30, 2025 • 10
UltraCUA: A Foundation Model for Computer Use Agents with Hybrid Action Paper • 2510.17790 • Published Oct 20, 2025 • 6
MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources Paper • 2509.25531 • Published Sep 29, 2025 • 9
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution Paper • 2510.08697 • Published Oct 9, 2025 • 39
Distributional Semantics Tracing: A Framework for Explaining Hallucinations in Large Language Models Paper • 2510.06107 • Published Oct 7, 2025 • 3
KIT's Offline Speech Translation and Instruction Following Submission for IWSLT 2025 Paper • 2505.13036 • Published May 19, 2025
ViCocktail: Automated Multi-Modal Data Collection for Vietnamese Audio-Visual Speech Recognition Paper • 2506.04635 • Published Jun 5, 2025
Fast and Accurate Capitalization and Punctuation for Automatic Speech Recognition Using Transformer and Chunk Merging Paper • 1908.02404 • Published Aug 7, 2019
Improving Vietnamese Named Entity Recognition from Speech Using Word Capitalization and Punctuation Recovery Models Paper • 2010.00198 • Published Oct 1, 2020
Multi-Stage Verification-Centric Framework for Mitigating Hallucination in Multi-Modal RAG Paper • 2507.20136 • Published Jul 27, 2025
Beyond Correlation: Interpretable Evaluation of Machine Translation Metrics Paper • 2410.05183 • Published Oct 7, 2024 • 1
Leveraging Vision-Language Pre-training for Human Activity Recognition in Still Images Paper • 2506.13458 • Published Jun 16, 2025
view post Post 6647 Excited to onboard FeatherlessAI on Hugging Face as an Inference Provider - they bring a fleet of 6,700+ LLMs on-demand on the Hugging Face Hub 🤯Starting today, you'd be able to access all those LLMs (OpenAI compatible) on HF model pages and via OpenAI client libraries too! 💥Go, play with it today: https://huggingface.co/blog/inference-providers-featherlessP.S. They're also bringing on more GPUs to support all your concurrent requests! See translation 1 reply · 🔥 7 7 + Reply