Article RexRerankers: SOTA Rankers for Product Discovery and AI Assistants thebajajra • Jan 24 • 44
Article NVIDIA brings agents to life with DGX Spark and Reachy Mini +1 jeffboudier, nader-at-nvidia, alecfong • Jan 5 • 66
Article Welcome EmbeddingGemma, Google's new efficient embedding model +4 tomaarsen, Xenova, alvarobartt, ariG23498, pcuenq, sergiopaniego • Sep 4, 2025 • 275
FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech Paper • 2205.12446 • Published May 25, 2022 • 2
Article Fine-Tune Whisper For Multilingual ASR with 🤗 Transformers sanchit-gandhi • Nov 3, 2022 • 372
Article 🇨🇿 BenCzechMark - Can your LLM Understand Czech? +11 mfajcik, hynky, mdocekal, xdolez52, jstetina, Lakoc, popelucha, hales, michal-stefanik, Adamiros, davidadamczyk, JanH, jsedivy • Oct 1, 2024 • 25
Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation Paper • 2406.16678 • Published Jun 24, 2024 • 16
Nemotron 4 340B Collection Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 14 days ago • 164
MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark Paper • 2406.01574 • Published Jun 3, 2024 • 55
Article Training and Finetuning Embedding Models with Sentence Transformers tomaarsen • May 28, 2024 • 275
Czech evaluation datasets Collection This collection should contain Czech evaluation datasets • 8 items • Updated Jan 14, 2024 • 3
Retrieval-Augmented Generation for Large Language Models: A Survey Paper • 2312.10997 • Published Dec 18, 2023 • 12
Improving Text Embeddings with Large Language Models Paper • 2401.00368 • Published Dec 31, 2023 • 84