karmiq (Karel Minarik)

upvoted a collection 5 months ago

EmbeddingGemma

Collection

7 items • Updated Sep 4, 2025 • 4

upvoted an article 5 months ago

Article

RexRerankers: SOTA Rankers for Product Discovery and AI Assistants

thebajajra

•

Jan 24

• 44

upvoted an article 6 months ago

Article

NVIDIA brings agents to life with DGX Spark and Reachy Mini

+1

jeffboudier, nader-at-nvidia, alecfong

•

Jan 5

• 66

upvoted 2 articles 8 months ago

Article

Welcome EmbeddingGemma, Google's new efficient embedding model

+4

tomaarsen, Xenova, alvarobartt, ariG23498, pcuenq, sergiopaniego

•

Sep 4, 2025

• 275

Article

Sentence Transformers is joining Hugging Face!

tomaarsen

•

Oct 22, 2025

• 88

upvoted a paper 8 months ago

FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech

Paper • 2205.12446 • Published May 25, 2022 • 2

upvoted an article 9 months ago

Article

Fine-Tune Whisper For Multilingual ASR with 🤗 Transformers

sanchit-gandhi

•

Nov 3, 2022

• 372

upvoted an article over 1 year ago

Article

🇨🇿 BenCzechMark - Can your LLM Understand Czech?

+11

mfajcik, hynky, mdocekal, xdolez52, jstetina, Lakoc, popelucha, hales, michal-stefanik, Adamiros, davidadamczyk, JanH, jsedivy

•

Oct 1, 2024

• 25

upvoted a paper almost 2 years ago

Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation

Paper • 2406.16678 • Published Jun 24, 2024 • 16

upvoted a collection about 2 years ago

Nemotron 4 340B

Collection

Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 15 days ago • 164

upvoted a paper about 2 years ago

MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark

Paper • 2406.01574 • Published Jun 3, 2024 • 55

upvoted an article about 2 years ago

Article

Training and Finetuning Embedding Models with Sentence Transformers

tomaarsen

•

May 28, 2024

• 275

upvoted a paper over 2 years ago

Anticipatory Music Transformer

Paper • 2306.08620 • Published Jun 14, 2023 • 10

upvoted a collection over 2 years ago

Czech evaluation datasets

Collection

This collections should contain czech evaluation datasets • 8 items • Updated Jan 14, 2024 • 3

upvoted 6 papers over 2 years ago

Karel Minarik

AI & ML interests

Organizations

EmbeddingGemma

RexRerankers: SOTA Rankers for Product Discovery and AI Assistants

NVIDIA brings agents to life with DGX Spark and Reachy Mini

Welcome EmbeddingGemma, Google's new efficient embedding model

Sentence Transformers is joining Hugging Face!

FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech

Fine-Tune Whisper For Multilingual ASR with 🤗 Transformers

🇨🇿 BenCzechMark - Can your LLM Understand Czech?

Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation

Nemotron 4 340B

MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark

Training and Finetuning Embedding Models with Sentence Transformers

Anticipatory Music Transformer

Czech evaluation datasets

Retrieval-Augmented Generation for Large Language Models: A Survey

Improving Text Embeddings with Large Language Models

Multilingual E5 Text Embeddings: A Technical Report

Text Embeddings Reveal (Almost) As Much As Text

Shai: A large language model for asset management

Borges and AI

Karel Minarik

AI & ML interests

Organizations

karmiq's activity

RexRerankers: SOTA Rankers for Product Discovery and AI Assistants

NVIDIA brings agents to life with DGX Spark and Reachy Mini

Welcome EmbeddingGemma, Google's new efficient embedding model

Sentence Transformers is joining Hugging Face!

Fine-Tune Whisper For Multilingual ASR with 🤗 Transformers

🇨🇿 BenCzechMark - Can your LLM Understand Czech?

Training and Finetuning Embedding Models with Sentence Transformers