zELO: ELO-inspired Training Method for Rerankers and Embedding Models Paper • 2509.12541 • Published Sep 16, 2025 • 9
Article **ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models?** • 22 days ago • 18
Judging What We Cannot Solve: A Consequence-Based Approach for Oracle-Free Evaluation of Research-Level Math Paper • 2602.06291 • Published Feb 6 • 23
Embedding Model Datasets Collection A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 70 items • Updated Dec 10, 2025 • 164
NanoBEIR datasets Collection These datasets are compatible with the (Sparse)NanoBEIREvaluator in Sentence Transformers v5.2+, and also with the CrossEncoderNanoBEIREvaluator if a bm25 column is present • 16 items • Updated 11 days ago • 14
Llama Nemoretriever Colembed: Top-Performing Text-Image Retrieval Model Paper • 2507.05513 • Published Jul 7, 2025 • 1
KoViDoRe Benchmark (BEIR) v2 Collection Korean Vision Document Retrieval Benchmark • 4 items • Updated 11 days ago • 5
Article Nano-BEIR: A Multilingual Information Retrieval Benchmark with Quality-Enhanced Queries • Dec 22, 2025 • 9
KaLM-Embedding: Superior Training Data Brings A Stronger Embedding Model Paper • 2501.01028 • Published Jan 2, 2025 • 19
Tarka Embed V1 Collection Efficient DFKD embeddings for language understanding • 5 items • Updated Dec 17, 2025 • 6
Black-Box On-Policy Distillation of Large Language Models Paper • 2511.10643 • Published Nov 13, 2025 • 52
Preserving Multilingual Quality While Tuning Query Encoder on English Only Paper • 2407.00923 • Published Jul 1, 2024 • 1
Llama-Embed-Nemotron-8B: A Universal Text Embedding Model for Multilingual and Cross-Lingual Tasks Paper • 2511.07025 • Published Nov 10, 2025 • 14