File size: 1,185 Bytes
0e526ea
 
 
 
 
 
 
 
 
2e15698
0e526ea
2e15698
0e526ea
2e15698
0e526ea
 
 
 
 
 
2e15698
0e526ea
2e15698
0e526ea
 
2e15698
0e526ea
2e15698
0e526ea
 
 
2e15698
0e526ea
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
---
title: QMD Web Demo
emoji: ๐Ÿ”
colorFrom: blue
colorTo: green
sdk: static
pinned: false
license: mit
---

# QMD Web Demo

In-browser hybrid search pipeline using WebGPU + Transformers.js v4.

Demonstrates the full QMD search pipeline running entirely in your browser:
1. **Query Expansion** โ€” Qwen3 1.7B generates HyDE, semantic, and keyword variants
2. **Parallel Search** โ€” BM25 keyword search + vector similarity search
3. **Reciprocal Rank Fusion** โ€” Merges results from multiple search backends
4. **LLM Reranking** โ€” Qwen3 Reranker 0.6B scores document relevance
5. **Score Blending** โ€” Position-aware combination of RRF and reranker scores

## Requirements

- Chrome 113+ or Edge 113+ (WebGPU required)
- ~2.5GB model download on first visit (cached for subsequent visits)

## Models

- [embeddinggemma-300M](https://huggingface.co/onnx-community/embeddinggemma-300m-ONNX) โ€” Embeddings
- [Qwen3-Reranker-0.6B](https://huggingface.co/onnx-community/Qwen3-Reranker-0.6B-ONNX) โ€” Reranking
- [qmd-query-expansion-1.7B](https://huggingface.co/shreyask/qmd-query-expansion-1.7B-ONNX) โ€” Query expansion

Based on [QMD](https://github.com/tobi/qmd) by Tobi Lรผtke.