File size: 1,185 Bytes
0e526ea 2e15698 0e526ea 2e15698 0e526ea 2e15698 0e526ea 2e15698 0e526ea 2e15698 0e526ea 2e15698 0e526ea 2e15698 0e526ea 2e15698 0e526ea | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 | ---
title: QMD Web Demo
emoji: ๐
colorFrom: blue
colorTo: green
sdk: static
pinned: false
license: mit
---
# QMD Web Demo
In-browser hybrid search pipeline using WebGPU + Transformers.js v4.
Demonstrates the full QMD search pipeline running entirely in your browser:
1. **Query Expansion** โ Qwen3 1.7B generates HyDE, semantic, and keyword variants
2. **Parallel Search** โ BM25 keyword search + vector similarity search
3. **Reciprocal Rank Fusion** โ Merges results from multiple search backends
4. **LLM Reranking** โ Qwen3 Reranker 0.6B scores document relevance
5. **Score Blending** โ Position-aware combination of RRF and reranker scores
## Requirements
- Chrome 113+ or Edge 113+ (WebGPU required)
- ~2.5GB model download on first visit (cached for subsequent visits)
## Models
- [embeddinggemma-300M](https://huggingface.co/onnx-community/embeddinggemma-300m-ONNX) โ Embeddings
- [Qwen3-Reranker-0.6B](https://huggingface.co/onnx-community/Qwen3-Reranker-0.6B-ONNX) โ Reranking
- [qmd-query-expansion-1.7B](https://huggingface.co/shreyask/qmd-query-expansion-1.7B-ONNX) โ Query expansion
Based on [QMD](https://github.com/tobi/qmd) by Tobi Lรผtke.
|