| title: QMD Web Demo | |
| emoji: π | |
| colorFrom: blue | |
| colorTo: green | |
| sdk: static | |
| pinned: false | |
| license: mit | |
| # QMD Web Demo | |
| In-browser hybrid search pipeline using WebGPU + Transformers.js v4. | |
| Demonstrates the full QMD search pipeline running entirely in your browser: | |
| 1. **Query Expansion** β Qwen3 1.7B generates HyDE, semantic, and keyword variants | |
| 2. **Parallel Search** β BM25 keyword search + vector similarity search | |
| 3. **Reciprocal Rank Fusion** β Merges results from multiple search backends | |
| 4. **LLM Reranking** β Qwen3 Reranker 0.6B scores document relevance | |
| 5. **Score Blending** β Position-aware combination of RRF and reranker scores | |
| ## Requirements | |
| - Chrome 113+ or Edge 113+ (WebGPU required) | |
| - ~2.5GB model download on first visit (cached for subsequent visits) | |
| ## Models | |
| - [embeddinggemma-300M](https://huggingface.co/onnx-community/embeddinggemma-300m-ONNX) β Embeddings | |
| - [Qwen3-Reranker-0.6B](https://huggingface.co/onnx-community/Qwen3-Reranker-0.6B-ONNX) β Reranking | |
| - [qmd-query-expansion-1.7B](https://huggingface.co/shreyask/qmd-query-expansion-1.7B-ONNX) β Query expansion | |
| Based on [QMD](https://github.com/tobi/qmd) by Tobi LΓΌtke. | |