--- title: QMD Web Demo emoji: πŸ” colorFrom: blue colorTo: green sdk: static pinned: false license: mit --- # QMD Web Demo In-browser hybrid search pipeline using WebGPU + Transformers.js v4. Demonstrates the full QMD search pipeline running entirely in your browser: 1. **Query Expansion** β€” Qwen3 1.7B generates HyDE, semantic, and keyword variants 2. **Parallel Search** β€” BM25 keyword search + vector similarity search 3. **Reciprocal Rank Fusion** β€” Merges results from multiple search backends 4. **LLM Reranking** β€” Qwen3 Reranker 0.6B scores document relevance 5. **Score Blending** β€” Position-aware combination of RRF and reranker scores ## Requirements - Chrome 113+ or Edge 113+ (WebGPU required) - ~2.5GB model download on first visit (cached for subsequent visits) ## Models - [embeddinggemma-300M](https://huggingface.co/onnx-community/embeddinggemma-300m-ONNX) β€” Embeddings - [Qwen3-Reranker-0.6B](https://huggingface.co/onnx-community/Qwen3-Reranker-0.6B-ONNX) β€” Reranking - [qmd-query-expansion-1.7B](https://huggingface.co/shreyask/qmd-query-expansion-1.7B-ONNX) β€” Query expansion Based on [QMD](https://github.com/tobi/qmd) by Tobi LΓΌtke.