PipeOwl
Collection
A transformer-free semantic retrieval engine. • 13 items • Updated
A transformer-free semantic retrieval engine.
Features:
| item | value |
|---|---|
| token size | 524190 |
| embedding dim | 256 |
| storage format | safetensors (FP16) |
| data size | ~267 MB |
| languages | bilingual |
| startup time | ~478 ms |
| query latency | ~25-34 ms |
git clone https://huggingface.co/WangKaiLin/PipeOwl-1.11-bilingual
cd PipeOwl-1.11-bilingual
pip install numpy safetensors
python quickstart.py
Example semantic retrieval results:
請輸入句子: 確實
Top-K Tokens:
1.000 | 確實
0.877 | 的確
0.839 | 确实
0.772 | 的确
0.757 | 事實上
請輸入句子: 今天好想睡覺
Top-K Tokens:
0.761 | 今天
0.747 | 今天的
0.694 | 睡觉
0.693 | 刚才
0.685 | 现在
請輸入句子: i want to sleep
Top-K Tokens:
0.719 | sleep
0.663 | schlafen
0.638 | want
0.616 | sleeping
0.616 | tidur
請輸入句子: 哈囉你好阿
Top-K Tokens:
0.825 | 哈囉
0.818 | 你好
0.769 | 嘿
0.759 | 嗨
0.750 | Kaixo
PipeOwl-1.11-bilingual/
├ README.md
├ config.json
├ LICENSE
├ quickstart.py
├ engine.py
├ tokenizer.json
└ pipeowl.safetensors
PipeOwl can be combined with the RAG pipeline from PipeOwl-1.10.2-tw-wiki-rag.
You can reuse the wiki retrieval layer and directly plug it on top of PipeOwl-1.11 embeddings.
PipeOwl provides fast semantic token retrieval
RAG layer provides document-level grounding
Replace the embedding backend with PipeOwl-1.11
Use PipeOwl output tokens as query expansion
MIT