Compressing txtai embeddings

by leok7v - opened 28 days ago

Compressing txtai embeddings index for the English edition of Wikipedia
Just FYI:
https://github.com/leok7v/wiki-slugs
nothing special but may help agents on a very small systems.

davidmezzetti

NeuML org 27 days ago

Very interesting, thank you for sharing!

leok7v

27 days ago

I wonder if we can do similar thing to the sanitized (utf-8 text no markup) text of Wikipedia articles themselves)… I do not have compute resources for that but if you can do same as slugs for the e.g. executive summaries of the articles we can try hyper plane 1 bit compression on them too. May make reasoning on RAG for super small models super interesting…

davidmezzetti

NeuML org 26 days ago

Interesting. I'll think about this for the next update.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment