[Announce] polars-luxical package

by permutans - opened Dec 18, 2025

Dec 18, 2025

•

edited Dec 18, 2025

I made a Polars extension to use the model: https://github.com/lmmx/polars-luxical on PyPI at https://pypi.org/project/polars-luxical/

It’s similar in spirit to polars-fastembed (a Polars extension wrapping the fastembed Rust crate) and by running the same benchmark from that repo it can be shown to be the fastest available model as far as I can see:

polars-luxical embeds the 708 Python PEPs at 0.5ms per 1k tokens on CPU, or 1.8s total runtime [including Python interpreter startup etc]
- for comparison Snowflake Arctic Embed XS on GPU is 3.5ms/1kT and All-Mini-LM-V6 is 8ms/1kT

Because the model is not used for search (but can be used for deduplication) I added a ‘half match demo’ script to the benchmark subdir, on which it achieves about 97% accuracy at matching Python PEP halves, similar to the experiment described in the blog post.

lukemerrick

DatologyAI org Feb 20

Wow, I totally missed seeing this announcement over the holidays! This is super cool to see integrated cleanly into polars workflows, and I really like how you kicked the tires by testing on different data (the Python PEPs).

lukemerrick

DatologyAI org Feb 20

Thank you for sharing!

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment