Commit History

remove print statements now that the join is working.
283cc2c

wbrooks commited on

updated the precomputed TF-IDF dataframe to not include the base path to the docs folder
8c00443

wbrooks commited on

fixing indentation format
f8124c9

wbrooks commited on

added debug print statements to check why we get no results
0f35b1a

wbrooks commited on

added debug print statements to check why we get no results
3e3c04a

wbrooks commited on

point to the actual location of the module files
5d5eab6

wbrooks commited on

use valid columns for the result
49257b2

wbrooks commited on

tracking the new block embeddings
f4c877a

wbrooks commited on

trying to remove old embeddings file from LFS
c393f3f

wbrooks commited on

look for column named 'file' not 'path'
a8b631f

wbrooks commited on

fixed an error where the polars chains weren't wrapped in parens
dc74120

wbrooks commited on

return results as HTML links
cf0e16f

wbrooks commited on

adding the search page back #3
a3ee5d1

wbrooks commited on

removed debugging messages now that search is working
4c7c106

wbrooks commited on

making sure all the correct arguments are there in the function calls
67412fc

wbrooks commited on

pass IDF DTM to the function factory
84c4c3d

wbrooks commited on

removed some cruft from app.py
102f0bb

wbrooks commited on

the factory has to actually return the function
35bf0ca

wbrooks commited on

corrected the name of the fasttext model binary
4842291

wbrooks commited on

send row names to be included in the results
2f234b8

wbrooks commited on

build the search functions on app launch, rather than per-query
88a01a5

wbrooks commited on

add the data files necessary for search to version control
87752fc

wbrooks commited on

switched to the new search functions using sentence-transformers
68fd999

wbrooks commited on

add sentence_transformers requirement
1b85d5f

wbrooks commited on

added search URL to the README
1aa0d22

wbrooks commited on

renamed modules that do the search
c1d8ce6

wbrooks commited on

changes TF-IDF search to cosine similarity from dot product
ab4ff40

wbrooks commited on

changes TF-IDF search to cosine similarity from dot product
1310186

wbrooks commited on

add the rank-combined column before using it
861d14f

wbrooks commited on

being more selective with the columns that print in the output
9fbd1cf

wbrooks commited on

need to used vectorized strip_prefix to modify a polars column
234c1f5

wbrooks commited on

shortening the name of files
547533f

wbrooks commited on

just return the dist of results directly
7e2a479

wbrooks commited on

trying to get a response
928dc40

wbrooks commited on

testing a simpler response
106e459

wbrooks commited on

return results as JSON
6f54f14

wbrooks commited on

render result to a jinja table
21d4134

wbrooks commited on

apparently need pyarrow for pandas to_html and the huggingface environment-builder is too stupid to install required packages
931423e

wbrooks commited on

working out how to format the result
9916b48

wbrooks commited on

add pandas to requirements
65a3f08

wbrooks commited on

trying to figure out why the last print failed
21b7815

wbrooks commited on

switch tfidf search to use file list saved by joblib
3facea3

wbrooks commited on

reshape with shape, not size
1cf6271

wbrooks commited on

reduce files list to 1-d
7b687d4

wbrooks commited on

convert output to string
881f70b

wbrooks commited on

testing separate query methods
c70ddc5

wbrooks commited on

trying to test app.py
35d7ab6

wbrooks commited on

trying to test app.py
a5caccb

wbrooks commited on

fixed a typo and added a test endpoint
f6d14bf

wbrooks commited on

use block_embeddings_df from the compressed serialized parquet file
68239c7

wbrooks commited on