Commit History
Fix OpenAI Embeddings Auth Issue (#1077) 01830ce unverified
[VertexAI] Add support for tools parameter (#1065) ef6c6c7 unverified
Use pino for logs (#1086) bb6f8cb unverified
set `sameSite` to `lax` when allowing insecure cookies (#1078) 925d513 unverified
Add `ALLOW_INSECURE_COOKIES` feature flag (#1076) 82d651c unverified
Use TEI space in prod (#1051) 1978a75 unverified
Support Gemini 1.5 Pro from Vertex AI (#1041) d6a02c6 unverified
Always dispose of pipeline for embeddings after use (#1048) b337e06 unverified
Dispose of embeddings pipeline occasionally to clear memory (#1047) 808c327 unverified
Kill app on tokenizer load fail (#1044) 4b62530 unverified
return empty if error on hfapi endpoint (#1038) 0523db0 unverified
Use inference API for embeddings in huggingchat prod (#1037) 30ce16c unverified
Switch task model and add max tokens limits (#1036) 42dd7d2 unverified
Add logs to tokenizer fetch (#1030) ac3fd05 unverified
Add langserve endpoint (#1009) a9e4746 unverified
Antonio Ramos antoniora commited on
Move default template so it doesn't override tokenizer (#987) 0598c3f unverified
Set ChatML as default chat prompt template (#985) 97f9784 unverified
quick fix error message (#977) 3c52b3c unverified
Add support for cohere endpoints (#976) 1aafdd3 unverified
Use jinja template for chat formatting (#730) (#744) 3d931f5 unverified
Implement Cloudflare Workers AI endpoint (#907) (#972) ac2d8ff unverified
Google Vertex API support (#950) 48f1340 unverified
[Assistants] trending feature (#938) dae4e82 unverified
[Websearch] change context schema (#944) 694dbc8 unverified
Mishig commited on
Count system prompt tokens (#850) eb2ef82 unverified
Enhance Dynamic User Attribute Handling in OIDC Integration (#885) 69907d3 unverified
Prepend domain filters in search query generation (#932) ea2cfd8 unverified
Show error when webpage cannot be reached or parsed (#930) 4e08933 unverified
Fix multi domains for assistants (#929) ebd1f7a unverified
Anthropic Endpoint Support (#923) 1db019c unverified
Liam Dyer commited on
Fix prompt caching on llama.cpp endpoints (#920) c0b8726 unverified
Add openai embeddings (#915) 3a3ecfd unverified
Include last chunk in websearch context (#912) 2de7e1a unverified
Make sure preprompt is set on open ai endpoint type (#913) 42a92e5 unverified
Add limits on API endpoints (#886) 5c7eef4 unverified
[Assistants] Filter on names (#841) c8e9fc0 unverified
Fix issue with "continue" feature on llama.cpp endpoints (#898) 2f242b9 unverified
Bug Fix: Json Decoder aggessively pulls json (#867) bde01fc unverified
Pass websearch to `preprocessMessages` (#876) 33a0f87 unverified
Add gemma model to prod config (#854) d3928c1 unverified
✨ Add stats on conversations (#828) a6fb72d unverified
[Mongo] Optimize `reports` collection query (v2) (#834) 1870c0a unverified
Mishig commited on
Revert "[Mongo] Optimize `reports` collection query (#832)" (#833) 00150a2 unverified
Mishig commited on
[Mongo] Optimize `reports` collection query (#832) 5a32e24 unverified
Mishig commited on