CarsRUS / rag_engine.py

Commit History

CarsRUS: session car history โ€“ stick to discussed models on follow-up
cdf8db8

galbendavids commited on

CarsRUS: Link & Co 01 normalization (and/ืœื™ื ืง&ืงื•), tests
b262f99

galbendavids commited on

RAG: better normalization for Genesis GV80 + Link & Co Hebrew; add test_chat_scenarios manual script
3204a1c

galbendavids commited on

OpenRouter: default model google/gemini-3-flash-preview (fix 404, gemini-2.0-flash-exp:free deprecated)
bc4e61d

galbendavids commited on

Generation: use only OpenRouter when OPENROUTER_API_KEY set (no Gemini fallback)
2aea9f9

galbendavids commited on

OpenRouter: log when key missing; try more env names; doc HF secret OPENROUTER_API_KEY
126332e

galbendavids commited on

OpenRouter first (faster), Gemini fallback; conversation history Q+A; optional .env
b609489

galbendavids commited on

Pipeline logging + show LLM response in chat
2fa5774

galbendavids commited on

Rate limit: longer retries (8 attempts, 3 min wait); show as 'ื‘ืขื™ื” ื–ืžื ื™ืช' not 'ื”ืชืฉื•ื‘ื”'
58029e3

galbendavids commited on

Present aggregated verbal answer first in chat; strengthen RAG prompt
fba06c0

galbendavids commited on

UI/UX: RTL layout, dark mode, stable header logo, chat/input RTL, scrollbar, focus states; agent: retrieveโ†’generateโ†’end, final answer separator
2ebebf3

galbendavids commited on

Fix: Hebrew Elantra N detection, final answer display, stronger prompts; UI redesign; Hebrew comparison test
75c53f5

galbendavids commited on

RAG: comparison by supported models, partial answer for 1 known model, stronger prompts for context aggregation
de2bc35

galbendavids commited on

agentic rag update
37bbf25
verified

galbendavids commited on

fixing timeout bug while waiting for gemini response
0b17d83
verified

galbendavids commited on

create vector offline
7f608f2
verified

galbendavids commited on

faster rรฉponse
80e3e46
verified

galbendavids commited on

flash models for faster response
947082a
verified

galbendavids commited on

rate limit fix
0bc4550
verified

galbendavids commited on

optimize rag flow
a98f7cb
verified

galbendavids commited on

upload files
aaa458a
verified

galbendavids commited on

๐Ÿ”ง Fix: Smarter rate limit handling - wait 30s on 429 errors instead of switching models
ca1b20d

galbendavids commited on

โœ… Improve: Increase minimum delay to 3s + more aggressive exponential backoff (3^n) for rate limit handling
173fe99

galbendavids commited on

โœ… Fix: Add rate limiting with exponential backoff + response caching to prevent API quota errors
fe3cfdf

galbendavids commited on

โœ… Fix: Update to correct Gemini API models (gemini-2.0-flash, gemini-1.5-flash, gemini-1.5-pro) + Clean minimal chat UI
111c7c5

galbendavids commited on

๐Ÿ”ง Fix: Dark chat UI + correct Gemini API model names
c1098a1

galbendavids commited on

๐ŸŽจ Simplify UI to clean professional design + ๐Ÿ› Add better error handling
9007d17

galbendavids commited on

โœจ Implement 10 Golden Rules for RAG
51e2e17

galbendavids commited on

Fix API 404 (upgrade lib) and UI Readability (High Contrast)
e1b2ed3

galbendavids commited on

Fix crashes: Pin numpy<2, remove invalid Gradio args, fix paths
6ab8b6b

galbendavids commited on

Refactor: Move scraping logic to data_ingestion folder
dd2d8f6

galbendavids commited on

Complete Automotive RAG Chatbot implementation
1f92167

galbendavids commited on