Commits · galbendavids/CarsRUS

CarsRUS: session car history – stick to discussed models on follow-up

cdf8db8

galbendavids commited on 14 days ago

CarsRUS: Link & Co 01 normalization (and/לינק&קו), tests

b262f99

galbendavids commited on 14 days ago

RAG: better normalization for Genesis GV80 + Link & Co Hebrew; add test_chat_scenarios manual script

3204a1c

galbendavids commited on 14 days ago

OpenRouter: default model google/gemini-3-flash-preview (fix 404, gemini-2.0-flash-exp:free deprecated)

bc4e61d

galbendavids commited on 15 days ago

Generation: use only OpenRouter when OPENROUTER_API_KEY set (no Gemini fallback)

2aea9f9

galbendavids commited on 15 days ago

OpenRouter: log when key missing; try more env names; doc HF secret OPENROUTER_API_KEY

126332e

galbendavids commited on 15 days ago

OpenRouter first (faster), Gemini fallback; conversation history Q+A; optional .env

b609489

galbendavids commited on 15 days ago

Pipeline logging + show LLM response in chat

2fa5774

galbendavids commited on 15 days ago

Rate limit: longer retries (8 attempts, 3 min wait); show as 'בעיה זמנית' not 'התשובה'

58029e3

galbendavids commited on 16 days ago

Present aggregated verbal answer first in chat; strengthen RAG prompt

fba06c0

galbendavids commited on 16 days ago

UI/UX: RTL layout, dark mode, stable header logo, chat/input RTL, scrollbar, focus states; agent: retrieve→generate→end, final answer separator

2ebebf3

galbendavids commited on 16 days ago

Fix: Hebrew Elantra N detection, final answer display, stronger prompts; UI redesign; Hebrew comparison test

75c53f5

galbendavids commited on 16 days ago

RAG: comparison by supported models, partial answer for 1 known model, stronger prompts for context aggregation

de2bc35

galbendavids commited on 16 days ago

agentic rag update

37bbf25
verified

galbendavids commited on 18 days ago

fixing timeout bug while waiting for gemini response

0b17d83
verified

galbendavids commited on 18 days ago

create vector offline

7f608f2
verified

galbendavids commited on 18 days ago

faster réponse

80e3e46
verified

galbendavids commited on 19 days ago

flash models for faster response

947082a
verified

galbendavids commited on 19 days ago

rate limit fix

0bc4550
verified

galbendavids commited on 19 days ago

optimize rag flow

a98f7cb
verified

galbendavids commited on 19 days ago

fix bug

da458f9
verified

galbendavids commited on 19 days ago

bug

697b33e
verified

galbendavids commited on 19 days ago

upload files

aaa458a
verified

galbendavids commited on 19 days ago

update

01310ab
verified

galbendavids commited on 19 days ago

🔧 Fix: Smarter rate limit handling - wait 30s on 429 errors instead of switching models

ca1b20d

galbendavids commited on 19 days ago

✅ Improve: Increase minimum delay to 3s + more aggressive exponential backoff (3^n) for rate limit handling

173fe99

galbendavids commited on 19 days ago

✅ Fix: Add rate limiting with exponential backoff + response caching to prevent API quota errors

fe3cfdf

galbendavids commited on 19 days ago

✅ Fix: Update to correct Gemini API models (gemini-2.0-flash, gemini-1.5-flash, gemini-1.5-pro) + Clean minimal chat UI

111c7c5

galbendavids commited on 19 days ago

🔧 Fix: Dark chat UI + correct Gemini API model names

c1098a1

galbendavids commited on 19 days ago

🎨 Simplify UI to clean professional design + 🐛 Add better error handling

9007d17

galbendavids commited on 19 days ago

✨ Implement 10 Golden Rules for RAG

51e2e17

galbendavids commited on 19 days ago

Fix API 404 (upgrade lib) and UI Readability (High Contrast)

e1b2ed3

galbendavids commited on 20 days ago

Fix crashes: Pin numpy<2, remove invalid Gradio args, fix paths

6ab8b6b

galbendavids commited on 20 days ago

Refactor: Move scraping logic to data_ingestion folder

dd2d8f6

galbendavids commited on 20 days ago

Complete Automotive RAG Chatbot implementation

1f92167

galbendavids commited on 20 days ago

Spaces:

galbendavids
/

CarsRUS

Sleeping

Commit History

CarsRUS: session car history – stick to discussed models on follow-up

cdf8db8

CarsRUS: Link & Co 01 normalization (and/לינק&קו), tests

b262f99

RAG: better normalization for Genesis GV80 + Link & Co Hebrew; add test_chat_scenarios manual script

3204a1c

OpenRouter: default model google/gemini-3-flash-preview (fix 404, gemini-2.0-flash-exp:free deprecated)

bc4e61d

Generation: use only OpenRouter when OPENROUTER_API_KEY set (no Gemini fallback)

2aea9f9

OpenRouter: log when key missing; try more env names; doc HF secret OPENROUTER_API_KEY

126332e

OpenRouter first (faster), Gemini fallback; conversation history Q+A; optional .env

b609489

Pipeline logging + show LLM response in chat

2fa5774

Rate limit: longer retries (8 attempts, 3 min wait); show as 'בעיה זמנית' not 'התשובה'

58029e3

Present aggregated verbal answer first in chat; strengthen RAG prompt

fba06c0

UI/UX: RTL layout, dark mode, stable header logo, chat/input RTL, scrollbar, focus states; agent: retrieve→generate→end, final answer separator

2ebebf3

Fix: Hebrew Elantra N detection, final answer display, stronger prompts; UI redesign; Hebrew comparison test

75c53f5

RAG: comparison by supported models, partial answer for 1 known model, stronger prompts for context aggregation

de2bc35

agentic rag update

37bbf25
verified

fixing timeout bug while waiting for gemini response

0b17d83
verified

create vector offline

7f608f2
verified

faster réponse

80e3e46
verified

flash models for faster response

947082a
verified

rate limit fix

0bc4550
verified

optimize rag flow

a98f7cb
verified

fix bug

da458f9
verified

bug

697b33e
verified

upload files

aaa458a
verified

update

01310ab
verified

🔧 Fix: Smarter rate limit handling - wait 30s on 429 errors instead of switching models

ca1b20d

✅ Improve: Increase minimum delay to 3s + more aggressive exponential backoff (3^n) for rate limit handling

173fe99

✅ Fix: Add rate limiting with exponential backoff + response caching to prevent API quota errors

fe3cfdf

✅ Fix: Update to correct Gemini API models (gemini-2.0-flash, gemini-1.5-flash, gemini-1.5-pro) + Clean minimal chat UI

111c7c5

🔧 Fix: Dark chat UI + correct Gemini API model names

c1098a1

🎨 Simplify UI to clean professional design + 🐛 Add better error handling

9007d17

✨ Implement 10 Golden Rules for RAG

51e2e17

Fix API 404 (upgrade lib) and UI Readability (High Contrast)

e1b2ed3

Fix crashes: Pin numpy<2, remove invalid Gradio args, fix paths

6ab8b6b

Refactor: Move scraping logic to data_ingestion folder

dd2d8f6

Complete Automotive RAG Chatbot implementation

1f92167

Commit History

CarsRUS: session car history – stick to discussed models on follow-up cdf8db8

CarsRUS: Link & Co 01 normalization (and/לינק&קו), tests b262f99

RAG: better normalization for Genesis GV80 + Link & Co Hebrew; add test_chat_scenarios manual script 3204a1c

OpenRouter: default model google/gemini-3-flash-preview (fix 404, gemini-2.0-flash-exp:free deprecated) bc4e61d

Generation: use only OpenRouter when OPENROUTER_API_KEY set (no Gemini fallback) 2aea9f9

OpenRouter: log when key missing; try more env names; doc HF secret OPENROUTER_API_KEY 126332e

OpenRouter first (faster), Gemini fallback; conversation history Q+A; optional .env b609489

Pipeline logging + show LLM response in chat 2fa5774

Rate limit: longer retries (8 attempts, 3 min wait); show as 'בעיה זמנית' not 'התשובה' 58029e3

Present aggregated verbal answer first in chat; strengthen RAG prompt fba06c0

UI/UX: RTL layout, dark mode, stable header logo, chat/input RTL, scrollbar, focus states; agent: retrieve→generate→end, final answer separator 2ebebf3

Fix: Hebrew Elantra N detection, final answer display, stronger prompts; UI redesign; Hebrew comparison test 75c53f5

RAG: comparison by supported models, partial answer for 1 known model, stronger prompts for context aggregation de2bc35

agentic rag update 37bbf25 verified

fixing timeout bug while waiting for gemini response 0b17d83 verified

create vector offline 7f608f2 verified

faster réponse 80e3e46 verified

flash models for faster response 947082a verified

rate limit fix 0bc4550 verified

optimize rag flow a98f7cb verified

fix bug da458f9 verified

bug 697b33e verified

upload files aaa458a verified

update 01310ab verified

🔧 Fix: Smarter rate limit handling - wait 30s on 429 errors instead of switching models ca1b20d

✅ Improve: Increase minimum delay to 3s + more aggressive exponential backoff (3^n) for rate limit handling 173fe99

✅ Fix: Add rate limiting with exponential backoff + response caching to prevent API quota errors fe3cfdf

✅ Fix: Update to correct Gemini API models (gemini-2.0-flash, gemini-1.5-flash, gemini-1.5-pro) + Clean minimal chat UI 111c7c5

🔧 Fix: Dark chat UI + correct Gemini API model names c1098a1

🎨 Simplify UI to clean professional design + 🐛 Add better error handling 9007d17

✨ Implement 10 Golden Rules for RAG 51e2e17

Fix API 404 (upgrade lib) and UI Readability (High Contrast) e1b2ed3

Fix crashes: Pin numpy<2, remove invalid Gradio args, fix paths 6ab8b6b

Refactor: Move scraping logic to data_ingestion folder dd2d8f6

Complete Automotive RAG Chatbot implementation 1f92167

CarsRUS: session car history – stick to discussed models on follow-up

cdf8db8

CarsRUS: Link & Co 01 normalization (and/לינק&קו), tests

b262f99

RAG: better normalization for Genesis GV80 + Link & Co Hebrew; add test_chat_scenarios manual script

3204a1c

OpenRouter: default model google/gemini-3-flash-preview (fix 404, gemini-2.0-flash-exp:free deprecated)

bc4e61d

Generation: use only OpenRouter when OPENROUTER_API_KEY set (no Gemini fallback)

2aea9f9

OpenRouter: log when key missing; try more env names; doc HF secret OPENROUTER_API_KEY

126332e

OpenRouter first (faster), Gemini fallback; conversation history Q+A; optional .env

b609489

Pipeline logging + show LLM response in chat

2fa5774

Rate limit: longer retries (8 attempts, 3 min wait); show as 'בעיה זמנית' not 'התשובה'

58029e3

Present aggregated verbal answer first in chat; strengthen RAG prompt

fba06c0

UI/UX: RTL layout, dark mode, stable header logo, chat/input RTL, scrollbar, focus states; agent: retrieve→generate→end, final answer separator

2ebebf3

Fix: Hebrew Elantra N detection, final answer display, stronger prompts; UI redesign; Hebrew comparison test

75c53f5

RAG: comparison by supported models, partial answer for 1 known model, stronger prompts for context aggregation

de2bc35

agentic rag update

37bbf25
verified

fixing timeout bug while waiting for gemini response

0b17d83
verified

create vector offline

7f608f2
verified

faster réponse

80e3e46
verified

flash models for faster response

947082a
verified

rate limit fix

0bc4550
verified

optimize rag flow

a98f7cb
verified

fix bug

da458f9
verified

bug

697b33e
verified

upload files

aaa458a
verified

update

01310ab
verified

🔧 Fix: Smarter rate limit handling - wait 30s on 429 errors instead of switching models

ca1b20d

✅ Improve: Increase minimum delay to 3s + more aggressive exponential backoff (3^n) for rate limit handling

173fe99

✅ Fix: Add rate limiting with exponential backoff + response caching to prevent API quota errors

fe3cfdf

✅ Fix: Update to correct Gemini API models (gemini-2.0-flash, gemini-1.5-flash, gemini-1.5-pro) + Clean minimal chat UI

111c7c5

🔧 Fix: Dark chat UI + correct Gemini API model names

c1098a1

🎨 Simplify UI to clean professional design + 🐛 Add better error handling

9007d17

✨ Implement 10 Golden Rules for RAG

51e2e17

Fix API 404 (upgrade lib) and UI Readability (High Contrast)

e1b2ed3

Fix crashes: Pin numpy<2, remove invalid Gradio args, fix paths

6ab8b6b

Refactor: Move scraping logic to data_ingestion folder

dd2d8f6

Complete Automotive RAG Chatbot implementation

1f92167