Commit History

Refactor: Tách code thành cấu trúc module chuyên nghiệp
124d692

hainc commited on

Add audio autoplay event handler and fix layout issues
783808d

hainc commited on

Fix CSS syntax error and add JavaScript for audio autoplay
efba517

hainc commited on

Fix UI layout issues and ensure audio autoplay works correctly
b3198f7

hainc commited on

Professional UI redesign: modern glassmorphism, improved typography, animations, better spacing
7b82f59

hainc commited on

Improve Colab notebook: add environment check, better error handling, troubleshooting guide
2445d45

hainc commited on

Fix STT and TTS: improve error handling, validate audio files, fix return values
d33d341

hainc commited on

Redesign UI: Modern chatbot conversation interface with bubbles, avatars, and better styling
ac42c36

hainc commited on

Add processing time display, improve STT accuracy, enhance TTS
4a2ddc9

hainc commited on

Add Talk-to-Talk mode: automatic voice conversation loop
67deb9a

hainc commited on

Final TTS improvements: better error handling
8fcdad1

hainc commited on

Improve TTS: add error handling, ensure auto-read works
47b93d1

hainc commited on

Enable auto TTS: AI automatically reads all responses out loud
751145b

hainc commited on

Fix: use full 40GB GPU RAM, optimize memory allocation
1c14e77

hainc commited on

Optimize for Colab: disable torch.compile on T4, simplify generation, use full 40GB GPU RAM
f635bf2

hainc commited on

Update Colab notebook: add faster-whisper, flash-attn, optimum dependencies
95e6eb6

hainc commited on

Add SDPA to load_kwargs for faster attention
be73c37

hainc commited on

Ultra-fast LLM optimization: SDPA, torch.compile reduce-overhead, pre-warm, half precision inputs, inference_mode
4405b37

hainc commited on

Upgrade to faster-whisper + optimize for max speed: torch.compile, flash attention, full GPU utilization
4a637b4

hainc commited on

Optimize for full performance mode - remove all limits, max speed STT/LLM
5e936dd

hainc commited on

Update print message to show correct model name
caf68c9

hainc commited on

Update to use bank-1.7 model (Qwen3-VLTO-1.7B-Instruct)
797ebf7

hainc commited on

Fix Qwen3VL generation config to prevent gibberish output - add repetition penalty and better temperature control
4bf8df6

hainc commited on

Fix global variables issue with @GPU decorator - separate internal load function
fa7bc76

hainc commited on

Add @GPU decorator for Hugging Face Spaces and fix torch_dtype deprecation
faa8bf8

hainc commited on

Fix Qwen3VL model loading - use Qwen3VLForConditionalGeneration
0a822f6

hainc commited on

Update app to use bank-model-2b with voice chat features
917e363

hainc commited on

Duplicate from gradio-templates/chatbot
a0ff5a4
verified

hainguyen306201 pngwn HF Staff commited on