feat(tts): switch TTS model from mms-tts-eng to Kokoro-82M d6cde26 CrazyMonkey0 committed on 3 days ago
docs(readme): add Faster Whisper usage and base model attribution 9ef2068 CrazyMonkey0 committed on Dec 28, 2025
feat(nlp): change nlp model to Qwen/Qwen2.5-1.5B-Instruct 4d18a16 CrazyMonkey0 committed on Dec 28, 2025
feat(nlp): change nlp model to microsoft/Phi-3.5-mini-instruct f854f33 CrazyMonkey0 committed on Dec 28, 2025
feat(asr): add local ASR endpoint using faster-whisper with async lock 0ee88ba CrazyMonkey0 committed on Dec 18, 2025
feat(asr): replace Whisper HF with faster-whisper for CPU-friendly transcription c84acef CrazyMonkey0 committed on Dec 18, 2025
feat(api): add sending generated audio to external backend 20a7446 CrazyMonkey0 committed on Dec 16, 2025
test(tts): checking whether the tts model is working correctly 65952f6 CrazyMonkey0 committed on Dec 16, 2025
fix(nlp): remove options n_threads=os.cpu_count() in load_model_nlp 0bdf4f1 CrazyMonkey0 committed on Dec 16, 2025
feat(nlp): change version to qwen2.5-1.5b-instruct-q3_k_m.gguf dc74289 CrazyMonkey0 committed on Dec 16, 2025
fix: resolve TypeError in TTS audio generation and optimize model performance e288dcc CrazyMonkey0 committed on Dec 16, 2025
fix(tts): swapping the model and tokenizer in the return function of load_models_tts 4784a54 CrazyMonkey0 committed on Dec 16, 2025
fix: shorten short_description to comply with Hugging Face 5e88d56 CrazyMonkey0 committed on Dec 16, 2025
fix: optimize FastAPI + Qwen2.5-1.5B for CPU, reduce max_tokens, increase timeout 88284a4 CrazyMonkey0 committed on Dec 16, 2025
docs(readme): update README with current models and license attributions 8d6c391 CrazyMonkey0 committed on Dec 16, 2025
feat(nlp): switch Qwen2.5 model to 1.5B GGUF q6_k version 5d51d0f CrazyMonkey0 committed on Dec 16, 2025
fix(asr): load audio from in-memory buffer instead of disk 9ea2744 CrazyMonkey0 committed on Dec 16, 2025
fix(chat): use llm() directly instead of create_chat_completion 3ad9eac CrazyMonkey0 committed on Dec 16, 2025
feat(chat): return NLP response with in-memory TTS audio 245cf59 CrazyMonkey0 committed on Dec 16, 2025
feat(tts): migrate Kokoro TTS to Hugging Face facebook/mms-tts-eng with in-memory optimization 2a3f624 CrazyMonkey0 committed on Dec 16, 2025
fix(nlp): Adding chat_handler for handling the Qwen2.5-3B-Instruct-GGUF model in llama-cpp-python 5f3ceca CrazyMonkey0 committed on Dec 16, 2025
fix(nlp):add ',' to fix an error in response generation 2d6bfd5 CrazyMonkey0 committed on Dec 15, 2025
feat(docker): install llama-cpp-python directly with OpenBLAS 5adcb69 CrazyMonkey0 committed on Dec 15, 2025
fix(docker): switch to prebuilt llama-cpp-python image on Debian 33c584f CrazyMonkey0 committed on Dec 15, 2025
fix(docker): replacing python:3.12-slim with python:3.12-alpine ad141ba CrazyMonkey0 committed on Dec 15, 2025
fix(docker): add --extra-index-url https://abetlen.github.io/llama-cpp-python/whl/cpu ef2c272 CrazyMonkey0 committed on Dec 15, 2025
fix(docker): replacing python:3.12-alpine with python:3.12-slim 5c256f5 CrazyMonkey0 committed on Dec 15, 2025
fix(requirements): Remove CUDA dependencies, use CPU versions 6e062c5 CrazyMonkey0 committed on Dec 15, 2025
fix(docker): replacing python:3.12-slim with python:3.12-alpine bf92c5c CrazyMonkey0 committed on Dec 15, 2025
fix(docker): replace llama-cpp-python extra index with pinned cp312 linux wheel 2a2937c CrazyMonkey0 committed on Dec 15, 2025
fix(docker): changing the option --only-binary=llama-cpp-python to --prefer-binary 8757f21 CrazyMonkey0 committed on Dec 15, 2025
fix(docker): add build dependencies for llama-cpp-python c0cf206 CrazyMonkey0 committed on Dec 14, 2025
fix(docker): resolve llama-cpp-python module import error fd4e818 CrazyMonkey0 committed on Dec 14, 2025