Commit History

fix(tts): remove tokenizer_tts from app.state
431435a

CrazyMonkey0 commited on

feat(tts): switch TTS model from mms-tts-eng to Kokoro-82M
d6cde26

CrazyMonkey0 commited on

docs(readme): add Faster Whisper usage and base model attribution
9ef2068

CrazyMonkey0 commited on

feat(nlp): change nlp model to Qwen/Qwen2.5-1.5B-Instruct
4d18a16

CrazyMonkey0 commited on

feat(nlp): change nlp model to microsoft/Phi-3.5-mini-instruct
f854f33

CrazyMonkey0 commited on

feat(asr): add local ASR endpoint using faster-whisper with async lock
0ee88ba

CrazyMonkey0 commited on

fix(asr): fix loading model
0d30253

CrazyMonkey0 commited on

feat(asr): replace Whisper HF with faster-whisper for CPU-friendly transcription
c84acef

CrazyMonkey0 commited on

feat(api): add sending generated audio to external backend
20a7446

CrazyMonkey0 commited on

test(tts): checking whether the tts model is working correctly
65952f6

CrazyMonkey0 commited on

feat(nlp): switch NLP model to Qwen2.5-0.5B-Instruct
df63d34

CrazyMonkey0 commited on

fix(nlp): remove options n_threads=os.cpu_count() in load_model_nlp
0bdf4f1

CrazyMonkey0 commited on

feat(nlp): change version to qwen2.5-1.5b-instruct-q3_k_m.gguf
dc74289

CrazyMonkey0 commited on

test(nlp): comment part of tts in nlp.py
018ff86

CrazyMonkey0 commited on

fix: resolve TypeError in TTS audio generation and optimize model performance
e288dcc

CrazyMonkey0 commited on

fix(tts): swapping the model and tokenizer in the return function of load_models_tts
4784a54

CrazyMonkey0 commited on

fix: shorten short_description to comply with Hugging Face
5e88d56

CrazyMonkey0 commited on

fix: optimize FastAPI + Qwen2.5-1.5B for CPU, reduce max_tokens, increase timeout
88284a4

CrazyMonkey0 commited on

docs(readme): update README with current models and license attributions
8d6c391

CrazyMonkey0 commited on

feat(nlp): switch Qwen2.5 model to 1.5B GGUF q6_k version
5d51d0f

CrazyMonkey0 commited on

fix(asr): load audio from in-memory buffer instead of disk
9ea2744

CrazyMonkey0 commited on

fix(chat): use llm() directly instead of create_chat_completion
3ad9eac

CrazyMonkey0 commited on

feat(chat): return NLP response with in-memory TTS audio
245cf59

CrazyMonkey0 commited on

feat(tts): migrate Kokoro TTS to Hugging Face facebook/mms-tts-eng with in-memory optimization
2a3f624

CrazyMonkey0 commited on

fix(nlp): Adding chat_handler for handling the Qwen2.5-3B-Instruct-GGUF model in llama-cpp-python
5f3ceca

CrazyMonkey0 commited on

fix(nlp):add ',' to fix an error in response generation
2d6bfd5

CrazyMonkey0 commited on

perf: implement lazy loading to fix startup timeouts
8f110eb

CrazyMonkey0 commited on

fix(requirements): remove nvidia dependencies
145a157

CrazyMonkey0 commited on

fix: resolve model loading and state management issues
bf1dc5f

CrazyMonkey0 commited on

build: use custom Docker base image from Docker Hub
fc8b522

CrazyMonkey0 commited on

feat(docker): install llama-cpp-python directly with OpenBLAS
5adcb69

CrazyMonkey0 commited on

fix(requirements): update requirements
3768374

CrazyMonkey0 commited on

fix(docker): Add python alpine for llama-cpp-python
0442de3

CrazyMonkey0 commited on

fix(requirements): adapting packages for Python 3.13
f42f941

CrazyMonkey0 commited on

fix(docker): switch version python to 3.12
482015f

CrazyMonkey0 commited on

fix(docker): switch to prebuilt llama-cpp-python image on Debian
33c584f

CrazyMonkey0 commited on

fix(docker): repair version dependecies
5d2827d

CrazyMonkey0 commited on

fix(docker): adding a permanent version of llama
06eacd3

CrazyMonkey0 commited on

fix(docker): replacing python:3.12-slim with python:3.12-alpine
ad141ba

CrazyMonkey0 commited on

fix(docker): add --prefer-binary
347b2fa

CrazyMonkey0 commited on

fix(docker): add --extra-index-url https://abetlen.github.io/llama-cpp-python/whl/cpu
ef2c272

CrazyMonkey0 commited on

fix(docker): replacing python:3.12-alpine with python:3.12-slim
5c256f5

CrazyMonkey0 commited on

fix(requirements): Remove CUDA dependencies, use CPU versions
6e062c5

CrazyMonkey0 commited on

fix(requirements): Change espeakng-loader version
1a76c5c

CrazyMonkey0 commited on

fix(docker): replacing apt-get with apk
38fe107

CrazyMonkey0 commited on

fix(docker): replacing python:3.12-slim with python:3.12-alpine
bf92c5c

CrazyMonkey0 commited on

fix(docker): replace llama-cpp-python extra index with pinned cp312 linux wheel
2a2937c

CrazyMonkey0 commited on

fix(docker): changing the option --only-binary=llama-cpp-python to --prefer-binary
8757f21

CrazyMonkey0 commited on

fix(docker): add build dependencies for llama-cpp-python
c0cf206

CrazyMonkey0 commited on

fix(docker): resolve llama-cpp-python module import error
fd4e818

CrazyMonkey0 commited on