feat: enable GPU acceleration for Bielik GGUF models 7c2f84b Patryk Studzinski committed on 12 days ago
add GBNF grammar for car advertisement gap filling; update LlamaCppModel to support loading grammar from file c14ac43 Patryk Studzinski committed on Dec 29, 2025
add GBNF grammar utilities for structured LLM output; integrate grammar in model generation 329abd1 Patryk Studzinski committed on Dec 29, 2025
update LlamaCppModel initialization parameters and enable verbose logging for model loading; update llama-cpp-python requirement fb1531e Patryk Studzinski committed on Dec 29, 2025
enhance error handling in LlamaCppModel initialization; include full traceback on failure cdff838 Patryk Studzinski committed on Dec 29, 2025
add get_info method to return model details for /models endpoint baa08b7 Patryk Studzinski committed on Dec 29, 2025
add debug logging for batch infill and model generation processes; update bielik model configuration 9d2cc15 Patryk Studzinski committed on Dec 29, 2025
increase context size and improve message handling in LlamaCppModel db4996d Patryk Studzinski committed on Dec 29, 2025