Commit History

fix: max_tokens=2048 for book-sized segments
0c0cdda
verified

hugh007 commited on

fix: increase max_tokens to 2048 for long translations
c42eb37
verified

hugh007 commited on

fix: use pre-compiled llama-cpp-python wheel + model in image
109e74f
verified

hugh007 commited on

fix: bake GGUF model into Docker image during build
d16da1a
verified

hugh007 commited on

fix: use pre-compiled llama-server binary (zero compilation)
ef4cebf
verified

hugh007 commited on

fix: use ninja-build + CMAKE_ARGS for llama-cpp-python build
272fa57
verified

hugh007 commited on

fix: add cmake+build-essential for llama-cpp
3324c59

hezu59158 commited on

fix: add libgomp1 for llama-cpp runtime
71aca5e

hezu59158 commited on

add README.md
e1ee69d

hezu59158 commited on

opt: use pre-built wheel for faster build
d671a55

hezu59158 commited on

feat: use Q6_K for better quality
8f6eed7

hezu59158 commited on

init
228c92c

hezu59158 commited on