This is the heretic Q6 "prompt eval time = 169.57 ms / 5 tokens ( 33.91 ms per token, 29.49 tokens per second)
eval time = 23474.06 ms / 491 tokens ( 47.81 ms per token, 20.92 tokens per second)
total time = 23643.63 ms / 496 tokens
draft acceptance rate = 0.83944 ( 298 accepted / 355 generated)" ik_llama.cpp
Juha Nygård
Johneeee
AI & ML interests
None yet
Recent Activity
View all activity
Organizations
None yet