
FRANKENSTALLM Ollama Benchmark Results

  • Date: 2026-03-09 14:28:08
  • Models: frankenstallm-3b
  • Total test cases: 5

Overall Auto-Scored Average

| Model | Auto Avg |
|---|---|
| frankenstallm-3b | 0.0 |

Auto-Scored Results by Category

| Category | frankenstallm-3b |
|---|---|
| korean_nlu | 0.0 (3a/0m) |

Latency Comparison

| Model | Avg TTFT (ms) | P50 TTFT | P95 TTFT | Avg TPS | P50 TPS | P95 TPS |
|---|---|---|---|---|---|---|
| frankenstallm-3b | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
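
For readers unfamiliar with the column names, the following is a minimal sketch of how average, P50, and P95 figures like those above could be derived from per-request samples of time to first token (TTFT) or tokens per second (TPS). The function names `percentile` and `summarize`, the nearest-rank percentile method, and the choice to report 0.0 for an empty sample are all assumptions made for illustration; the benchmark's actual aggregation code is not shown in this summary.

```python
import math


def percentile(values, pct):
    """Nearest-rank percentile of a list of numbers.

    Returns 0.0 for an empty list (an assumed convention, not
    necessarily what the benchmark script does).
    """
    if not values:
        return 0.0
    ordered = sorted(values)
    # Nearest-rank: smallest value with at least pct% of samples at or below it.
    rank = max(1, math.ceil(pct / 100 * len(ordered)))
    return ordered[rank - 1]


def summarize(values):
    """Return the average, P50, and P95 of a list of samples."""
    avg = sum(values) / len(values) if values else 0.0
    return {
        "avg": avg,
        "p50": percentile(values, 50),
        "p95": percentile(values, 95),
    }


if __name__ == "__main__":
    # Hypothetical TTFT samples in milliseconds, for illustration only.
    print(summarize([100.0, 200.0, 300.0, 400.0]))
```

Under these assumptions, an empty sample list yields 0.0 in every column.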

Repetition Analysis Detail

| Model | Test ID | Rep Rate | Unique/Total N-grams | Score |
|---|---|---|---|---|
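
The table columns suggest a repetition metric based on the ratio of unique to total n-grams. Below is a minimal sketch of one common way to compute such a rate. The function name `ngram_repetition`, whitespace tokenization, the default `n=3`, and the formula `1 - unique / total` are assumptions for illustration; this summary does not state how the benchmark defines "Rep Rate" or "Score".

```python
def ngram_repetition(text, n=3):
    """Count unique and total word n-grams and derive a repetition rate.

    The rate is 1 - unique / total, so 0.0 means no repeated n-grams.
    Texts shorter than n tokens yield zeros (an assumed convention).
    """
    tokens = text.split()  # naive whitespace tokenization (assumption)
    ngrams = [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]
    total = len(ngrams)
    if total == 0:
        return {"unique": 0, "total": 0, "rep_rate": 0.0}
    unique = len(set(ngrams))
    return {"unique": unique, "total": total, "rep_rate": 1 - unique / total}


if __name__ == "__main__":
    # A deliberately repetitive string, for illustration only.
    print(ngram_repetition("a b c a b c a b c"))
```

With this definition, a looping generation scores close to 1.0, while text with no repeated n-grams scores 0.0.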

Manual Review Needed

The following prompts require human evaluation:

frankenstallm-3b