| Benchmark | n-shot | Function Gemma 270m |
|---|---|---|
| BFCL Simple | 0-shot | 61.6 |
| BFCL Parallel | 0-shot | 63.5 |
| BFCL Multiple | 0-shot | 39 |
| BFCL Parallel Multiple | 0-shot | 29.5 |
| BFCL Live Simple | 0-shot | 36.2 |
| BFCL Live Parallel | 0-shot | 25.7 |
| BFCL Live Multiple | 0-shot | 22.9 |
| BFCL Live Parallel Multiple | 0-shot | 20.8 |
| BFCL Relevance | 0-shot | 61.1 |
| BFCL Irrelevance | 0-shot | 70.6 |
Model |
Eval results for Mobile Actions |
|---|---|
Base FunctionGemma model |
58% |
Mobile Actions Fine-Tune |
85% |
Backend |
Quantization scheme |
Context length |
Prefill (tokens per second) |
Decode (tokens per second) |
Time-to-first-token (seconds) |
Model Size (MB) |
Peak RSS Memory (MB) |
|---|---|---|---|---|---|---|---|
CPU |
dynamic_int8 |
1024 |
1718 |
125.9 |
0.3 |
288 |
551 |
Backend |
Quantization scheme |
Context length |
Prefill (tokens per second) |
Decode (tokens per second) |
Time-to-first-token (seconds) |
Model Size (MB) |
Peak RSS Memory (MB) |
|---|---|---|---|---|---|---|---|
CPU |
dynamic_int8 |
1024 |
1743 |
125.7 |
0.3 |
288 |
549 |