
litert-community/functiongemma-270m-ft-mobile-actions

Main Model Card: google/functiongemma-270m-it

This model card covers the Mobile Actions model, which is packaged and ready for deployment in the Google AI Edge Gallery app.

Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models. This particular Gemma model is especially small, which makes it well suited to on-device use cases. Running the model on device gives users private access to generative AI technology without requiring an internet connection.

Try it live

The Mobile Actions model is a fine-tune of google/functiongemma-270m-it. To try out Mobile Actions, download the Google AI Edge Gallery app on your phone, open the Mobile Actions panel, and tap the download button.

Performance

Android

Benchmarked on a Samsung Galaxy S25 Ultra with 512 prefill tokens and 256 decode tokens.

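| Backend | Quantization scheme | Context length | Prefill (tokens/sec) | Decode (tokens/sec) | Time-to-first-token (s) | Model size (MB) | Peak RSS memory (MB) |
|---------|---------------------|----------------|----------------------|---------------------|-------------------------|-----------------|----------------------|
| CPU     | dynamic_int8        | 1024           | 2238                 | 154.2               | 0.24                    | 289             | 510                  |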

Notes:

  • Model size is measured as the size of the file on disk.
  • Inference on CPU is accelerated via the LiteRT XNNPACK delegate with 4 threads.
  • The benchmark is run with the cache enabled and initialized. Latency and memory usage may differ during the first run.
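
As a rough sanity check on the figures above, the reported throughput can be turned into wall-clock estimates for the benchmark workload. The sketch below is only illustrative arithmetic: the function name `estimate_latency` is hypothetical, and it assumes prefill and decode run back to back at the reported average rates with no other overhead, which the source does not state.

```python
# Figures copied from the Android benchmark (S25 Ultra, CPU, dynamic_int8).
PREFILL_TOKENS = 512   # prompt tokens processed before the first output token
DECODE_TOKENS = 256    # tokens generated
PREFILL_TPS = 2238.0   # reported prefill throughput, tokens/sec
DECODE_TPS = 154.2     # reported decode throughput, tokens/sec


def estimate_latency(prefill_tokens, decode_tokens, prefill_tps, decode_tps):
    """Return (prefill_seconds, decode_seconds, total_seconds).

    Assumption: constant throughput in each phase and no fixed overhead.
    """
    prefill_s = prefill_tokens / prefill_tps
    decode_s = decode_tokens / decode_tps
    return prefill_s, decode_s, prefill_s + decode_s


prefill_s, decode_s, total_s = estimate_latency(
    PREFILL_TOKENS, DECODE_TOKENS, PREFILL_TPS, DECODE_TPS
)
print(f"prefill ~ {prefill_s:.2f} s")  # prefill ~ 0.23 s
print(f"decode  ~ {decode_s:.2f} s")   # decode  ~ 1.66 s
print(f"total   ~ {total_s:.2f} s")    # total   ~ 1.89 s
```

The prefill estimate of about 0.23 s sits close to the reported 0.24 s time-to-first-token, which suggests that the time to first token is dominated by prompt processing for this workload. The remaining gap is presumably fixed overhead that this simple model ignores.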
Last updated: May 24