wenjinf0811's picture
Upload README.md with huggingface_hub
40ed16d verified
---
language:
- en
- zh
- ja
- fr
- de
- es
license: apache-2.0
base_model: google/functiongemma-270m-it
tags:
- robotics
- function-calling
- gemma
- lora
- fine-tuned
- edge-ai
- jetson
- multilingual
pipeline_tag: text-generation
library_name: transformers
---
# FunctionGemma Robot Actions (Multilingual)
A fine-tuned [FunctionGemma 270M](https://huggingface.co/google/functiongemma-270m-it) model that converts natural language into structured robot action and emotion function calls. Supports **6 languages** with **98% accuracy** at **~59ms** on NVIDIA Jetson AGX Thor.
## Supported Languages
πŸ‡¬πŸ‡§ English Β· πŸ‡¨πŸ‡³ δΈ­ζ–‡ Β· πŸ‡―πŸ‡΅ ζ—₯本θͺž Β· πŸ‡«πŸ‡· FranΓ§ais Β· πŸ‡©πŸ‡ͺ Deutsch Β· πŸ‡ͺπŸ‡Έ EspaΓ±ol
## Example
```
Input: "Can you shake hands with me?" β†’ robot_action(shake_hand) + show_emotion(happy)
Input: "θ·Ÿζˆ‘ζ‘ζ‰‹" β†’ robot_action(shake_hand) + show_emotion(happy)
Input: "揑手してください" β†’ robot_action(shake_hand) + show_emotion(happy)
Input: "Serrez-moi la main" β†’ robot_action(shake_hand) + show_emotion(happy)
Input: "Gib mir die Hand" β†’ robot_action(shake_hand) + show_emotion(happy)
Input: "Dame la mano" β†’ robot_action(shake_hand) + show_emotion(happy)
Input: "ζˆ‘δ»Šε€©εΏƒζƒ…δΈε₯½" β†’ robot_action(stand_still) + show_emotion(sad)
Input: "γ‚γ‚Œγ―δ½•γ§γ™γ‹οΌŸ" β†’ robot_action(stand_still) + show_emotion(confused)
Input: "Raconte-moi une blague" β†’ robot_action(stand_still) + show_emotion(think)
```
## Supported Actions
| Action | Description |
|--------|-------------|
| `shake_hand` | Handshake gesture |
| `face_wave` | Wave hello / goodbye |
| `hands_up` | Raise both hands |
| `stand_still` | Stay idle (default for general conversation) |
| `show_hand` | Show open hand / present card for payment |
| `do_payment` | Do the payment / do the payment |
| `down_payment` | Finished the payment |
## Supported Emotions
| Emotion | Animation |
|---------|-----------|
| `happy` | Happy.riv |
| `sad` | Sad.riv |
| `excited` | Excited.riv |
| `confused` | Confused.riv |
| `curious` | Curious.riv |
| `think` | Think.riv |
Constrained decoding uses 2 forward passes instead of 33 autoregressive steps, achieving ~18x speedup over standard `model.generate()`.
## Training Details
| Parameter | Value |
|-----------|-------|
| Base model | `google/functiongemma-270m-it` |
| Method | LoRA (rank 8, alpha 16) |
| Training data | ~6,000 examples (545 English + ~5,450 multilingual) |
| Languages | English, Chinese, Japanese, French, German, Spanish |
| Epochs | 3 |
| Learning rate | 2e-4 |
| Batch size | 4 (effective 16 with gradient accumulation) |
| Max sequence length | 512 |
| Precision | bf16 |
| Hardware | NVIDIA RTX 5070 Ti (16 GB) |
Multilingual training data was generated using Claude API β€” 2 natural phrasings per language per English prompt, resulting in diverse and natural expressions rather than literal translations.
## Usage
### Quick Start
```python
from transformers import AutoTokenizer, AutoModelForCausalLM
import torch
model = AutoModelForCausalLM.from_pretrained(
"OpenmindAGI/functiongemma-finetuned-g1-multilingual",
torch_dtype=torch.bfloat16,
device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained("OpenmindAGI/functiongemma-finetuned-g1-multilingual")
model.eval()
```
## Citation
```bibtex
@misc{openmindagi-functiongemma-multilingual,
title={FunctionGemma Robot Actions (Multilingual)},
author={OpenmindAGI},
year={2025},
url={https://huggingface.co/OpenmindAGI/functiongemma-finetuned-g1-multilingual}
}
```
## License
Fine-tuned from [google/functiongemma-270m-it](https://huggingface.co/google/functiongemma-270m-it) under [Apache 2.0](https://www.apache.org/licenses/LICENSE-2.0).