alanzhuly commited on
Commit
61f4a04
·
verified ·
1 Parent(s): 4553f03

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +35 -0
README.md ADDED
@@ -0,0 +1,35 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Gemma-3n-E4B-IT
2
+
3
+ ## Model Description
4
+ **Gemma 3n E4B-IT**, developed by Google DeepMind, is a 4-billion-parameter efficient multimodal model.
5
+ Built with MatFormer architecture and dynamic parameter activation, it delivers strong text, image, audio, and video understanding while remaining lightweight enough for on-device deployment.
6
+ It supports a 32K context window and multilingual inputs across more than 140 languages.
7
+
8
+ ## Features
9
+ - **Multimodal input**: text, image (up to 768×768), audio, and video.
10
+ - **Efficient design**: parameter skipping and caching enable deployment on edge devices.
11
+ - **Large context window**: up to 32K tokens.
12
+ - **Multilingual**: trained on 140+ languages.
13
+ - **Compact but strong**: achieves benchmark scores competitive with much larger models.
14
+
15
+ ## Use Cases
16
+ - Visual question answering and captioning
17
+ - Conversational agents with multimodal inputs
18
+ - On-device assistants for mobile and embedded systems
19
+ - Multilingual summarization, translation, and transcription
20
+
21
+ ## Inputs and Outputs
22
+ **Input**:
23
+ - Text prompts or dialogue
24
+ - Images, audio, and video (tokenized for processing)
25
+
26
+ **Output**:
27
+ - Generated text (answers, captions, translations, summaries)
28
+
29
+ ## License
30
+ - Licensed under Google’s Gemma terms of use. See Hugging Face model card for details.
31
+
32
+ ## References
33
+ - [Hugging Face: google/gemma-3n-E4B-it](https://huggingface.co/google/gemma-3n-E4B-it)
34
+ - [Gemma 3n documentation](https://ai.google.dev/gemma/docs/gemma-3n)
35
+ - [Google AI blog announcement](https://developers.googleblog.com/en/introducing-gemma-3n-developer-guide/)