Instructions to use FastFlowLM/Gemma4-E2B-IT-NPU2 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use FastFlowLM/Gemma4-E2B-IT-NPU2 with Transformers:
# Load model directly from transformers import AutoTokenizer, AutoModelForImageTextToText tokenizer = AutoTokenizer.from_pretrained("FastFlowLM/Gemma4-E2B-IT-NPU2") model = AutoModelForImageTextToText.from_pretrained("FastFlowLM/Gemma4-E2B-IT-NPU2") - Notebooks
- Google Colab
- Kaggle
feat: Q8_0 embedding
Browse files- model.q4nx +2 -2
model.q4nx
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:aaca4bf5592804f0752c9f5fb7b4cce2f517fd869ef1d8181b209c8b6dbf2a87
|
| 3 |
+
size 4671875302
|