FastFlowLM/Gemma3-4B-Text-NPU2

Transformers
English
gemma3_text
unsloth
gemma3
gemma
google

Instructions to use FastFlowLM/Gemma3-4B-Text-NPU2 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

  • Libraries
  • Transformers

    How to use FastFlowLM/Gemma3-4B-Text-NPU2 with Transformers:

    # Load model directly
    from transformers import AutoModel
    model = AutoModel.from_pretrained("FastFlowLM/Gemma3-4B-Text-NPU2", dtype="auto")
  • Notebooks
  • Google Colab
  • Kaggle
  • Local Apps
  • Unsloth Studio

    How to use FastFlowLM/Gemma3-4B-Text-NPU2 with Unsloth Studio:

    Install Unsloth Studio (macOS, Linux, WSL):
    curl -fsSL https://unsloth.ai/install.sh | sh
    # Run unsloth studio
    unsloth studio -H 0.0.0.0 -p 8888
    # Then open http://localhost:8888 in your browser
    # Search for FastFlowLM/Gemma3-4B-Text-NPU2 to start chatting
    Install Unsloth Studio (Windows):
    irm https://unsloth.ai/install.ps1 | iex
    # Run unsloth studio
    unsloth studio -H 0.0.0.0 -p 8888
    # Then open http://localhost:8888 in your browser
    # Search for FastFlowLM/Gemma3-4B-Text-NPU2 to start chatting
    Using Hugging Face Spaces for Unsloth:
    # No setup required
    # Open https://huggingface.co/spaces/unsloth/studio in your browser
    # Search for FastFlowLM/Gemma3-4B-Text-NPU2 to start chatting
    Load model with FastModel:
    # Install the package first (shell command)
    pip install unsloth
    # Then, in Python:
    from unsloth import FastModel
    model, tokenizer = FastModel.from_pretrained(
        model_name="FastFlowLM/Gemma3-4B-Text-NPU2",
        max_seq_length=2048,
    )
Gemma3-4B-Text-NPU2
3.8 GB
  • 2 contributors
History: 6 commits
Latest commit: FastFlowLM, "Fix RMS Norm Weights" (0865d79, verified), 9 months ago
  File                    Size        Storage   Last commit
  .gitattributes          401 Bytes             init, 9 months ago
  README.md               21.9 kB               Create README.md, 9 months ago
  attn.xclbin             465 kB      xet       init, 9 months ago
  config.json             988 Bytes             Update config.json, 9 months ago
  dequant.xclbin          115 kB      xet       init, 9 months ago
  layer.xclbin            283 kB      xet       init, 9 months ago
  lm_head.xclbin          153 kB      xet       init, 9 months ago
  mm.xclbin               348 kB      xet       init, 9 months ago
  model.q4nx              3.77 GB     xet       Fix RMS Norm Weights, 9 months ago
  tokenizer.json          33.4 MB     xet       init, 9 months ago
  tokenizer_config.json   1.16 MB     xet       update_chat_template, 9 months ago
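
As a quick sanity check on the listing above, the per-file sizes can be summed and compared with the 3.8 GB shown for the repository. The sketch below is only illustrative: the sizes are copied by hand from the listing, the helper names (`parse_size`, `total_bytes`) are hypothetical, and it assumes the page uses decimal units (1 kB = 1000 bytes), which the page itself does not state.

```python
# Sizes copied by hand from the file listing; they are rounded display values,
# so the total is approximate.
LISTED_SIZES = {
    ".gitattributes": "401 Bytes",
    "README.md": "21.9 kB",
    "attn.xclbin": "465 kB",
    "config.json": "988 Bytes",
    "dequant.xclbin": "115 kB",
    "layer.xclbin": "283 kB",
    "lm_head.xclbin": "153 kB",
    "mm.xclbin": "348 kB",
    "model.q4nx": "3.77 GB",
    "tokenizer.json": "33.4 MB",
    "tokenizer_config.json": "1.16 MB",
}

# Assumption: decimal (SI) units, i.e. 1 kB = 1000 bytes.
UNIT_FACTORS = {"Bytes": 1, "kB": 1000, "MB": 1000**2, "GB": 1000**3}


def parse_size(text: str) -> float:
    """Convert a display string such as '465 kB' into a byte count."""
    value, unit = text.split()
    return float(value) * UNIT_FACTORS[unit]


def total_bytes(sizes: dict) -> float:
    """Sum the byte counts of all listed files."""
    return sum(parse_size(size) for size in sizes.values())


if __name__ == "__main__":
    total = total_bytes(LISTED_SIZES)
    weights_share = parse_size(LISTED_SIZES["model.q4nx"]) / total
    print(f"Total: {total / 1000**3:.1f} GB")
    print(f"model.q4nx share of total: {weights_share:.1%}")
```

Under these assumptions the total comes to roughly 3.8 GB, matching the repository size shown, with model.q4nx accounting for nearly all of it.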