Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

FastFlowLM
/
Llama-3.1-8B-NPU2

Text Generation
English
llama
llama-3.1
AMD
Ryzen
NPU
conversational
Model card Files Files and versions
xet
Community
Llama-3.1-8B-NPU2
Ctrl+K
Ctrl+K
  • 2 contributors
History: 14 commits
FastFlowLM's picture
FastFlowLM
Update README.md
5ac86d4 verified 9 months ago
  • .gitattributes
    343 Bytes
    support_prefill 11 months ago
  • README.md
    2.21 kB
    Update README.md 9 months ago
  • attn.xclbin
    698 kB
    xet
    support_prefill 11 months ago
  • config.json
    1.02 kB
    Update config.json 10 months ago
  • dequant.xclbin
    116 kB
    xet
    update_dequant 10 months ago
  • layer.xclbin
    332 kB
    xet
    update_glu 10 months ago
  • lm_head.xclbin
    153 kB
    xet
    support_prefill 11 months ago
  • mm.xclbin
    348 kB
    xet
    new_mm 10 months ago
  • model.q4nx
    5.74 GB
    xet
    add_model about 1 year ago
  • tokenizer.json
    17.2 MB
    xet
    add_model about 1 year ago
  • tokenizer_config.json
    4.33 kB
    Add tokenizer_config 10 months ago