Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

amd
/
kimi-k2.5-eagle3-fp8

Transformers
Safetensors
llama
speculative-decoding
eagle3
draft-model
kimi-k2.5
fp8
amd-quark
quantized
no-lm-head-quantization
text-generation-inference
quark
Model card Files Files and versions
xet
Community

Instructions to use amd/kimi-k2.5-eagle3-fp8 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

  • Libraries
  • Transformers

    How to use amd/kimi-k2.5-eagle3-fp8 with Transformers:

    # Load model directly
    from transformers import AutoTokenizer, LlamaForCausalLMEagle3
    
    tokenizer = AutoTokenizer.from_pretrained("amd/kimi-k2.5-eagle3-fp8")
    model = LlamaForCausalLMEagle3.from_pretrained("amd/kimi-k2.5-eagle3-fp8")
  • Notebooks
  • Google Colab
  • Kaggle
kimi-k2.5-eagle3-fp8
5.68 GB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 10 commits
chaoli-amd's picture
chaoli-amd
Update model card license notice
ffc7b2d verified about 21 hours ago
  • images
    Add files using upload-large-folder tool 1 day ago
  • .gitattributes
    1.64 kB
    Add files using upload-large-folder tool 1 day ago
  • LICENSE
    1.47 kB
    Add license file 1 day ago
  • README.md
    1.81 kB
    Update model card license notice about 21 hours ago
  • THIRD_PARTY_NOTICES.md
    1.66 kB
    Add third-party notices about 21 hours ago
  • config.json
    2.6 kB
    Add files using upload-large-folder tool 1 day ago
  • model-00001-of-00002.safetensors
    3.33 GB
    xet
    Add files using upload-large-folder tool 1 day ago
  • model-00002-of-00002.safetensors
    2.35 GB
    xet
    Add files using upload-large-folder tool 1 day ago
  • model.safetensors.index.json
    1.71 kB
    Add files using upload-large-folder tool 1 day ago
  • quark_profile.yaml
    5.44 kB
    Add files using upload-large-folder tool 1 day ago