Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

JuncaiL
/
llama-265m

Text Generation
Transformers
PyTorch
English
llama_moe
custom_code
Model card Files Files and versions
xet
Community
1
llama-265m
1.06 GB
  • 1 contributor
History: 7 commits
JuncaiL's picture
JuncaiL
Update README.md
a26cc97 verified almost 2 years ago
  • .gitattributes
    1.52 kB
    initial commit almost 2 years ago
  • README.md
    5.18 kB
    Update README.md almost 2 years ago
  • config.json
    1.39 kB
    fix state_dict loading in MoE model almost 2 years ago
  • configuration_llama_moe.py
    4.41 kB
    upload llama-265m model checkpoint almost 2 years ago
  • generation_config.json
    132 Bytes
    upload llama-265m model checkpoint almost 2 years ago
  • modeling_llama_moe_hf.py
    66.7 kB
    fix state_dict loading in MoE model almost 2 years ago
  • pytorch_model.bin
    1.06 GB
    xet
    upload llama-265m model checkpoint almost 2 years ago
  • special_tokens_map.json
    411 Bytes
    upload llama-265m model checkpoint almost 2 years ago
  • tokenizer.model
    500 kB
    xet
    upload llama-265m model checkpoint almost 2 years ago
  • tokenizer_config.json
    720 Bytes
    upload llama-265m model checkpoint almost 2 years ago
  • trainer_state.json
    72.6 kB
    upload llama-265m model checkpoint almost 2 years ago
  • training_args.bin
    3.95 kB
    xet
    upload llama-265m model checkpoint almost 2 years ago