Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

OsakanaTeishoku
/
dummy-3.6b

Text Generation
Transformers
Safetensors
deepseek
custom_code
Model card Files Files and versions
xet
Community
dummy-3.6b
7.22 GB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 14 commits
OsakanaTeishoku's picture
OsakanaTeishoku
Upload modeling_deepseek.py
626d566 verified about 2 years ago
  • .gitattributes
    1.52 kB
    initial commit about 2 years ago
  • added_tokens.json
    22 Bytes
    Upload added_tokens.json about 2 years ago
  • config.json
    1.19 kB
    Upload config.json about 2 years ago
  • configuration_deepseek.py
    10.2 kB
    Upload configuration_deepseek.py about 2 years ago
  • generation_config.json
    111 Bytes
    Upload generation_config.json about 2 years ago
  • model-00001-of-00002.safetensors
    4.99 GB
    xet
    Upload model-00001-of-00002.safetensors about 2 years ago
  • model-00002-of-00002.safetensors
    2.22 GB
    xet
    Upload model-00002-of-00002.safetensors about 2 years ago
  • model.safetensors.index.json
    98.7 kB
    Upload model.safetensors.index.json about 2 years ago
  • modeling_deepseek.py
    72.7 kB
    Upload modeling_deepseek.py about 2 years ago
  • special_tokens_map.json
    1.02 kB
    Upload special_tokens_map.json about 2 years ago
  • tokenizer.json
    3.9 MB
    Upload tokenizer.json about 2 years ago
  • tokenizer_config.json
    1.82 kB
    Upload tokenizer_config.json about 2 years ago
  • trainer_state.json
    32.4 kB
    Upload trainer_state.json about 2 years ago
  • zero_to_fp32.py
    25.3 kB
    Upload zero_to_fp32.py about 2 years ago