Vishwas1/hummingbird_Llama_distill_float32

hummingbird_Llama_distill_float32
91.5 MB
  • 1 contributor
History: 3 commits
Vishwas1
Training finished (float32, Xavier Init), epochs=3, lr=1e-07, clip=1, batch_size=4, loss=8.6898, perplexity=nan
491e7bc verified about 1 year ago
  • .gitattributes
    1.52 kB
    initial commit about 1 year ago
  • README.md
    24 Bytes
    initial commit about 1 year ago
  • special_tokens_map.json
    437 Bytes
    Training finished (float32, no LayerNorm), epochs=3, lr=1e-07, clip=1, batch_size=4, loss=9.0133, perplexity=nan about 1 year ago
  • student_model.pth
    91 MB
    Training finished (float32, Xavier Init), epochs=3, lr=1e-07, clip=1, batch_size=4, loss=8.6898, perplexity=nan about 1 year ago
  • tokenizer.model
    500 kB
    Training finished (float32, no LayerNorm), epochs=3, lr=1e-07, clip=1, batch_size=4, loss=9.0133, perplexity=nan about 1 year ago
  • tokenizer_config.json
    1.02 kB
    Training finished (float32, no LayerNorm), epochs=3, lr=1e-07, clip=1, batch_size=4, loss=9.0133, perplexity=nan about 1 year ago
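
The commit messages above log a finite loss next to perplexity=nan. As a rough cross-check, perplexity is conventionally the exponential of the mean per-token cross-entropy loss. The sketch below applies that relation to the two logged loss values. It assumes the logged loss is a mean cross-entropy in nats, which the repository does not state, and the helper name perplexity_from_loss is hypothetical. It does not reproduce how the training script computed its value or explain why nan was logged.

```python
import math


def perplexity_from_loss(mean_loss_nats: float) -> float:
    """Return exp(loss) for a mean cross-entropy loss given in nats.

    Assumption: the loss is a per-token average in natural-log units.
    A nan input is passed through; an overflowing input returns inf.
    """
    if math.isnan(mean_loss_nats):
        return float("nan")
    try:
        return math.exp(mean_loss_nats)
    except OverflowError:
        # math.exp raises for very large inputs instead of returning inf
        return float("inf")


# Loss values copied from the commit messages in the listing above.
logged_losses = {"Xavier Init": 8.6898, "no LayerNorm": 9.0133}

for label, loss in logged_losses.items():
    ppl = perplexity_from_loss(loss)
    print(f"{label}: loss={loss:.4f}, perplexity={ppl:.1f}")
```

Under that assumption both logged losses map to a finite perplexity, so the nan in the commit messages most likely comes from how the metric was computed or aggregated during training and not from the final loss value itself.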