Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

armenjeddi
/
LoopFormer-3block-8iterations-FineWeb300K

Safetensors
loopformer
custom_code
Model card Files Files and versions
xet
Community
LoopFormer-3block-8iterations-FineWeb300K
561 MB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 2 commits
armenjeddi's picture
armenjeddi
Add Base model with 24 layers - Trained on FineWeb_Edu for 300K iterations
e7c8764 verified about 1 month ago
  • .gitattributes
    1.52 kB
    initial commit about 1 month ago
  • config.json
    296 Bytes
    Add Base model with 24 layers - Trained on FineWeb_Edu for 300K iterations about 1 month ago
  • generation_config.json
    69 Bytes
    Add Base model with 24 layers - Trained on FineWeb_Edu for 300K iterations about 1 month ago
  • merges.txt
    456 kB
    Add Base model with 24 layers - Trained on FineWeb_Edu for 300K iterations about 1 month ago
  • model.safetensors
    556 MB
    xet
    Add Base model with 24 layers - Trained on FineWeb_Edu for 300K iterations about 1 month ago
  • modeling_loopformer.py
    12 kB
    Add Base model with 24 layers - Trained on FineWeb_Edu for 300K iterations about 1 month ago
  • special_tokens_map.json
    131 Bytes
    Add Base model with 24 layers - Trained on FineWeb_Edu for 300K iterations about 1 month ago
  • tokenizer.json
    3.56 MB
    Add Base model with 24 layers - Trained on FineWeb_Edu for 300K iterations about 1 month ago
  • tokenizer_config.json
    507 Bytes
    Add Base model with 24 layers - Trained on FineWeb_Edu for 300K iterations about 1 month ago
  • vocab.json
    798 kB
    Add Base model with 24 layers - Trained on FineWeb_Edu for 300K iterations about 1 month ago