Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

daspartho
/
nanochat-depth-recurrence

Model card Files Files and versions
xet
Community
nanochat-depth-recurrence
19.2 GB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 11 commits
daspartho's picture
daspartho
d16 u1: val_bpb=0.8903 CORE=0.1213
9e0d2ee verified about 2 months ago
  • d12_u1
    d12 u1: val_bpb=0.9563 CORE=0.1040 about 2 months ago
  • d12_u12
    d12 u12 baseline: val_bpb=0.8848 CORE=0.1272 (50% training, 1260 steps) about 2 months ago
  • d12_u3
    d12 u3: val_bpb=0.9206 CORE=0.1236 about 2 months ago
  • d12_u6
    d12 u6: val_bpb=0.9012 CORE=0.1292 about 2 months ago
  • d16_u1
    d16 u1: val_bpb=0.8903 CORE=0.1213 about 2 months ago
  • d16_u16
    d16 u16: val_bpb=0.8114 CORE=0.1804 about 2 months ago
  • d16_u4
    d16 u4: val_bpb=0.8402 CORE=0.1677 about 2 months ago
  • d16_u8
    d16 u8: val_bpb=0.8287 CORE=0.1577 about 2 months ago
  • .gitattributes
    1.52 kB
    initial commit about 2 months ago