Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

BabyLM Sequence Length

community
Activity Feed

AI & ML interests

BabyLM 2025 paper submission

Richard Diehl Martinez's profile pictureSuchir Salhan's profile pictureZeb Goriely's profile picture

babylm-seqlen 's models 50

babylm-seqlen/opt-256

0.1B • Updated Jun 3, 2025

babylm-seqlen/opt-512

0.1B • Updated Jun 3, 2025

babylm-seqlen/opt-1024

0.1B • Updated Jun 3, 2025

babylm-seqlen/mamba-1024

0.2B • Updated Jun 3, 2025

babylm-seqlen/mamba-512

0.2B • Updated Jun 3, 2025

babylm-seqlen/mamba-256

0.2B • Updated Jun 3, 2025 • 1

babylm-seqlen/mamba-128

0.2B • Updated Jun 3, 2025 • 1

babylm-seqlen/mamba-64

0.2B • Updated Jun 3, 2025

babylm-seqlen/opt-128

0.1B • Updated Jun 3, 2025 • 1

babylm-seqlen/opt-64

0.1B • Updated Jun 3, 2025

babylm-seqlen/opt-256-warmup

0.1B • Updated May 31, 2025

babylm-seqlen/opt-128-warmup

0.1B • Updated May 31, 2025

babylm-seqlen/opt-64-warmup

0.1B • Updated May 31, 2025 • 1

babylm-seqlen/opt-512-warmup

0.1B • Updated May 31, 2025

babylm-seqlen/opt-2048-warmup

0.1B • Updated May 31, 2025

babylm-seqlen/opt-4096-warmup

0.1B • Updated May 31, 2025

babylm-seqlen/opt-8192-warmup

0.1B • Updated May 31, 2025

babylm-seqlen/opt-1024-warmup

0.1B • Updated May 31, 2025 • 1

babylm-seqlen/opt-dummy

0.1B • Updated Apr 21, 2025 • 1

babylm-seqlen/tokenizer

Updated Apr 7, 2025
  • Previous
  • 1
  • 2
  • Next
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs