Shuming Ma

shumingma

AI & ML interests

None yet

Organizations

Qwen, Social Post Explorers, BitNet

upvoted 2 papers about 1 year ago

BitNet: Scaling 1-bit Transformers for Large Language Models

Paper • 2310.11453 • Published Oct 17, 2023 • 108

BitNet b1.58 2B4T Technical Report

Paper • 2504.12285 • Published Apr 16, 2025 • 87
upvoted a collection about 1 year ago

BitNet

Collection
🔥BitNet family of large language models (1-bit LLMs). • 7 items • Updated May 1, 2025 • 62
upvoted a paper over 1 year ago

BitNet a4.8: 4-bit Activations for 1-bit LLMs

Paper • 2411.04965 • Published Nov 7, 2024 • 70
upvoted an article almost 2 years ago
Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

medmekk, marcsun13, lvwerra, pcuenq, osanseviero, thomwolf, +4 • Sep 18, 2024 • 281
upvoted a paper almost 2 years ago

Q-Sparse: All Large Language Models can be Fully Sparsely-Activated

Paper • 2407.10969 • Published Jul 15, 2024 • 23
upvoted a paper over 2 years ago

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 630