1bitLLM
/

bitnet_b1_58-3B

Text Generation

text-generation-inference

Model card Files Files and versions

Resources

View closed (1)

Add metadata and link to paper

#12 opened about 1 year ago by

can you provide wikitest ppl and c4 ppl separately?

#11 opened almost 2 years ago by

Can you provide more details on the training?

#10 opened almost 2 years ago by

Any plans to use MQA (multi-query attention) or GQA (grouped-query attention) in the future?

#9 opened about 2 years ago by

Efficient Inference Kernel Support for 1.58bit.

#8 opened about 2 years ago by

This code from BitLinear doesn't make sense

#7 opened about 2 years ago by

Is it bitnet {-1,0,1}?

#6 opened about 2 years ago by

ValueError: Tokenizer class BitnetTokenizer does not exist or is not currently imported.

#5 opened about 2 years ago by

ryanzhangofficial

Longer inference time

#4 opened about 2 years ago by

Why are these models fp32?

#2 opened about 2 years ago by

Is there a chat/instruct model in plans?

#1 opened about 2 years ago by