Bitnet.cpp: Efficient Edge Inference for Ternary LLMs Paper • 2502.11880 • Published Feb 17, 2025 • 17
Baichuan-M3 Collection Modeling Clinical Inquiry for Reliable Medical Decision-Making • 6 items • Updated about 1 month ago • 17
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models Paper • 2601.07372 • Published Jan 12 • 47