Submitted by
Jacob Nielsen
AI & ML interests
- Resource-constrained language modelling
- Evaluation
- Low-Bandwidth Distributed Training
- Quantization
- Medical Language Modelling
- AI x Human
Papers
FlexMoRE: A Flexible Mixture of Rank-heterogeneous Experts for Efficient Federatedly-trained Large Language Models
When are 1.58 bits enough? A Bottom-up Exploration of BitNet Quantization