Merging in a Bottle: Differentiable Adaptive Merging (DAM) and the Path from Averaging to Automation Paper • 2410.08371 • Published Oct 10, 2024 • 3
DEPAC: a Corpus for Depression and Anxiety Detection from Speech Paper • 2306.12443 • Published Jun 20, 2023
TokSuite: Measuring the Impact of Tokenizer Choice on Language Model Behavior Paper • 2512.20757 • Published 3 days ago • 14
TokSuite: Measuring the Impact of Tokenizer Choice on Language Model Behavior Paper • 2512.20757 • Published 3 days ago • 14
TokSuite: Measuring the Impact of Tokenizer Choice on Language Model Behavior Paper • 2512.20757 • Published 3 days ago • 14
The Butterfly Effect: Neural Network Training Trajectories Are Highly Sensitive to Initial Conditions Paper • 2506.13234 • Published Jun 16
Arcee's MergeKit: A Toolkit for Merging Large Language Models Paper • 2403.13257 • Published Mar 20, 2024 • 21