Nurmukhamed 's Collections good-papers
updated
Battle of the Backbones: A Large-Scale Comparison of Pretrained Models
across Computer Vision Tasks
Paper
• 2310.19909
• Published • 21
Memory Augmented Language Models through Mixture of Word Experts
Paper
• 2311.10768
• Published • 19
FlashDecoding++: Faster Large Language Model Inference on GPUs
Paper
• 2311.01282
• Published • 37
Prompt Cache: Modular Attention Reuse for Low-Latency Inference
Paper
• 2311.04934
• Published • 32
Exponentially Faster Language Modelling
Paper
• 2311.10770
• Published • 119
Weight subcloning: direct initialization of transformers using larger
pretrained ones
Paper
• 2312.09299
• Published • 18
MobileVLM : A Fast, Reproducible and Strong Vision Language Assistant
for Mobile Devices
Paper
• 2312.16886
• Published • 22
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective
Depth Up-Scaling
Paper
• 2312.15166
• Published • 61
Tuning Language Models by Proxy
Paper
• 2401.08565
• Published • 22
Patchscope: A Unifying Framework for Inspecting Hidden Representations
of Language Models
Paper
• 2401.06102
• Published • 21
Medusa: Simple LLM Inference Acceleration Framework with Multiple
Decoding Heads
Paper
• 2401.10774
• Published • 60
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility,
Reasoning, and Efficiency
Paper
• 2508.18265
• Published • 217
StepWiser: Stepwise Generative Judges for Wiser Reasoning
Paper
• 2508.19229
• Published • 20
Analyzing the Effects of Supervised Fine-Tuning on Model Knowledge from
Token and Parameter Levels
Paper
• 2509.16596
• Published • 14
PyVision-RL: Forging Open Agentic Vision Models via RL
Paper
• 2602.20739
• Published • 31