Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
AndrewB's picture
1 7 189

AndrewB

aboundy
dvilasuero's profile picture 21world's profile picture Mi6paulino's profile picture
·

AI & ML interests

None yet

Organizations

GPU MODE's profile picture

upvoted a paper 11 months ago

Large Language Diffusion Models

Paper • 2502.09992 • Published Feb 14, 2025 • 125
upvoted a collection 11 months ago

Deepseek Papers

Collection
Deepseek papers collection • 28 items • Updated about 7 hours ago • 316
upvoted an article 12 months ago
view article
Article

Open-R1: Update #1

Feb 2, 2025
•
305
upvoted a paper over 1 year ago

GoldFinch: High Performance RWKV/Transformer Hybrid with Linear Pre-Fill and Extreme KV-Cache Compression

Paper • 2407.12077 • Published Jul 16, 2024 • 57
upvoted 2 papers almost 2 years ago

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12, 2024 • 70

Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM

Paper • 2403.07816 • Published Mar 12, 2024 • 44
upvoted a collection almost 2 years ago

Model Merging

Collection
Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 249
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs