The Smol Training Playbook 📚: The secrets to building world-class LLMs • 2.69k
MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation Paper • 2406.07529 • Published Jun 11, 2024
Scaling Latent Reasoning via Looped Language Models Paper • 2510.25741 • Published Oct 29 • 221
Article: Finally, a Replacement for BERT: Introducing ModernBERT • Published Dec 19, 2024 • 713
Qwen2 Collection: Qwen2 language models, including pretrained and instruction-tuned variants in 5 sizes: 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Jul 21 • 374