Center for Language and Speech Processing @ JHU

university

https://www.clsp.jhu.edu/

Recent Activity

mmarone updated a collection 9 days ago

mmBERT: a modern multilingual encoder

TaiMingLu authored a paper 12 days ago

Strong Teacher Not Needed? On Distillation in LLM Pretraining

TaiMingLu authored a paper 12 days ago

i1: A Simple and Fully Open Recipe for Strong Text-to-Image Models

Papers

DAR: Deontic Reasoning with Agentic Harnesses

Trust Functions: Near-Lossless Weak-to-Strong Generalization by Learning When to Trust the Weak Teacher

Collections 3

Spaces 1

Science Hierarchography

Explore academic paper hierarchies and details

Models 53

jhu-clsp/mmBERT-small

Fill-Mask • Updated Oct 17, 2025 • 21.6k • 76

jhu-clsp/mmBERT-base

Fill-Mask • Updated Oct 7, 2025 • 345k • 217

jhu-clsp/mmBERT-checkpoints

Updated Sep 9, 2025 • 4

jhu-clsp/ettin-decoder-1b

Fill-Mask • Updated Jul 21, 2025 • 21 • 5

jhu-clsp/ettin-decoder-32m

Text Generation • Updated Jul 18, 2025 • 357

jhu-clsp/ettin-encoder-1b

Feature Extraction • Updated Jul 18, 2025 • 1.86k • 22

jhu-clsp/ettin-encoder-68m

Fill-Mask • Updated Jul 18, 2025 • 66.8k • 5

jhu-clsp/ettin-dec-from-enc-32m

Text Generation • Updated Jul 18, 2025 • 4

jhu-clsp/ettin-encoder-150m

Fill-Mask • Updated Jul 18, 2025 • 6k • 13

jhu-clsp/ettin-decoder-400m

Text Generation • Updated Jul 18, 2025 • 6.98k • 4

Datasets 40

jhu-clsp/ManyIH-Bench

Preview • Updated Apr 13 • 46 • 3

jhu-clsp/robust04-instructions

Viewer • Updated Mar 12 • 136k • 938 • 2

jhu-clsp/core17-instructions

Viewer • Updated Mar 12 • 49.4k • 955 • 2

jhu-clsp/news21-instructions

Viewer • Updated Mar 12 • 71.5k • 741 • 1

jhu-clsp/SciTaRC

Viewer • Updated Mar 6 • 371 • 52 • 1

jhu-clsp/megawika-2

Updated Mar 3 • 92 • 4

jhu-clsp/mmBERT-decay-data

Updated Dec 11, 2025 • 33.3k • 6

jhu-clsp/mmBERT-midtraining-data

Updated Oct 13, 2025 • 2.28k • 1

jhu-clsp/ettin-pretraining-data

Updated Jul 18, 2025 • 106k • 9

jhu-clsp/ettin-decay-data

Updated Jul 18, 2025 • 1.15k • 1
