Daniel Khashabi

danyaljj

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

Self-Compacting Language Model Agents

Paper • 2606.23525 • Published 8 days ago • 18

liked a dataset 13 days ago

sheepy928/AutoMat

Viewer • Updated 15 days ago • 74 • 1.3k • 2

upvoted a paper 20 days ago

Trust Functions: Near-Lossless Weak-to-Strong Generalization by Learning When to Trust the Weak Teacher

Paper • 2606.01000 • Published about 1 month ago • 6

upvoted a paper 25 days ago

DAR: Deontic Reasoning with Agentic Harnesses

Paper • 2606.05009 • Published 27 days ago • 6

upvoted a paper about 1 month ago

Steered LLM Activations are Non-Surjective

Paper • 2604.09839 • Published May 7 • 15

liked a dataset 2 months ago

jhu-clsp/ManyIH-Bench

Preview • Updated Apr 13 • 47 • 3

upvoted a paper 3 months ago

Many-Tier Instruction Hierarchy in LLM Agents

Paper • 2604.09443 • Published Apr 10 • 16

upvoted a paper 4 months ago

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 526

liked a Space 5 months ago

Science Hierarchography

📚

Explore academic paper hierarchies and details

liked a model 6 months ago

allenai/unifiedqa-t5-base

Updated Jan 24, 2023 • 1.67k • 12

liked a dataset 6 months ago

NIH-CARD/CARDBiomedBench

Viewer • Updated Jul 21, 2025 • 68.2k • 78 • 6

upvoted 2 papers 7 months ago

ChiKhaPo: A Large-Scale Multilingual Benchmark for Evaluating Lexical Comprehension and Generation in Large Language Models

Paper • 2510.16928 • Published Oct 19, 2025 • 4

Genomic Next-Token Predictors are In-Context Learners

Paper • 2511.12797 • Published Nov 16, 2025 • 8

commented on a paper 7 months ago

Genomic Next-Token Predictors are In-Context Learners

Paper • 2511.12797 • Published Nov 16, 2025 • 8

upvoted a paper 8 months ago

SynthTextEval: Synthetic Text Data Generation and Evaluation for High-Stakes Domains

Paper • 2507.07229 • Published Jul 9, 2025 • 11

authored a paper 8 months ago

World-in-World: World Models in a Closed-Loop World

Paper • 2510.18135 • Published Oct 20, 2025 • 78

upvoted 2 papers 8 months ago

World-in-World: World Models in a Closed-Loop World

Paper • 2510.18135 • Published Oct 20, 2025 • 78

MedScore: Generalizable Factuality Evaluation of Free-Form Medical Answers by Domain-adapted Claim Decomposition and Verification

Paper • 2505.18452 • Published May 24, 2025 • 4

liked a dataset 8 months ago

ash56/ShiftySpeech

Viewer • Updated Oct 24, 2025 • 3M • 110 • 22

upvoted a paper 9 months ago

The Alignment Waltz: Jointly Training Agents to Collaborate for Safety

Paper • 2510.08240 • Published Oct 9, 2025 • 41
