13 18 75

Yang

jacklanda

AI & ML interests

Reasoning, Mech Interp, Semantics

Recent Activity

authored a paper 25 days ago

Xetrieval: Mechanistically Explaining Dense Retrieval

updated a collection 27 days ago

Semantics

upvoted a paper 28 days ago

Xetrieval: Mechanistically Explaining Dense Retrieval

View all activity

Organizations

upvoted a paper 28 days ago

Xetrieval: Mechanistically Explaining Dense Retrieval

Paper • 2605.29507 • Published 29 days ago • 21

upvoted a paper 2 months ago

Revisiting a Pain in the Neck: A Semantic Reasoning Benchmark for Language Models

Paper • 2604.16593 • Published Apr 17 • 6

upvoted 4 papers 4 months ago

\$OneMillion-Bench: How Far are Language Agents from Human Experts?

Paper • 2603.07980 • Published Mar 9 • 27

MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier

Paper • 2603.03756 • Published Mar 4 • 90

Understanding and Leveraging the Expert Specialization of Context Faithfulness in Mixture-of-Experts LLMs

Paper • 2508.19594 • Published Aug 27, 2025 • 3

LM-Lexicon: Improving Definition Modeling via Harmonizing Semantic Experts

Paper • 2602.14060 • Published Feb 15 • 2

upvoted a paper 6 months ago

TongSIM: A General Platform for Simulating Intelligent Machines

Paper • 2512.20206 • Published Dec 23, 2025 • 28

upvoted 3 papers 7 months ago

ReflectEvo: Improving Meta Introspection of Small LLMs by Learning Self-Reflection

Paper • 2505.16475 • Published May 22, 2025 • 3

RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling

Paper • 2506.08672 • Published Jun 10, 2025 • 30

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Paper • 2512.07461 • Published Dec 8, 2025 • 80

upvoted a paper 12 months ago

Resa: Transparent Reasoning Models via SAEs

Paper • 2506.09967 • Published Jun 11, 2025 • 22

upvoted 3 papers about 1 year ago

Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space

Paper • 2505.13308 • Published May 19, 2025 • 27

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6, 2025 • 191

OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts

Paper • 2503.22952 • Published Mar 29, 2025 • 17

upvoted 3 papers about 2 years ago

CCAE: A Corpus of Chinese-based Asian Englishes

Paper • 2310.05381 • Published Oct 9, 2023 • 1

Revisiting a Pain in the Neck: Semantic Phrase Processing Benchmark for Language Models

Paper • 2405.02861 • Published May 5, 2024 • 1

MindMap: Knowledge Graph Prompting Sparks Graph of Thoughts in Large Language Models

Paper • 2308.09729 • Published Aug 17, 2023 • 6

upvoted an article about 2 years ago

Article

Fine-tuning Llama 2 70B using PyTorch FSDP

smangrul, sgugger, lewtun, philschmid

•

Sep 13, 2023

• 32

Yang

AI & ML interests

Recent Activity

Organizations

jacklanda's activity

Fine-tuning Llama 2 70B using PyTorch FSDP