Junjie Ye

Junjie-Ye

9 10 4

AI & ML interests

None yet

Recent Activity

authored a paper 19 days ago

LLMEval-Logic: A Solver-Verified Chinese Benchmark for Logical Reasoning of LLMs with Adversarial Hardening

authored a paper 2 months ago

MagicAgent: Towards Generalized Agent Planning

new activity 3 months ago

bytedance-research/ToolHop:是否会继续评测呢？

View all activity

Organizations

authored a paper 19 days ago

LLMEval-Logic: A Solver-Verified Chinese Benchmark for Logical Reasoning of LLMs with Adversarial Hardening

Paper • 2605.19597 • Published May 19 • 21

authored a paper 2 months ago

MagicAgent: Towards Generalized Agent Planning

Paper • 2602.19000 • Published Mar 1

New activity in bytedance-research/ToolHop 3 months ago

是否会继续评测呢？

#6 opened 8 months ago by

user7860

updated a Space 3 months ago

ToolHop

🔥

Explore the ToolHop LLM benchmark leaderboard

published a Space 3 months ago

ToolHop

🔥

Explore the ToolHop LLM benchmark leaderboard

updated 2 datasets 3 months ago

bytedance-research/ToolHop

Updated Apr 11 • 516 • 23

Junjie-Ye/MulDimIF

Viewer • Updated Apr 6 • 9.11k • 333 • 3

submitted a paper to Daily Papers 3 months ago

CCTU: A Benchmark for Tool Use under Complex Constraints

Paper • 2603.15309 • Published Mar 16 • 2

upvoted a paper 3 months ago

CCTU: A Benchmark for Tool Use under Complex Constraints

Paper • 2603.15309 • Published Mar 16 • 2

updated a model 3 months ago

Junjie-Ye/TL-CodeLLaMA-2

Updated Mar 17 • 7 • 1

liked a dataset 3 months ago

Junjie-Ye/CCTU

Viewer • Updated Mar 17 • 200 • 36 • 1

authored a paper 3 months ago

CCTU: A Benchmark for Tool Use under Complex Constraints

Paper • 2603.15309 • Published Mar 16 • 2

updated a dataset 3 months ago

Junjie-Ye/CCTU

Viewer • Updated Mar 17 • 200 • 36 • 1

published a dataset 4 months ago

Junjie-Ye/CCTU

Viewer • Updated Mar 17 • 200 • 36 • 1

authored 2 papers 4 months ago

CL-bench: A Benchmark for Context Learning

Paper • 2602.03587 • Published Feb 3 • 23

DFPO: Scaling Value Modeling via Distributional Flow towards Robust and Generalizable LLM Post-Training

Paper • 2602.05890 • Published Feb 5 • 1

authored 4 papers 5 months ago

WisPaper: Your AI Scholar Search Engine

Paper • 2512.06879 • Published Dec 7, 2025 • 1

What Makes a Good Speech Tokenizer for LLM-Centric Speech Generation? A Systematic Study

Paper • 2506.12537 • Published Jun 14, 2025 • 1

Beyond Scaling: Measuring and Predicting the Upper Bound of Knowledge Retention in Language Model Pre-Training

Paper • 2502.04066 • Published Feb 6, 2025

TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities

Paper • 2407.21693 • Published Jul 31, 2024

Junjie Ye

AI & ML interests

Recent Activity

Organizations

Junjie-Ye's activity

是否会继续评测呢？

ToolHop

ToolHop