meituan

company

Verified

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

Roy0702 authored a paper about 1 month ago

Symbol-LLM: Towards Foundational Symbol-centric Interface For Large Language Models

Roy0702 authored a paper about 1 month ago

On the Efficacy of Eviction Policy for Key-Value Constrained Generative Language Model Inference

Roy0702 authored a paper about 1 month ago

A Controlled Study on Long Context Extension and Generalization in LLMs

View all activity

Papers

DiningBench: A Hierarchical Multi-view Benchmark for Perception and Reasoning in the Dietary Domain

ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners?

View all Papers

authored 11 papers about 1 month ago

Symbol-LLM: Towards Foundational Symbol-centric Interface For Large Language Models

Paper • 2311.09278 • Published Nov 15, 2023 • 9

On the Efficacy of Eviction Policy for Key-Value Constrained Generative Language Model Inference

Paper • 2402.06262 • Published Feb 9, 2024

A Controlled Study on Long Context Extension and Generalization in LLMs

Paper • 2409.12181 • Published Sep 18, 2024 • 45

Context Compression for Auto-regressive Transformers with Sentinel Tokens

Paper • 2310.08152 • Published Oct 12, 2023 • 1

Audio Turing Test: Benchmarking the Human-likeness of Large Language Model-based Text-to-Speech Systems in Chinese

Paper • 2505.11200 • Published May 16, 2025

LongCat-Flash-Omni Technical Report

Paper • 2511.00279 • Published Oct 31, 2025 • 27

LongCat-Video Technical Report

Paper • 2510.22200 • Published Oct 25, 2025 • 39

EvalTalker: Learning to Evaluate Real-Portrait-Driven Multi-Subject Talking Humans

Paper • 2512.01340 • Published Dec 1, 2025

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published Mar 29 • 149

EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling

Paper • 2310.04691 • Published Oct 7, 2023 • 3

WBench: A Comprehensive Multi-turn Benchmark for Interactive Video World Model Evaluation

Paper • 2605.25874 • Published May 25 • 103

authored 3 papers about 2 months ago

Learning to Self-Verify Makes Language Models Better Reasoners

Paper • 2602.07594 • Published Feb 7 • 3

Collaborative Multi-Agent Optimization for Personalized Memory System

Paper • 2603.12631 • Published Mar 13

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

Paper • 2605.06130 • Published May 7 • 116

in meituan/DeepSeek-R1-Channel-INT8 about 2 months ago

Fix chat_template crash when assistant message omits the `content` key

#14 opened about 2 months ago by

updated a dataset 2 months ago

meituan/LIBERO-X

Viewer • Updated Apr 29 • 889k • 3.15k • 3

published a dataset 2 months ago

meituan/LIBERO-X

Viewer • Updated Apr 29 • 889k • 3.15k • 3

in meituan/DiningBench 3 months ago

Add task categories, language tags, and paper link

#1 opened 3 months ago by

authored a paper 3 months ago

General365: Benchmarking General Reasoning in Large Language Models Across Diverse and Challenging Tasks

Paper • 2604.11778 • Published Apr 13 • 10

submitted a paper to Daily Papers 3 months ago

DiningBench: A Hierarchical Multi-view Benchmark for Perception and Reasoning in the Dietary Domain

Paper • 2604.10425 • Published Apr 12 • 3