MENTOR: Efficient Multimodal-Conditioned Tuning for Autoregressive Vision Generation Models Paper • 2507.09574 • Published Jul 13, 2025
GLTW: Joint Improved Graph Transformer and LLM via Three-Word Language for Knowledge Graph Completion Paper • 2502.11471 • Published Feb 17, 2025 • 2
BIRD-INTERACT: Re-imagining Text-to-SQL Evaluation for Large Language Models via Lens of Dynamic Interactions Paper • 2510.05318 • Published Oct 6, 2025 • 22
A Goal Without a Plan Is Just a Wish: Efficient and Effective Global Planner Training for Long-Horizon Agent Tasks Paper • 2510.05608 • Published Oct 7, 2025 • 4
From Context to EDUs: Faithful and Structured Context Compression via Elementary Discourse Unit Decomposition Paper • 2512.14244 • Published Dec 16, 2025 • 2
FaithLens: Detecting and Explaining Faithfulness Hallucination Paper • 2512.20182 • Published Dec 23, 2025 • 9
InFi-Check: Interpretable and Fine-Grained Fact-Checking of LLMs Paper • 2601.06666 • Published Jan 10 • 1
From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published 3 days ago • 128
From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published 3 days ago • 128
FaithLens: Detecting and Explaining Faithfulness Hallucination Paper • 2512.20182 • Published Dec 23, 2025 • 9
SWE-SQL: Illuminating LLM Pathways to Solve User SQL Issues in Real-World Applications Paper • 2506.18951 • Published Jun 23, 2025 • 22
Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learning Paper • 2505.16483 • Published May 22, 2025 • 10
Looking Beyond Text: Reducing Language bias in Large Vision-Language Models via Multimodal Dual-Attention and Soft-Image Guidance Paper • 2411.14279 • Published Nov 21, 2024
UltraIF: Advancing Instruction Following from the Wild Paper • 2502.04153 • Published Feb 6, 2025 • 24
Selecting Influential Samples for Long Context Alignment via Homologous Models' Guidance and Contextual Awareness Measurement Paper • 2410.15633 • Published Oct 21, 2024 • 7
SpokenWOZ: A Large-Scale Speech-Text Benchmark for Spoken Task-Oriented Dialogue Agents Paper • 2305.13040 • Published May 22, 2023 • 2
ML-Bench: Large Language Models Leverage Open-source Libraries for Machine Learning Tasks Paper • 2311.09835 • Published Nov 16, 2023 • 11
One Shot Learning as Instruction Data Prospector for Large Language Models Paper • 2312.10302 • Published Dec 16, 2023 • 2
MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning Paper • 2309.07915 • Published Sep 14, 2023 • 4