2 20 2

Tianqing Fang

tqfang229

https://tqfang.github.io/

AI & ML interests

LLM, Agent

Recent Activity

upvoted a paper about 1 month ago

MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome

upvoted a paper about 2 months ago

Free(): Learning to Forget in Malloc-Only Reasoning Models

upvoted a collection about 2 months ago

Penguin-VL

View all activity

Organizations

None yet

authored 4 papers 3 months ago

WebDevJudge: Evaluating (M)LLMs as Critiques for Web Development Quality

Paper • 2510.18560 • Published Oct 21, 2025 • 1

InComeS: Integrating Compression and Selection Mechanisms into LLMs for Efficient Model Editing

Paper • 2505.22156 • Published May 28, 2025

Guided Self-Evolving LLMs with Minimal Human Supervision

Paper • 2512.02472 • Published Dec 2, 2025 • 55

Inference-Time Scaling of Verification: Self-Evolving Deep Research Agents via Test-Time Rubric-Guided Verification

Paper • 2601.15808 • Published Jan 22 • 20

submitted a paper to Daily Papers 3 months ago

Inference-Time Scaling of Verification: Self-Evolving Deep Research Agents via Test-Time Rubric-Guided Verification

Paper • 2601.15808 • Published Jan 22 • 20

authored 7 papers 7 months ago

Explore to Evolve: Scaling Evolved Aggregation Logic via Proactive Online Exploration for Deep Research Agents

Paper • 2510.14438 • Published Oct 16, 2025 • 14

Benchmarking Commonsense Knowledge Base Population with an Effective Evaluation Dataset

Paper • 2109.07679 • Published Sep 16, 2021

AbsPyramid: Benchmarking the Abstraction Ability of Language Models with a Unified Entailment Graph

Paper • 2311.09174 • Published Nov 15, 2023

AbsInstruct: Eliciting Abstraction Ability from LLMs through Explanation Tuning with Plausibility Estimation

Paper • 2402.10646 • Published Feb 16, 2024

CKBP v2: Better Annotation and Reasoning for Commonsense Knowledge Base Population

Paper • 2304.10392 • Published Apr 20, 2023

UniGist: Towards General and Hardware-aligned Sequence-level Long Context Compression

Paper • 2509.15763 • Published Sep 19, 2025

NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents

Paper • 2510.07172 • Published Oct 8, 2025 • 28

authored 8 papers 9 months ago

KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detection

Paper • 2310.09044 • Published Oct 13, 2023

StoryAnalogy: Deriving Story-level Analogies from Large Language Models to Unlock Analogical Understanding

Paper • 2310.12874 • Published Oct 19, 2023

CAR: Conceptualization-Augmented Reasoner for Zero-Shot Commonsense Question Answering

Paper • 2305.14869 • Published May 24, 2023 • 1

CANDLE: Iterative Conceptualization and Instantiation Distillation from Large Language Models for Commonsense Reasoning

Paper • 2401.07286 • Published Jan 14, 2024

Neuro-Inspired Information-Theoretic Hierarchical Perception for Multimodal Learning

Paper • 2404.09403 • Published Apr 15, 2024

IntentionQA: A Benchmark for Evaluating Purchase Intention Comprehension Abilities of Language Models in E-commerce

Paper • 2406.10173 • Published Jun 14, 2024

LEOPARD : A Vision Language Model For Text-Rich Multi-Image Tasks

Paper • 2410.01744 • Published Oct 2, 2024 • 27

DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects

Paper • 2410.02730 • Published Oct 3, 2024

Tianqing Fang

AI & ML interests

Recent Activity

Organizations

tqfang229's activity