Conversational AI (CoAI) group from Tsinghua University

university

http://coai.cs.tsinghua.edu.cn/

AI & ML interests

Dialogue Systems, Language Generation

Recent Activity

JinfengZhou published a model about 1 month ago

thu-coai/CogFlow-8B

yangjunxiao2021 authored a paper about 1 month ago

LASA: Language-Agnostic Semantic Alignment at the Semantic Bottleneck for LLM Safety

yangjunxiao2021 submitted a paper about 1 month ago

LASA: Language-Agnostic Semantic Alignment at the Semantic Bottleneck for LLM Safety

Papers

The Side Effects of Being Smart: Safety Risks in MLLMs' Multi-Image Reasoning

JinfengZhou

published a model about 1 month ago

thu-coai/CogFlow-8B

yangjunxiao2021

authored a paper about 1 month ago

LASA: Language-Agnostic Semantic Alignment at the Semantic Bottleneck for LLM Safety

Paper • 2604.12710 • Published Apr 13 • 5

yangjunxiao2021

submitted a paper to Daily Papers about 1 month ago

LASA: Language-Agnostic Semantic Alignment at the Semantic Bottleneck for LLM Safety

Paper • 2604.12710 • Published Apr 13 • 5

authored 7 papers 3 months ago

CritiqueLLM: Scaling LLM-as-Critic for Effective and Explainable Evaluation of Large Language Model Generation

Paper • 2311.18702 • Published Nov 30, 2023

AlignBench: Benchmarking Chinese Alignment of Large Language Models

Paper • 2311.18743 • Published Nov 30, 2023 • 1

EVA: An Open-Domain Chinese Dialogue System with Large-Scale Generative Pre-Training

Paper • 2108.01547 • Published Aug 3, 2021

CharacterBench: Benchmarking Character Customization of Large Language Models

Paper • 2412.11912 • Published Dec 16, 2024

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8, 2025 • 212

GLM-5: from Vibe Coding to Agentic Engineering

Paper • 2602.15763 • Published Feb 17 • 150

CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models

Paper • 2311.16832 • Published Nov 28, 2023 • 1

submitted a paper to Daily Papers 3 months ago

IF-RewardBench: Benchmarking Judge Models for Instruction-Following Evaluation

Paper • 2603.04738 • Published Mar 5 • 1

authored 4 papers 3 months ago

Benchmarking Complex Instruction-Following with Multiple Constraints Composition

Paper • 2407.03978 • Published Jul 4, 2024 • 1

IF-CRITIC: Towards a Fine-Grained LLM Critic for Instruction-Following Evaluation

Paper • 2511.01014 • Published Nov 2, 2025 • 1

IF-RewardBench: Benchmarking Judge Models for Instruction-Following Evaluation

Paper • 2603.04738 • Published Mar 5 • 1

HPSS: Heuristic Prompting Strategy Search for LLM Evaluators

Paper • 2502.13031 • Published Feb 18, 2025

published 2 models 4 months ago

thu-coai/IF-CRITIC-Checklist-Generator-14B

Text Generation • 15B • Updated Feb 6 • 8

thu-coai/IF-CRITIC-14B

Text Generation • 15B • Updated Feb 6 • 5 • 2

updated 2 models 4 months ago

thu-coai/IF-CRITIC-Checklist-Generator-14B

Text Generation • 15B • Updated Feb 6 • 8

thu-coai/IF-CRITIC-14B

Text Generation • 15B • Updated Feb 6 • 5 • 2

submitted a paper to Daily Papers 4 months ago

The Side Effects of Being Smart: Safety Risks in MLLMs' Multi-Image Reasoning

Paper • 2601.14127 • Published Jan 20 • 5