lzZzZx328 (Tang)

upvoted an article 10 months ago

Tiny Agents: an MCP-powered agent in 50 lines of code

Article • Apr 25, 2025 • 308

upvoted a paper 11 months ago

Video-R1: Reinforcing Video Reasoning in MLLMs

Paper • 2503.21776 • Published Mar 27, 2025 • 79

upvoted 2 articles 11 months ago

Run ComfyUI workflows for free with Gradio on Hugging Face Spaces

Article • Jan 14, 2024 • 97

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Article • Dec 9, 2022 • 403

upvoted 5 papers 12 months ago

Visual-RFT: Visual Reinforcement Fine-Tuning

Paper • 2503.01785 • Published Mar 3, 2025 • 86

GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing

Paper • 2503.10639 • Published Mar 13, 2025 • 53

upvoted 7 papers about 1 year ago

TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding

Paper • 2502.19400 • Published Feb 26, 2025 • 47

SurveyX: Academic Survey Automation via Large Language Models

Paper • 2502.14776 • Published Feb 20, 2025 • 100

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8, 2025 • 288

Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains

Paper • 2501.05707 • Published Jan 10, 2025 • 20

VideoRAG: Retrieval-Augmented Generation over Video Corpus

Paper • 2501.05874 • Published Jan 10, 2025 • 75

Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published Nov 5, 2024 • 69

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4, 2025 • 104

upvoted 4 papers over 1 year ago

MALT: Improving Reasoning with Multi-Agent LLM Training

Paper • 2412.01928 • Published Dec 2, 2024 • 45

Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM's Reasoning Capability

Paper • 2411.19943 • Published Nov 29, 2024 • 62

BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments

Paper • 2410.23918 • Published Oct 31, 2024 • 21

Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders

Paper • 2410.22366 • Published Oct 28, 2024 • 84

Transformers without Normalization

HoT: Highlighted Chain of Thought for Referencing Supporting Facts from Inputs

START: Self-taught Reasoner with Tools
