Boyuan Sun

BBBBCHAN

·

https://bbbbchan.github.io/

BBBBCHAN

AI & ML interests

None yet

Recent Activity

upvoted a paper 18 days ago

Beyond Scalar Rewards by Internalizing Reasoning into Score Distributions

upvoted a paper 20 days ago

MLEvolve: A Self-Evolving Framework for Automated Machine Learning Algorithm Discovery

submitted a paper about 1 month ago

See What I Mean: Aligning Vision and Language Representations for Video Fine-grained Object Understanding

View all activity

Organizations

upvoted a paper 18 days ago

Beyond Scalar Rewards by Internalizing Reasoning into Score Distributions

Paper • 2606.09076 • Published 22 days ago • 63

upvoted a paper 20 days ago

MLEvolve: A Self-Evolving Framework for Automated Machine Learning Algorithm Discovery

Paper • 2606.06473 • Published 26 days ago • 19

upvoted a paper about 1 month ago

See What I Mean: Aligning Vision and Language Representations for Video Fine-grained Object Understanding

Paper • 2605.18018 • Published May 18 • 33

upvoted a paper 2 months ago

Mutual Forcing: Dual-Mode Self-Evolution for Fast Autoregressive Audio-Video Character Generation

Paper • 2604.25819 • Published Apr 28 • 17

upvoted a paper 4 months ago

GeoAgent: Learning to Geolocate Everywhere with Reinforced Geographic Characteristics

Paper • 2602.12617 • Published Feb 13 • 20

upvoted 4 papers 11 months ago

A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models

Paper • 2508.01548 • Published Aug 3, 2025 • 14

Towards RAW Object Detection in Diverse Conditions

Paper • 2411.15678 • Published Nov 24, 2024 • 1

Facial Dynamics in Video: Instruction Tuning for Improved Facial Expression Perception and Contextual Awareness

Paper • 2501.07978 • Published Jan 14, 2025 • 1

Gaussian Splatting with Discretized SDF for Relightable Assets

Paper • 2507.15629 • Published Jul 21, 2025 • 23

upvoted 8 papers 12 months ago

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1, 2025 • 257

Depth Anything at Any Condition

Paper • 2507.01634 • Published Jul 2, 2025 • 49

LLaVA-Octopus: Unlocking Instruction-Driven Adaptive Projector Fusion for Video Understanding

Paper • 2501.05067 • Published Jan 9, 2025 • 1

HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding

Paper • 2501.15111 • Published Jan 25, 2025 • 1

HumanOmniV2: From Understanding to Omni-Modal Reasoning with Context

Paper • 2506.21277 • Published Jun 26, 2025 • 14

Fine-Grained Preference Optimization Improves Spatial Reasoning in VLMs

Paper • 2506.21656 • Published Jun 26, 2025 • 16

VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning

Paper • 2504.07960 • Published Apr 10, 2025 • 50

LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs

Paper • 2506.21862 • Published Jun 27, 2025 • 36

upvoted 2 papers about 1 year ago

CorrMatch: Label Propagation via Correlation Matching for Semi-Supervised Semantic Segmentation

Paper • 2306.04300 • Published Jun 7, 2023 • 2

R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcing Learning

Paper • 2503.05379 • Published Mar 7, 2025 • 38