Amasia-MLLM (Amasia NYU)

jihanyang

authored a paper 2 months ago

Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders

Paper • 2601.16208 • Published Jan 22 • 55

jihanyang

authored 2 papers 5 months ago

Cambrian-S: Towards Spatial Supersensing in Video

Paper • 2511.04670 • Published Nov 6, 2025 • 39

Benchmark Designers Should "Train on the Test Set" to Expose Exploitable Non-Visual Shortcuts

Paper • 2511.04655 • Published Nov 6, 2025 • 10

EdwinHuang

authored a paper 5 months ago

Cambrian-S: Towards Spatial Supersensing in Video

Paper • 2511.04670 • Published Nov 6, 2025 • 39

conan1024hao

authored a paper 6 months ago

VIR-Bench: Evaluating Geospatial and Temporal Understanding of MLLMs via Travel Video Itinerary Reconstruction

Paper • 2509.19002 • Published Sep 23, 2025 • 3

conan1024hao

updated a dataset 8 months ago

Amasia-MLLM/AmasiaIns-1M

Preview • Updated Aug 5, 2025 • 113

conan1024hao

authored a paper 10 months ago

Traveling Across Languages: Benchmarking Cross-Lingual Consistency in Multimodal LLMs

Paper • 2505.15075 • Published May 21, 2025 • 1

conan1024hao

updated a model about 1 year ago

Amasia-MLLM/Qwen2.5-VL-3B-GRPO-he-5step-20250303

4B • Updated Mar 3, 2025 • 3

conan1024hao

published a model about 1 year ago

Amasia-MLLM/Qwen2.5-VL-3B-GRPO-he-5step-20250303

4B • Updated Mar 3, 2025 • 3

conan1024hao

updated a model about 1 year ago

Amasia-MLLM/Qwen2.5-VL-3B-GRPO-zh-5step-20250228

4B • Updated Feb 28, 2025 • 2

conan1024hao

published a model about 1 year ago

Amasia-MLLM/Qwen2.5-VL-3B-GRPO-zh-5step-20250228

4B • Updated Feb 28, 2025 • 2

jihanyang

authored a paper about 1 year ago

UniTok: A Unified Tokenizer for Visual Generation and Understanding

Paper • 2502.20321 • Published Feb 27, 2025 • 30

conan1024hao

updated a dataset about 1 year ago

Amasia-MLLM/A-OKVQA

Viewer • Updated Feb 27, 2025 • 2.08k • 6

conan1024hao

published a dataset about 1 year ago

Amasia-MLLM/A-OKVQA

Viewer • Updated Feb 27, 2025 • 2.08k • 6

jihanyang

authored a paper about 1 year ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28, 2025 • 125

conan1024hao

published a dataset about 1 year ago

Amasia-MLLM/GLDv2-images

Updated Jan 19, 2025 • 2

conan1024hao

updated a dataset about 1 year ago

Amasia-MLLM/Cambrian-Pangea-11M

Updated Jan 24, 2025 • 6

conan1024hao

published 2 datasets about 1 year ago

Amasia-MLLM/AmasiaIns-1M

Preview • Updated Aug 5, 2025 • 113

Amasia-MLLM/GMP-images

Updated Dec 27, 2024 • 2

conan1024hao

updated a dataset about 1 year ago

Amasia-MLLM/GLDv2-images

Updated Jan 19, 2025 • 2

AI & ML interests

Team members 3

Amasia-MLLM's activity