Rui Sun PRO
ThreeSR
AI & ML interests
Vision and Language Multimodal Learning, CV, NLP, LLM
Recent Activity
upvoted a paper 5 days ago
Aligning Agentic World Models via Knowledgeable Experience Learning
upvoted a paper 9 days ago
GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization
upvoted an article 24 days ago
SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data