S.F.'s picture

S.F.

search-facility

·

ipv6

AI & ML interests

None yet

Recent Activity

upvoted a paper about 7 hours ago

FLAT: Feedforward Latent Triangle Splatting for Geometrically Accurate Scene Generation

upvoted a paper 5 days ago

ImageWAM: Do World Action Models Really Need Video Generation, or Just Image Editing?

upvoted a paper 5 days ago

DragMesh-2: Physically Plausible Dexterous Hand-Object Interaction with Articulated Objects

View all activity

Organizations

None yet

upvoted a paper about 7 hours ago

FLAT: Feedforward Latent Triangle Splatting for Geometrically Accurate Scene Generation

Paper • 2606.24876 • Published 3 days ago • 15

upvoted 2 papers 5 days ago

ImageWAM: Do World Action Models Really Need Video Generation, or Just Image Editing?

Paper • 2606.19531 • Published 9 days ago • 18

DragMesh-2: Physically Plausible Dexterous Hand-Object Interaction with Articulated Objects

Paper • 2606.15133 • Published 13 days ago • 72

upvoted 2 papers 7 days ago

Rethinking the Role of Efficient Attention in Hybrid Architectures

Paper • 2606.15378 • Published 13 days ago • 17

Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients

Paper • 2606.18216 • Published 10 days ago • 61

upvoted a paper 8 days ago

Who Should Lead Decoding Now? Tracking Reliable Trajectories for Ensembling Masked Diffusion Language Models

Paper • 2606.16281 • Published 11 days ago • 34

upvoted a paper 16 days ago

Direct 3D-Aware Object Insertion via Decomposed Visual Proxies

Paper • 2606.06601 • Published 22 days ago • 26

upvoted 2 papers 20 days ago

Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning

Paper • 2606.04923 • Published 23 days ago • 40

Cosmos 3: Omnimodal World Models for Physical AI

Paper • 2606.02800 • Published 25 days ago • 135

upvoted a paper 21 days ago

From Activation to Causality: Discovery of Causal Visual Representations in the Human Brain

Paper • 2605.23895 • Published May 22 • 52

upvoted a paper 28 days ago

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published about 1 month ago • 144

upvoted 9 papers about 1 month ago

Toto 2.0: Time Series Forecasting Enters the Scaling Era

Paper • 2605.20119 • Published May 19 • 39

Enhancing Train-Free Infinite-Frame Generation for Consistent Long Videos

Paper • 2605.18233 • Published May 18 • 93

Reinforcement Learning via Self-Distillation

Paper • 2601.20802 • Published Jan 28 • 50

GATES: Self-Distillation under Privileged Context with Consensus Gating

Paper • 2602.20574 • Published Feb 24 • 1

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published May 12 • 196

When Vision Speaks for Sound

Paper • 2605.16403 • Published May 13 • 161

Active Learners as Efficient PRP Rerankers

Paper • 2605.14236 • Published May 15 • 98

Qwen-Image-VAE-2.0 Technical Report

Paper • 2605.13565 • Published May 13 • 62

Pixal3D: Pixel-Aligned 3D Generation from Images

Paper • 2605.10922 • Published May 11 • 33