Warshawsky

EladofWar

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

GUI vs. CLI: Execution Bottlenecks in Screen-Only and Skill-Mediated Computer-Use Agents

upvoted a paper 7 days ago

Semantic Browsing: Controllable Diversity for Image Generation

upvoted a paper 7 days ago

Wan-Streamer v0.1: End-to-end Real-time Interactive Foundation Models

View all activity

Organizations

None yet

upvoted a paper 4 days ago

GUI vs. CLI: Execution Bottlenecks in Screen-Only and Skill-Mediated Computer-Use Agents

Paper • 2606.24551 • Published 11 days ago • 28

upvoted 2 papers 7 days ago

Semantic Browsing: Controllable Diversity for Image Generation

Paper • 2606.23679 • Published 11 days ago • 20

Wan-Streamer v0.1: End-to-end Real-time Interactive Foundation Models

Paper • 2606.25041 • Published 10 days ago • 111

upvoted 3 papers about 1 month ago

SANA-Streaming: Real-time Streaming Video Editing with Hybrid Diffusion Transformer

Paper • 2605.30409 • Published May 28 • 42

Representation Forcing for Bottleneck-Free Unified Multimodal Models

Paper • 2605.31604 • Published May 29 • 63

LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards

Paper • 2605.31584 • Published May 29 • 43

upvoted a paper 2 months ago

Meta-CoT: Enhancing Granularity and Generalization in Image Editing

Paper • 2604.24625 • Published Apr 27 • 26

upvoted 2 papers 5 months ago

FrankenMotion: Part-level Human Motion Generation and Composition

Paper • 2601.10909 • Published Jan 15 • 19

Your Group-Relative Advantage Is Biased

Paper • 2601.08521 • Published Jan 13 • 158

upvoted 11 papers 6 months ago

InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields

Paper • 2601.03252 • Published Jan 6 • 104

LTX-2: Efficient Joint Audio-Visual Foundation Model

Paper • 2601.03233 • Published Jan 6 • 183

MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization

Paper • 2601.01554 • Published Jan 4 • 62

On the Role of Discreteness in Diffusion LLMs

Paper • 2512.22630 • Published Dec 27, 2025 • 18

Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models

Paper • 2512.24618 • Published Dec 31, 2025 • 155

ProEdit: Inversion-based Editing From Prompts Done Right

Paper • 2512.22118 • Published Dec 26, 2025 • 19

Mindscape-Aware Retrieval Augmented Generation for Improved Long Context Understanding

Paper • 2512.17220 • Published Dec 19, 2025 • 115

Latent Implicit Visual Reasoning

Paper • 2512.21218 • Published Dec 24, 2025 • 70

HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming

Paper • 2512.21338 • Published Dec 24, 2025 • 23

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

Paper • 2512.20557 • Published Dec 23, 2025 • 52

TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times

Paper • 2512.16093 • Published Dec 18, 2025 • 97