pix2pix-zero-library

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

akhaliq submitted a paper 20 days ago

Image Generators are Generalist Vision Learners

akhaliq submitted a paper about 1 month ago

MultiGen: Level-Design for Editable Multiplayer Worlds in Diffusion Game Engines

akhaliq submitted a paper about 2 months ago

AVO: Agentic Variation Operators for Autonomous Evolutionary Search

View all activity

akhaliq

submitted a paper to Daily Papers 20 days ago

Image Generators are Generalist Vision Learners

Paper • 2604.20329 • Published 21 days ago • 20

akhaliq

submitted a paper to Daily Papers about 1 month ago

MultiGen: Level-Design for Editable Multiplayer Worlds in Diffusion Game Engines

Paper • 2603.06679 • Published Mar 30 • 6

akhaliq

submitted 3 papers to Daily Papers about 2 months ago

submitted 3 papers to Daily Papers 3 months ago

SE-Bench: Benchmarking Self-Evolution with Knowledge Internalization

Paper • 2602.04811 • Published Feb 4 • 2

Visual Personalization Turing Test

Paper • 2601.22680 • Published Jan 30 • 2

Causal World Modeling for Robot Control

Paper • 2601.21998 • Published Jan 29 • 31

akhaliq

submitted 6 papers to Daily Papers 4 months ago

Motion 3-to-4: 3D Motion Reconstruction for 4D Synthesis

Paper • 2601.14253 • Published Jan 20 • 10

V-DPM: 4D Video Reconstruction with Dynamic Point Maps

Paper • 2601.09499 • Published Jan 14 • 11

UM-Text: A Unified Multimodal Model for Image Understanding

Paper • 2601.08321 • Published Jan 13 • 12

ResTok: Learning Hierarchical Residuals in 1D Visual Tokenizers for Autoregressive Image Generation

Paper • 2601.03955 • Published Jan 7 • 3

FlowBlending: Stage-Aware Multi-Model Sampling for Fast and High-Fidelity Video Generation

Paper • 2512.24724 • Published Dec 31, 2025 • 9

Dream2Flow: Bridging Video Generation and Open-World Manipulation with 3D Object Flow

Paper • 2512.24766 • Published Dec 31, 2025 • 9

akhaliq

submitted 3 papers to Daily Papers 5 months ago

What matters for Representation Alignment: Global Information or Spatial Structure?

Paper • 2512.10794 • Published Dec 11, 2025 • 9

Towards a Science of Scaling Agent Systems

Paper • 2512.08296 • Published Dec 9, 2025 • 17

ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models

Paper • 2512.07843 • Published Nov 24, 2025 • 22

multimodalart

posted an update 7 months ago

Post

27163

Want to iterate on a Hugging Face Space with an LLM?

Now you can easily convert any HF entire repo (Model, Dataset or Space) to a text file and feed it to a language model!

multimodalart/repo2txt

1 reply

akhaliq

authored a paper 7 months ago

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

Paper • 2510.08697 • Published Oct 9, 2025 • 39

multimodalart

posted an update 11 months ago

Post

18376

Self-Forcing - a real-time video distilled model from Wan 2.1 by @adobe is out, and they open sourced it 🐐

I've built a live real time demo on Spaces 📹💨

multimodalart/self-forcing

6 replies

AI & ML interests

Recent Activity

Team members 5

pix2pix-zero-library's activity