11 40 91

Sukesh Perla

hitchhiker3010

AI & ML interests

None yet

Recent Activity

new activity 8 days ago

unsloth/Qwen3.6-27B-MTP-GGUF:mmproj support for MTP

liked a dataset 13 days ago

yeates/omnipaint-bench

liked a model 13 days ago

yeates/OmniPaint

View all activity

Organizations

upvoted a paper about 1 month ago

Awaking Spatial Intelligence in Unified Multimodal Understanding and Generation

Paper • 2605.04128 • Published May 5 • 17

upvoted 6 papers about 2 months ago

A Self-Evolving Framework for Efficient Terminal Agents via Observational Context Compression

Paper • 2604.19572 • Published Apr 21 • 23

Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges

Paper • 2604.13602 • Published Apr 15 • 32

DeVI: Physics-based Dexterous Human-Object Interaction via Synthetic Video Imitation

Paper • 2604.20841 • Published Apr 22 • 24

EditCrafter: Tuning-free High-Resolution Image Editing via Pretrained Diffusion Model

Paper • 2604.10268 • Published Apr 11 • 12

Context Unrolling in Omni Models

Paper • 2604.21921 • Published Apr 23 • 14

TingIS: Real-time Risk Event Discovery from Noisy Customer Incidents at Enterprise Scale

Paper • 2604.21889 • Published Apr 23 • 12

upvoted 3 articles 4 months ago

Article

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

•

Jan 27

• 77

Article

AssetOpsBench: Bridging the Gap Between AI Agent Benchmarks and Industrial Reality

ibm-research

•

Jan 21

• 33

Article

We Got Claude to Build CUDA Kernels and teach open models!

burtenshaw, evalstate, merve, pcuenq

•

Jan 28

• 157

upvoted a collection 12 months ago

digital-human

Collection

27 items • Updated Feb 28 • 9

upvoted a collection about 1 year ago

X2I Dataset

Collection

Datasets used in OmniGen. • 5 items • Updated Jul 5, 2025 • 19

upvoted 4 papers about 1 year ago

R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization

Paper • 2503.12937 • Published Mar 17, 2025 • 30

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

Paper • 2503.12605 • Published Mar 16, 2025 • 35

BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing

Paper • 2503.13434 • Published Mar 17, 2025 • 28

Personalize Anything for Free with Diffusion Transformer

Paper • 2503.12590 • Published Mar 16, 2025 • 44

upvoted 2 papers over 1 year ago

GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control

Paper • 2503.03751 • Published Mar 5, 2025 • 25

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published Feb 3, 2025 • 225

upvoted a collection over 1 year ago

Gradio WebRTC Cookbook ⚡️

Collection

Collection of real-time voice and video demos built with gradio-webrtc custom component • 8 items • Updated Dec 10, 2024 • 19

upvoted a paper over 1 year ago

Shiksha: A Technical Domain focused Translation Dataset and Model for Indian Languages

Paper • 2412.09025 • Published Dec 12, 2024 • 4

Sukesh Perla

AI & ML interests

Recent Activity

Organizations

hitchhiker3010's activity

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

AssetOpsBench: Bridging the Gap Between AI Agent Benchmarks and Industrial Reality

We Got Claude to Build CUDA Kernels and teach open models!