7 21 33

Bowen Peng

bloc97

bloc97

AI & ML interests

Machine Learning, Computer Graphics, Language Models

Recent Activity

upvoted a paper 6 days ago

GameCraft-Bench: Can Agents Build Playable Games End-to-End in a Real Game Engine?

liked a model 18 days ago

ideogram-ai/ideogram-4-fp8

upvoted a paper 25 days ago

JLT: Clean-Latent Prediction in Latent Diffusion Transformers

View all activity

Organizations

upvoted a paper 6 days ago

GameCraft-Bench: Can Agents Build Playable Games End-to-End in a Real Game Engine?

Paper • 2606.17861 • Published 9 days ago • 55

upvoted a paper 25 days ago

JLT: Clean-Latent Prediction in Latent Diffusion Transformers

Paper • 2605.27102 • Published about 1 month ago • 33

upvoted 4 papers about 1 month ago

Decoupling the Benefits of Subword Tokenization for Language Model Training via Byte-level Simulation

Paper • 2604.27263 • Published May 14 • 11

upvoted a collection about 1 year ago

Nemotron-UltraLong

Collection

3 items • Updated 13 days ago • 19

upvoted a paper over 1 year ago

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

Paper • 2501.12326 • Published Jan 21, 2025 • 65

upvoted 2 papers almost 2 years ago

Hermes 3 Technical Report

Paper • 2408.11857 • Published Aug 15, 2024 • 60

Wavelets Are All You Need for Autoregressive Image Generation

Paper • 2406.19997 • Published Jun 28, 2024 • 31

upvoted a collection about 2 years ago

Meta Llama 3

Collection

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 975

upvoted 7 papers over 2 years ago

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Paper • 2403.09611 • Published Mar 14, 2024 • 130

V3D: Video Diffusion Models are Effective 3D Generators

Paper • 2403.06738 • Published Mar 11, 2024 • 30

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6, 2024 • 191

Resonance RoPE: Improving Context Length Generalization of Large Language Models

Paper • 2403.00071 • Published Feb 29, 2024 • 24

Beyond Language Models: Byte Models are Digital World Simulators

Paper • 2402.19155 • Published Feb 29, 2024 • 53

LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens

Paper • 2402.13753 • Published Feb 21, 2024 • 116

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Paper • 2312.11514 • Published Dec 12, 2023 • 264

upvoted 2 papers almost 3 years ago

Language Modeling Is Compression

Paper • 2309.10668 • Published Sep 19, 2023 • 85

Contrastive Decoding Improves Reasoning in Large Language Models

Paper • 2309.09117 • Published Sep 17, 2023 • 40

Bowen Peng

AI & ML interests

Recent Activity

Organizations

bloc97's activity