BAIDU

company

Verified

https://www.baidu.com/

AI & ML interests

None defined yet.

Recent Activity

HYPERUU authored a paper 2 days ago

Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR

HYPERUU updated a model 3 days ago

baidu/Unlimited-OCR

jzhang533 updated a Space 4 days ago

Papers

Unlimited OCR Works

Memento: Reconstruct to Remember for Consistent Long Video Generation

Articles

Qianfan-VL: A Milestone Achievement in Chinese Multimodal AI with Domestic Chips

Unleashing the Full Potential of ERNIE4.5 using FastDeploy

PP-OCRv5 on Hugging Face: A Specialized Approach to OCR

authored a paper 2 days ago

Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR

Paper • 2602.05261 • Published Feb 5 • 54

updated a model 3 days ago

baidu/Unlimited-OCR

Image-Text-to-Text • 3B • Updated 3 days ago • 213k • 1.13k

updated 2 Spaces 4 days ago

README

Unlimited OCR

Extract text from images and PDFs with streaming OCR

authored 2 papers 4 days ago

Slow Perception: Let's Perceive Geometric Figures Step-by-step

Paper • 2412.20631 • Published Dec 30, 2024 • 16

Unlimited OCR Works

Paper • 2606.23050 • Published 6 days ago • 37

submitted a paper to Daily Papers 5 days ago

Unlimited OCR Works

Paper • 2606.23050 • Published 6 days ago • 37

updated a Space 5 days ago

NAVA Audio-Video Generator

Native AV alignment — joint video + audio generation

published a model 6 days ago

baidu/Unlimited-OCR

Image-Text-to-Text • 3B • Updated 3 days ago • 213k • 1.13k

authored a paper 9 days ago

PP-OCRv6: From 1.5M to 34.5M Parameters, Surpassing Billion-Scale VLMs on OCR Tasks

Paper • 2606.13108 • Published 17 days ago • 8

authored a paper 11 days ago

Memento: Reconstruct to Remember for Consistent Long Video Generation

Paper • 2606.14667 • Published 16 days ago • 17

submitted a paper to Daily Papers 12 days ago

Memento: Reconstruct to Remember for Consistent Long Video Generation

Paper • 2606.14667 • Published 16 days ago • 17

submitted a paper to Daily Papers 19 days ago

DuMate-DeepResearch: An Auditable Multi-Agent System with Recursive Search and Rubric-Grounded Reasoning

Paper • 2606.07299 • Published 23 days ago • 7

in baidu/NAVA 19 days ago

Please make a Comfyui workflow.

#3 opened 27 days ago by

submitted a paper to Daily Papers 20 days ago

When Tools Fail: Benchmarking Dynamic Replanning and Anomaly Recovery in LLM Agents

Paper • 2606.05806 • Published 24 days ago • 23

authored 4 papers 22 days ago

Sparse Growing Transformer: Training-Time Sparse Depth Allocation via Progressive Attention Looping

Paper • 2603.23998 • Published Apr 16 • 1

Learning to Generate via Understanding: Understanding-Driven Intrinsic Rewarding for Unified Multimodal Models

Paper • 2603.06043 • Published Mar 6

Blink: Dynamic Visual Token Resolution for Enhanced Multimodal Understanding

Paper • 2512.10548 • Published May 23

V-ITI: Mitigating Hallucinations in Multimodal Large Language Models via Visual Inference-Time Intervention

Paper • 2512.03542 • Published Dec 3, 2025