Hiroaki OGASAWARA

xhiroga

1 45

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago

numind/NuExtract3

liked a Space about 2 months ago

Qwen/Qwen3-VL-Demo

liked a model 2 months ago

datalab-to/chandra-ocr-2

View all activity

Organizations

liked a model about 1 month ago

numind/NuExtract3

Image-to-Text • 5B • Updated 10 days ago • 738k • 277

liked a Space about 2 months ago

Qwen3 VL Demo

😻

451

Chat with AI using text and images

liked a model 2 months ago

datalab-to/chandra-ocr-2

Image-Text-to-Text • 5B • Updated 13 days ago • 1.25M • 436

liked a dataset 3 months ago

DataPilot/AItuber-Personas-Japan

Viewer • Updated Mar 16 • 195 • 62 • 28

liked a model 5 months ago

deepseek-ai/DeepSeek-OCR-2

Image-Text-to-Text • 3B • Updated Feb 3 • 3.32M • 1.03k

liked a Space 5 months ago

Qwen3-TTS Demo

🎙

2.02k

Generate speech from text using voice design, cloning or presets

updated a dataset 6 months ago

xhiroga/data

Viewer • Updated Jan 3 • 1 • 12 • 1

liked a dataset 7 months ago

Seed3D/Articulation-XL2.0

Updated Sep 19, 2025 • 187 • 33

liked a model 7 months ago

VAST-AI/UniRig

Updated Aug 1, 2025 • 86

liked a model 9 months ago

microsoft/Phi-4-multimodal-instruct

Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 536k • 1.61k

liked a Space 9 months ago

Open ASR Leaderboard

🏆

1.4k

Compare speech‑recognition models by WER and speed

liked 2 models 9 months ago

nguyenvulebinh/AV-HuBERT-MuAViC-multilingual

Text Generation • 0.4B • Updated Mar 6, 2025 • 38 • 2

meta-llama/Llama-3.2-3B

Text Generation • 3B • Updated Oct 24, 2024 • 646k • • 851

upvoted a paper 9 months ago

Zero-AVSR: Zero-Shot Audio-Visual Speech Recognition with LLMs by Learning Language-Agnostic Speech Representations

Paper • 2503.06273 • Published Mar 8, 2025 • 6

liked a model 10 months ago

fierce-cats/beatrice-trainer

Audio-to-Audio • Updated Aug 30, 2025 • 43

updated a dataset 10 months ago

xhiroga/hiroga-speech

Updated Sep 14, 2025 • 22

published a dataset 10 months ago