nekomeowww (Neko Ayaka)

upvoted 2 papers 3 months ago

DFlash: Block Diffusion for Flash Speculative Decoding

Paper • 2602.06036 • Published Feb 5 • 81

ProAct: Agentic Lookahead in Interactive Environments

Paper • 2602.05327 • Published Feb 5 • 27

upvoted 3 papers 4 months ago

D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI

Paper • 2510.05684 • Published Oct 7, 2025 • 146

FrankenMotion: Part-level Human Motion Generation and Composition

Paper • 2601.10909 • Published Jan 15 • 19

RigMo: Unifying Rig and Motion Learning for Generative Animation

Paper • 2601.06378 • Published Jan 10 • 12

upvoted a paper 8 months ago

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

Paper • 2509.24002 • Published Sep 28, 2025 • 180

upvoted 2 articles 10 months ago

Article

Blazingly fast whisper transcriptions with Inference Endpoints

+4

mfuntowicz, freddyaboulton, Steveeeeeeen, reach-vb, erikkaum, michellehbn

•

May 13, 2025

• 82

Article

Vision Language Models (Better, faster, stronger)

+3

merve, sergiopaniego, ariG23498, pcuenq, andito

•

May 12, 2025

• 613

upvoted a paper 12 months ago

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Paper • 2411.15124 • Published Nov 22, 2024 • 68

upvoted a collection about 1 year ago

Physical AI

Collection

Collection of open, commercial-grade datasets for physical AI developers • 50 items • Updated about 20 hours ago • 156

upvoted an article about 1 year ago

Article

Cohere on Hugging Face Inference Providers 🔥

+5

reach-vb, burtenshaw, merve, celinah, alexrs, julien-c, sbrandeis

•

Apr 16, 2025

• 129

upvoted a collection about 1 year ago

Spaces for Audio / Voices

Collection

543 items • Updated 4 days ago • 33

upvoted 3 articles about 1 year ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

+2

ariG23498, merve, pcuenq, reach-vb

•

Mar 12, 2025

• 497

Article

SmolVLM2: Bringing Video Understanding to Every Device

+5

orrzohar, mfarre, andito, merve, pcuenq, cyrilzakka, Xenova

•

Feb 20, 2025

• 338

Article

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

+2

saurabhdash, olivernan, ArashAhmadian, johndang-cohere

•

Mar 4, 2025

• 78

upvoted a collection about 1 year ago

InternVL2.5

Collection

Better than InternVL 2.0 • 17 items • Updated Mar 2 • 93

upvoted an article about 1 year ago

Article

Making Browser-Based Inference Actually Usable

wizenheimer

•

Mar 1, 2025

• 10

Neko Ayaka

AI & ML interests

Organizations

DFlash: Block Diffusion for Flash Speculative Decoding

ProAct: Agentic Lookahead in Interactive Environments

D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI

FrankenMotion: Part-level Human Motion Generation and Composition

RigMo: Unifying Rig and Motion Learning for Generative Animation

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

Blazingly fast whisper transcriptions with Inference Endpoints

Vision Language Models (Better, faster, stronger)

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Physical AI

Cohere on Hugging Face Inference Providers 🔥

Spaces for Audio / Voices

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

SmolVLM2: Bringing Video Understanding to Every Device

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

InternVL2.5

Making Browser-Based Inference Actually Usable

Neko Ayaka

AI & ML interests

Organizations

nekomeowww's activity

Blazingly fast whisper transcriptions with Inference Endpoints

Vision Language Models (Better, faster, stronger)

Cohere on Hugging Face Inference Providers 🔥

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

SmolVLM2: Bringing Video Understanding to Every Device

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

Making Browser-Based Inference Actually Usable