Kyutai

non-profit

Verified

https://kyutai.org/

AI & ML interests

None defined yet.

Recent Activity

ameroyer updated a collection 11 days ago

MoshiRAG Release

ameroyer updated a collection 11 days ago

gabrielatkyutail updated a model 16 days ago

kyutai/pocket-tts-without-voice-cloning

View all activity

Papers

One View Is Enough! Monocular Training for In-the-Wild Novel View Generation

CASA: Cross-Attention via Self-Attention for Efficient Vision-Language Fusion

View all Papers

updated 2 collections 11 days ago

MoshiRAG Release

Candle & PyTorch model checkpoints released as part of the MoshiRAG release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi-rag • 3 items • Updated 11 days ago • 1

Hibiki-Zero

Streaming speech translation without the need for word-level alignments • 4 items • Updated 11 days ago • 2

gabrielatkyutail

updated 2 models 16 days ago

kyutai/pocket-tts-without-voice-cloning

Updated 16 days ago • 7.83k • 24

kyutai/pocket-tts

Updated 16 days ago • 6.71k • 629

updated a model 21 days ago

kyutai/moshi-rag-artifacts

Updated 21 days ago • 1

published a model 21 days ago

kyutai/moshika-rag-pytorch-bf16

Audio-to-Audio • Updated Apr 17 • 616 • 4

in kyutai/mimi 29 days ago

Is this model can called "VAE" (Varietional AutoEncoder)?

#10 opened 29 days ago by

updated 2 models about 1 month ago

kyutai/moshika-rag-pytorch-bf16

Audio-to-Audio • Updated Apr 17 • 616 • 4

kyutai/moshika-rag-candle-bf16

Audio-to-Audio • Updated Apr 17 • 418 • 7

updated a model about 1 month ago

kyutai/ovie

Image-to-Image • Updated Apr 16 • 908 • 12

published a dataset about 1 month ago

kyutai/HaluEvalAudio_1000

Viewer • Updated Apr 15 • 1k • 198

updated a dataset about 1 month ago

kyutai/HaluEvalAudio_1000

Viewer • Updated Apr 15 • 1k • 198

published a model about 1 month ago

kyutai/moshi-rag-artifacts

Updated 21 days ago • 1

updated a model about 1 month ago

kyutai/moshi-rag-artifacts

Updated 21 days ago • 1

published a model about 1 month ago

kyutai/moshika-rag-candle-bf16

Audio-to-Audio • Updated Apr 17 • 418 • 7

published a model about 2 months ago

kyutai/ovie

Image-to-Image • Updated Apr 16 • 908 • 12

updated a collection about 2 months ago

ARC-Encoders

Pretrained ARC-Encoders and a fine-tuning dataset: context compression for unmodified LLMs. • 6 items • Updated Mar 26 • 4

updated a model about 2 months ago

kyutai/ARC4_Encoder_Llama

Feature Extraction • Updated Mar 26

published a model about 2 months ago

kyutai/ARC4_Encoder_Llama

Feature Extraction • Updated Mar 26

authored a paper about 2 months ago

One View Is Enough! Monocular Training for In-the-Wild Novel View Generation

Paper • 2603.23488 • Published Mar 24 • 5