Transcribe speech instantly with real‑time captions
Configurable Generalist Agent, leader on the AppWorld benchmark
Compare audio representation models