Extract speaker information from audio
Transcribe audio to text with optional punctuation
Compare and rank AI models using performance metrics