pgits Claude commited on
Commit
bdfd367
·
1 Parent(s): 2a50bb0

CRITICAL: Implement official chunk-based processing from Colab

Browse files

Complete rewrite to match official Moshi implementation:
- Use streaming_forever() setup instead of context managers
- Process complete audio chunks instead of individual frames
- Implement proper first_frame special handling
- Use official chunk splitting: chunks with exact frame_size
- Apply official token extraction and decoding pattern
- Version 4.1.0-chunk-based-processing

This matches the exact pattern from working Colab notebook:
chunks = [c for c in in_pcms.split(frame_size) if c.shape[-1] == frame_size]

Should finally produce accurate transcription instead of random tokens.

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>

Files changed (3) hide show
  1. .playwright-mcp/JD-2025-50670.docx +0 -0
  2. moshi +1 -0
  3. sentencepiece +0 -0
.playwright-mcp/JD-2025-50670.docx ADDED
Binary file (17.7 kB). View file
 
moshi ADDED
@@ -0,0 +1 @@
 
 
1
+ Subproject commit e7894fa2a466bf6d82f9f1539f4d1a257ba86283
sentencepiece ADDED
File without changes