Spaces:

Rcarvalo
/

speech-to-speech

Runtime error

App Files Files Community

Rcarvalo commited on 19 days ago

Commit

ce25b23

verified ·

1 Parent(s): 61d492a

Upload README.md with huggingface_hub

Browse files

Files changed (1) hide show

README.md +56 -6

README.md CHANGED Viewed

@@ -1,13 +1,63 @@
 ---
-title: Speech To Speech
-emoji: 🐠
 colorFrom: blue
-colorTo: gray
 sdk: gradio
-sdk_version: 6.0.2
 app_file: app.py
 pinned: false
-license: apache-2.0
 ---
-Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

 ---
+title: LFM2-Audio Speech-to-Speech
+emoji: 🎤
 colorFrom: blue
+colorTo: purple
 sdk: gradio
+sdk_version: 4.0.0
 app_file: app.py
 pinned: false
+license: other
 ---
+# LFM2-Audio Speech-to-Speech Chat
+This is a demo of LFM2-Audio-1.5B, Liquid AI's first end-to-end audio foundation model. Built with low-latency in mind, the lightweight LFM2 backbone enables real-time speech-to-speech conversations without sacrificing quality.
+## Features
+- **Real-time speech-to-speech**: Talk to the model and get audio responses
+- **Multi-turn conversations**: Maintain context across multiple exchanges
+- **Interleaved text and audio**: See the text transcription while hearing the audio
+## How to Use
+1. **Record your voice**: Click the microphone button and speak your message
+2. **Adjust parameters** (optional):
+   - Temperature: Controls randomness (higher = more creative)
+   - Top-k: Limits sampling to top k tokens
+3. **Generate Response**: Click the button to get the model's response
+4. **Listen & Read**: Hear the audio response and read the text transcription
+## Parameters
+- **Temperature**:
+  - 0 = Greedy decoding (most deterministic)
+  - 1.0 = Default (balanced)
+  - 2.0 = Very creative (more random)
+- **Top-k**:
+  - 0 = No filtering
+  - 4 = Default (conservative)
+  - Higher values = more diversity
+## Technical Details
+- Model: LFM2-Audio-1.5B
+- Audio Codec: Mimi (24kHz)
+- Mode: Interleaved generation (optimal for real-time conversations)
+## Requirements
+- GPU recommended for real-time performance
+- Microphone access in your browser
+## Links
+- [Liquid AI Website](https://www.liquid.ai/)
+- [GitHub Repository](https://github.com/Liquid4All/liquid-audio/)
+- [Model on Hugging Face](https://huggingface.co/LiquidAI/LFM2-Audio-1.5B)
+## License
+Licensed under the LFM Open License v1.0