metadata
language:
- en
pipeline_tag: text-to-speech
license: apache-2.0
base_model: unsloth/orpheus-3b-0.1-ft
datasets:
- nyuuzyou/asmr
tags:
- asmr
- lora
co2_eq_emissions:
  emissions: 1280
  source: Calculated based on power consumption and regional carbon intensity
  training_type: fine-tuning
  geographical_location: Chelyabinsk, Russia
  hardware_used: 1 RTX 4090 GPU
Orpheus 3B ASMR (Merged)
An Orpheus 3B model fine-tuned on ASMR data with a LoRA adapter, then merged into the base weights, for improved soft-spoken speech generation.
Model Details
- Base: unsloth/orpheus-3b-0.1-ft
- Training Data: nyuuzyou/asmr (283K clips, 307 hours)
- Training: 170,000 steps (~40 hours on a single RTX 4090)
- Type: Merged model (LoRA + base weights)
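To illustrate what "merged" means here, the sketch below folds a LoRA update into a base weight matrix using the standard formula W' = W + (alpha / r) * B A. It is a toy example in plain Python: the matrix sizes, rank, and alpha are hypothetical, since this card does not state the adapter configuration, and it is not the script used to produce this model.

```python
# Toy illustration of folding a LoRA update into a base weight matrix.
# All shapes and values are hypothetical; the card does not state rank or alpha.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def matvec(w, x):
    """Compute W @ x for a matrix W and a vector x."""
    return [sum(wij * xj for wij, xj in zip(row, x)) for row in w]

def merge_lora(w, lora_a, lora_b, alpha, rank):
    """Return W + (alpha / rank) * B @ A as a new matrix."""
    scale = alpha / rank
    delta = matmul(lora_b, lora_a)
    return [
        [wij + scale * dij for wij, dij in zip(w_row, d_row)]
        for w_row, d_row in zip(w, delta)
    ]

# Base weight: 2 outputs x 3 inputs. Adapter of rank 1: B is 2x1, A is 1x3.
W = [[1.0, 0.0, 2.0], [0.0, 1.0, -1.0]]
A = [[0.5, -0.5, 1.0]]
B = [[2.0], [4.0]]
ALPHA, RANK = 2.0, 1

W_merged = merge_lora(W, A, B, ALPHA, RANK)

x = [1.0, 2.0, 3.0]
# Unmerged path: base output plus the scaled adapter output.
base_out = matvec(W, x)
adapter_out = matvec(B, matvec(A, x))
unmerged = [b + (ALPHA / RANK) * a for b, a in zip(base_out, adapter_out)]
# Merged path: a single matrix-vector product with the folded weights.
merged = matvec(W_merged, x)
```

After merging, the adapter no longer needs to be loaded separately: one matrix product with the folded weights gives the same result as the base output plus the scaled adapter output.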
Capabilities
- Enhanced soft-spoken speech on pre-trained voices (e.g., "tara")
- Improved gentle vocal characteristics
- Standard Orpheus features: voice cloning, emotion control, streaming
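As a rough sketch of how a pre-trained voice such as "tara" might be selected, the helper below builds a text prompt string. The `voice: text` layout and the emotion tag names are assumptions based on upstream Orpheus conventions and are not documented in this card; `build_prompt` and `ASSUMED_TAGS` are hypothetical names, and the sketch stops short of tokenization and audio decoding.

```python
# Hypothetical prompt builder. The "voice: text" layout and the tag names
# are assumptions taken from upstream Orpheus conventions, not from this card.

ASSUMED_TAGS = {"laugh", "chuckle", "sigh", "gasp", "yawn"}  # assumed subset

def build_prompt(text, voice="tara", tag=None):
    """Return a prompt string of the form 'voice: text <tag>'."""
    # Collapse runs of whitespace so the prompt is a single clean line.
    text = " ".join(text.split())
    if not text:
        raise ValueError("text must not be empty")
    if tag is not None:
        if tag not in ASSUMED_TAGS:
            raise ValueError(f"unknown tag: {tag}")
        text = f"{text} <{tag}>"
    return f"{voice}: {text}"

plain = build_prompt("  Take a slow   breath. ")
tagged = build_prompt("Take a slow breath.", tag="sigh")
```

The resulting string would then be passed to whatever inference pipeline is used for the base Orpheus model; check the upstream documentation for the exact tokens it expects.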
Limitations
- Cannot generate true whispering: the training method was insufficient for complex whisper synthesis
- Limited ASMR authenticity: output does not sound like human-produced ASMR content
- Best results with existing voice profiles rather than novel ASMR voices
Ethics
Do not use this model for impersonation without consent or for deceptive purposes.