FlameF0X commited on
Commit
53d33ac
·
verified ·
1 Parent(s): 73ecf3f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +58 -2
README.md CHANGED
@@ -7,6 +7,62 @@ tags:
7
  - music
8
  ---
9
 
10
- # NanoStudio
 
 
 
 
 
 
11
 
12
- Introduction **NanoStudio**, a under development T2A music generator that i haven't decided its size. I want to make this model capable to generate uncompressed music that feels raw and actually follows your prompt and lyrics.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
7
  - music
8
  ---
9
 
10
+ <div align="center">
11
+ <img src="https://img.shields.io/badge/Status-In_Development-orange?style=for-the-badge" />
12
+ <img src="https://img.shields.io/badge/Phase-Architecture_Planning-blue?style=for-the-badge" />
13
+ <br />
14
+ <h1>🎵 NanoStudio</h1>
15
+ <p><i>High-fidelity music generation with raw, uncompressed output.</i></p>
16
+ </div>
17
 
18
+ ---
19
+
20
+ # 🚀 Introduction
21
+ **NanoStudio** is a next-generation Text-to-Audio (T2A) model currently in its architectural infancy. Unlike models that rely on heavy neural compression, NanoStudio aims to deliver audio that feels raw, atmospheric, and stays true to the user's lyrical intent.
22
+
23
+ # 🗺️ Roadmap
24
+
25
+ <div style="background: #111; border: 1px solid #333; border-radius: 10px; padding: 20px; margin-bottom: 10px;">
26
+ <h3 style="color: #58a6ff; margin-top: 0;">📍 Phase 1: The Blueprint (Current)</h3>
27
+ <p><i>Focusing on the "How" before the "What".</i></p>
28
+ <ul>
29
+ <li>✅ Vision & Goal Setting</li>
30
+ <li>🟡 <b>Architecture Design</b> (Evaluating Diffusion vs. Flow-Matching)</li>
31
+ <li>⬜ Dataset Collection (Lossless 44.1kHz focus)</li>
32
+ </ul>
33
+ <div style="background: #333; border-radius: 20px; height: 12px; width: 100%;">
34
+ <div style="background: linear-gradient(90deg, #58a6ff, #bc8cff); width: 25%; height: 100%; border-radius: 20px;"></div>
35
+ </div>
36
+ <p align="right" style="font-size: 12px; margin-top: 5px;">25% Complete</p>
37
+ </div>
38
+
39
+ <div style="background: #111; border: 1px solid #222; border-radius: 10px; padding: 20px; margin-bottom: 10px; opacity: 0.6;">
40
+ <h3 style="color: #8b949e; margin-top: 0;">🧪 Phase 2: Alpha Training</h3>
41
+ <ul>
42
+ <li>⬜ Initial weights training</li>
43
+ <li>⬜ Lyric-to-Vocal alignment testing</li>
44
+ <li>⬜ Community feedback loop</li>
45
+ </ul>
46
+ </div>
47
+
48
+ <div style="background: #111; border: 1px solid #222; border-radius: 10px; padding: 20px; opacity: 0.6;">
49
+ <h3 style="color: #8b949e; margin-top: 0;">🏁 Phase 3: Public Release</h3>
50
+ <ul>
51
+ <li>⬜ Model Weights release on HF Hub</li>
52
+ <li>⬜ Live Gradio Demo Space</li>
53
+ </ul>
54
+ </div>
55
+
56
+ ---
57
+
58
+ # 🎓 Dev Status
59
+ > [!IMPORTANT]
60
+ > I am currently a student and participating in a **hackathon**.
61
+ > Development is active but happens in the "gaps" of my schedule. Thank you for your patience.
62
+
63
+ ### 🛠️ Technical Specs (Tentative)
64
+ | Feature | Target |
65
+ | :--- | :--- |
66
+ | **Sample Rate** | 44.1 kHz / 48 kHz |
67
+ | **Compression** | Zero/Minimal |
68
+ | **Control** | Text + Lyrics + Style Tags |