Avik Rao committed on
Commit 2165a9a · 1 Parent(s): 06d43f8

Update README.md

Files changed (2)
  1. README.md +39 -12
  2. assets/system_architecture.png +0 -0
README.md CHANGED
@@ -1,15 +1,42 @@
- ---
- title: Fire Coml FALL 2022
- emoji: 🎶
- colorFrom: green
- colorTo: indigo
- sdk: streamlit
- sdk_version: 1.10.0
- app_file: app.py
- pinned: false
- ---

- Link to Huggingface App: https://huggingface.co/spaces/SLAYEROFALL3050/Audio_Generator_Using_GAN

- # FIRE COML Fall 2022

+ # Tag-based Audio Generation

+ ## Description

+ Model to generate audio given input genre(s), mood(s), and instrument(s)

+ ## Link to demo app
+
+ https://huggingface.co/spaces/SLAYEROFALL3050/Audio_Generator_Using_GAN
+
+ ## Youtube Video demo
+
+ TODO
+
+ ## System Architecture Diagram
+
+ ![System Architecture Diagram](./assets/system_architecture.png)
+
+ ### Explanation
+
+ The user enters one genre tag, one mood tag, and one instrument tag in the frontend. Each tag is passed to a semantic-similarity NLP model, which finds the nearest tag in the training space and outputs it, so every input is coerced to a genre, mood, or instrument tag that the model was trained on. The coerced tags are then passed as input to the audio generation model, which generates audio that can be played in the frontend.
+
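The tag-coercion step described in the explanation could look roughly like the sketch below. The README does not name the NLP model or list the training tags, so everything here is an assumption for illustration: `TRAINING_TAGS` is a made-up vocabulary, and `embed` uses character-trigram counts as a crude placeholder for a real semantic-similarity model. Only the overall shape (embed the user tag, pick the closest training tag by cosine similarity) follows the description.

```python
from collections import Counter
from math import sqrt

# Hypothetical training-space vocabularies; the real tag lists are not in the README.
TRAINING_TAGS = {
    "genre": ["rock", "jazz", "classical", "electronic"],
    "mood": ["happy", "sad", "energetic", "calm"],
    "instrument": ["piano", "guitar", "drums", "violin"],
}


def embed(text):
    """Placeholder embedding: character-trigram counts.

    This is a stand-in for the project's semantic-similarity model,
    which would return a dense vector instead.
    """
    padded = f"  {text.lower().strip()}  "
    return Counter(padded[i:i + 3] for i in range(len(padded) - 2))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[gram] for gram, count in a.items())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def coerce_tag(user_tag, category):
    """Return the training-space tag in `category` closest to `user_tag`."""
    query = embed(user_tag)
    return max(TRAINING_TAGS[category], key=lambda tag: cosine(query, embed(tag)))


def coerce_tags(genre, mood, instrument):
    """Coerce one user tag per category; the result would feed the audio model."""
    return {
        "genre": coerce_tag(genre, "genre"),
        "mood": coerce_tag(mood, "mood"),
        "instrument": coerce_tag(instrument, "instrument"),
    }


if __name__ == "__main__":
    print(coerce_tags("jazzy", "calm", "guitars"))
```

Trigram overlap only captures spelling similarity, so a synonym such as "mellow" would not map to "calm" here. Swapping `embed` for a sentence-embedding model would keep the rest of the lookup unchanged.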
+ ## Model Architecture Diagrams
+
+ TODO: NLP model diagram
+ TODO: Audio generation model diagram
+
+ ## Directory Guide
+
+ TODO
+
+ ## Training Instructions
+
+ TODO
+
+ ## Testing Instructions
+
+ TODO
+
+ ## Citations and References
+
+ TODO
assets/system_architecture.png ADDED