---
title: Fire Coml FALL 2022
emoji: 🎶
colorFrom: green
colorTo: indigo
sdk: streamlit
sdk_version: 1.10.0
app_file: app.py
pinned: false
---

# Tag-based Audio Generation

## Description

A model that generates audio from input genre, mood, and instrument tags.

## Link to demo app

https://huggingface.co/spaces/SLAYEROFALL3050/Audio_Generator_Using_GAN

## YouTube Video demo

TODO

## System Architecture Diagram

![System Architecture Diagram](./assets/system_architecture.png)

### Explanation

The user enters one genre tag, one mood tag, and one instrument tag in the frontend. Each tag is passed to a semantic-similarity NLP model, which finds the nearest tag in the training space and coerces the input to it, yielding a genre, mood, and instrument tag that the audio generation model was trained on. These coerced tags are then passed as input to the audio generation model, and the generated audio can be played in the frontend.
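
The tag-coercion step can be sketched as a nearest-neighbor lookup under cosine similarity. This is a minimal illustration, not the project's actual NLP model: the tag vocabularies in `TRAINING_TAGS` and the function names `embed`, `coerce_tag`, and `coerce_tags` are hypothetical. The `embed` function uses character trigram counts as a stand-in for a real sentence-embedding model, so it captures spelling similarity rather than true semantic similarity.

```python
import math
from collections import Counter

# Hypothetical training-space vocabularies; the real tag lists are not
# given in this README.
TRAINING_TAGS = {
    "genre": ["rock", "jazz", "classical", "electronic"],
    "mood": ["happy", "sad", "calm", "energetic"],
    "instrument": ["guitar", "piano", "violin", "drums"],
}


def embed(text: str) -> Counter:
    """Stand-in embedding (assumption): character trigram counts.

    A real system would call a semantic embedding model here instead.
    """
    padded = f"  {text.lower().strip()} "
    return Counter(padded[i:i + 3] for i in range(len(padded) - 2))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def coerce_tag(user_tag: str, candidates: list[str]) -> str:
    """Return the training-space tag most similar to the user's tag."""
    query = embed(user_tag)
    return max(candidates, key=lambda c: cosine(query, embed(c)))


def coerce_tags(genre: str, mood: str, instrument: str) -> dict[str, str]:
    """Coerce one user tag per category to its nearest training-space tag."""
    user_tags = {"genre": genre, "mood": mood, "instrument": instrument}
    return {
        category: coerce_tag(tag, TRAINING_TAGS[category])
        for category, tag in user_tags.items()
    }
```

Calling `coerce_tags("Jazz", "happy!", "pianos")` maps each free-form input to a tag from the corresponding vocabulary; the resulting dictionary is what would be handed to the audio generation model. Swapping `embed` for a real embedding model would leave the rest of the lookup unchanged.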

## Model Architecture Diagrams

TODO: NLP model diagram

TODO: Audio generation model diagram

## Directory Guide

TODO

## Training Instructions

TODO

## Testing Instructions

TODO

## Citations and References

TODO