Spaces:
Runtime error
Runtime error
add intro
Browse files- article.md +9 -5
article.md
CHANGED
|
@@ -20,20 +20,24 @@ epoch train_loss valid_loss accuracy time
|
|
| 20 |
1 0.260431 0.200901 0.945017 00:39
|
| 21 |
2 0.090158 0.164748 0.950745 00:40
|
| 22 |
|
| 23 |
-
|
| 24 |
|
| 25 |
[Classical approaches on this dataset as of 2019](https://www.researchgate.net/publication/335862311_Evaluation_of_Classical_Machine_Learning_Techniques_towards_Urban_Sound_Recognition_on_Embedded_Systems)
|
| 26 |
|
| 27 |
## State of the Art Approaches
|
| 28 |
|
| 29 |
-
The state-of-the-art methods for audio classification approach this problem as an image classification task. For such image classification problems from audio samples, three common(https://scottmduda.medium.com/urban-environmental-audio-classification-using-mel-spectrograms-706ee6f8dcc1)
|
| 30 |
transformation approaches are:
|
| 31 |
|
| 32 |
-
Linear Spectrograms
|
| 33 |
-
Log Spectrograms
|
| 34 |
-
[Mel Spectrograms](https://towardsdatascience.com/audio-deep-learning-made-simple-part-2-why-mel-spectrograms-perform-better-aad889a93505)
|
| 35 |
|
| 36 |
|
| 37 |
## Credits
|
| 38 |
|
| 39 |
Thanks to [Kurian Benoy](https://kurianbenoy.com/) and countless others that generously leave code public.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 20 |
1 0.260431 0.200901 0.945017 00:39
|
| 21 |
2 0.090158 0.164748 0.950745 00:40
|
| 22 |
|
| 23 |
+
## Classical Approaches
|
| 24 |
|
| 25 |
[Classical approaches on this dataset as of 2019](https://www.researchgate.net/publication/335862311_Evaluation_of_Classical_Machine_Learning_Techniques_towards_Urban_Sound_Recognition_on_Embedded_Systems)
|
| 26 |
|
| 27 |
## State of the Art Approaches
|
| 28 |
|
| 29 |
+
The state-of-the-art methods for audio classification approach this problem as an image classification task. For such image classification problems from audio samples, [three common] (https://scottmduda.medium.com/urban-environmental-audio-classification-using-mel-spectrograms-706ee6f8dcc1)
|
| 30 |
transformation approaches are:
|
| 31 |
|
| 32 |
+
- Linear Spectrograms
|
| 33 |
+
- Log Spectrograms
|
| 34 |
+
- [Mel Spectrograms](https://towardsdatascience.com/audio-deep-learning-made-simple-part-2-why-mel-spectrograms-perform-better-aad889a93505)
|
| 35 |
|
| 36 |
|
| 37 |
## Credits
|
| 38 |
|
| 39 |
Thanks to [Kurian Benoy](https://kurianbenoy.com/) and countless others that generously leave code public.
|
| 40 |
+
|
| 41 |
+
## Code Repo & Blog
|
| 42 |
+
|
| 43 |
+
Additional details on my [Github Repo] (https://github.com/gputrain/fastai2-coursework/tree/main/HW) and [my blog](https://www.gputrain.com/) where I will add additional details on model build, audio transforms and more.
|