Universal-Scene Text Recognition Model with High-Accuracy
Separate audio into vocals, bass, drums, and other
compare the performance of YOLO algorithms