Create README.md
README.md
ADDED
@@ -0,0 +1,15 @@
| 1 |
+
This model is a Sound Event Detection (SED) system designed to detect and extract specific audio events from long-form recordings. It is built on PANNs (Pretrained Audio Neural Networks), using the CNN14 backbone with a Multiple Instance Learning (MIL) head so that it can be trained on weak (clip-level) labels and still produce frame-level predictions at inference time.
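As a rough illustration of what a MIL head does, the sketch below aggregates per-frame probabilities into a single clip-level score, which is what a weak (clip-level) label can supervise. The two pooling functions shown (max pooling and linear softmax pooling) are common choices in the SED literature and are assumptions here; this README does not state which pooling the model actually uses, and the function names are hypothetical.

```python
from typing import List


def max_pool(frame_probs: List[float]) -> float:
    """Clip score is the most confident frame (assumed pooling choice)."""
    return max(frame_probs)


def linear_softmax_pool(frame_probs: List[float]) -> float:
    """Weight each frame by its own probability: sum(p^2) / sum(p).

    Confident frames dominate the clip score, but every frame still
    contributes to it. This is an assumed, illustrative pooling choice.
    """
    total = sum(frame_probs)
    if total == 0.0:
        return 0.0
    return sum(p * p for p in frame_probs) / total


# Hypothetical frame-level outputs for one clip and one event class.
frame_probs = [0.1, 0.9, 0.2]
clip_score_max = max_pool(frame_probs)
clip_score_linear = linear_softmax_pool(frame_probs)
```

During training only the clip-level score is compared against the label; at inference the per-frame probabilities themselves can be read out to localize the event in time.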
|
| 2 |
+
|
| 3 |
+
## Model Details
|
| 4 |
+
|
| 5 |
+
- **Architecture:** CNN14 (PANNs) + MIL Head
|
| 6 |
+
- **Task:** Sound Event Detection (SED) / Audio Classification
|
| 7 |
+
- **Input Sampling Rate:** 32,000 Hz
|
| 8 |
+
- **Primary Target:** A specific sound event class (e.g., sniffing, breathing, or vocalizations), depending on the fine-tuning dataset.
|
| 9 |
+
- **Time Resolution:** ~10 ms to 160 ms, depending on the hop size and pooling configuration.
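The time resolution figures above follow from simple arithmetic: each output frame covers `hop_size * pooling_factor` input samples. The sketch below works through that calculation and then turns frame-level probabilities into timestamped events. The hop size of 320 samples, the pooling factors of 1 and 16, and the 0.5 threshold are assumptions chosen to reproduce the stated 10 ms to 160 ms range; they are not confirmed settings of this model, and the function names are hypothetical.

```python
from typing import List, Tuple


def frame_duration_ms(sample_rate: int, hop_size: int, pooling_factor: int) -> float:
    """Duration of one output frame in milliseconds."""
    return 1000.0 * hop_size * pooling_factor / sample_rate


def frames_to_events(
    frame_probs: List[float], threshold: float, frame_ms: float
) -> List[Tuple[float, float]]:
    """Group consecutive frames at or above the threshold into (onset_ms, offset_ms)."""
    events = []
    start = None
    for i, p in enumerate(frame_probs):
        if p >= threshold and start is None:
            start = i  # an event begins at this frame
        elif p < threshold and start is not None:
            events.append((start * frame_ms, i * frame_ms))
            start = None
    if start is not None:  # the event runs to the end of the recording
        events.append((start * frame_ms, len(frame_probs) * frame_ms))
    return events


# Assumed settings: 32 kHz input, hop of 320 samples.
fine = frame_duration_ms(32000, 320, 1)     # no temporal pooling
coarse = frame_duration_ms(32000, 320, 16)  # 16x temporal pooling (assumed)

# Hypothetical frame probabilities at the coarse resolution.
events = frames_to_events([0.1, 0.8, 0.9, 0.2], threshold=0.5, frame_ms=coarse)
```

With these assumed values, `fine` comes out at 10 ms and `coarse` at 160 ms, matching the two ends of the range quoted above.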
|
| 10 |
+
|
| 11 |
+
## Disclaimer
|
| 12 |
+
|
| 13 |
+
This repository contains the **model weights only**. It is not a standalone executable and requires a compatible PANNs-based inference script to function.
|
| 14 |
+
|
| 15 |
+
This model is part of a project called AccuSlice-SED that hasn't been released yet. I wanted to upload the weights to my private repository, but I was told "You've reached private storage quota. Upload it to public!" so I'm doing this reluctantly. :/
|