---
license: mit
---

### Vision Transformer (ViT) with LoRA for Spectrogram Regression

---

### Fine-Tuning Details

| Category | Specification |
|-----------------------|--------------------------------------------------------------------------------------|
| **Framework** | PyTorch |
| **Architecture** | Pre-trained Vision Transformer (ViT) |
| **Adaptation Method** | LoRA (Low-Rank Adaptation) |
| **Task** | Regression on time-frequency representations |
| **Target Variables** | 1. Chirp start time (ms)<br>2. Start frequency (kHz)<br>3. End frequency (kHz) |
| **Training Protocol** | • Automatic Mixed Precision (AMP)<br>• Early stopping<br>• Learning rate scheduling |
| **Output** | Quantitative predictions + optional natural language descriptions |

---
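LoRA freezes the pre-trained weight matrix `W` and learns only a low-rank update `(alpha/r) * B @ A`, which keeps the number of trainable parameters small. The sketch below illustrates just this idea in NumPy; it is a minimal illustration of the adaptation math, not the code used in this repository (the class name and initialization scheme are assumptions).

```python
import numpy as np

rng = np.random.default_rng(0)

class LoRALinear:
    """Frozen dense layer W plus a trainable low-rank update (alpha/r) * B @ A."""

    def __init__(self, d_in, d_out, r=4, alpha=8):
        self.W = rng.normal(size=(d_out, d_in))      # frozen pre-trained weight
        self.A = rng.normal(size=(r, d_in)) * 0.01   # trainable, small random init
        self.B = np.zeros((d_out, r))                # trainable, zero init
        self.scale = alpha / r

    def __call__(self, x):
        # Zero-initialized B makes the adapter a no-op before training starts,
        # so the adapted layer initially reproduces the frozen layer exactly.
        return x @ (self.W + self.scale * self.B @ self.A).T
```

Because `B` starts at zero, the adapted model matches the pre-trained model at step 0, and training only has to learn the small `A`/`B` factors.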
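The training protocol above lists early stopping; the usual rule is to halt once the validation loss has not improved for a fixed number of epochs. A framework-agnostic sketch of that rule (a hypothetical helper, not this repository's implementation):

```python
class EarlyStopping:
    """Stop training when validation loss has not improved for `patience` epochs."""

    def __init__(self, patience=3, min_delta=0.0):
        self.patience = patience
        self.min_delta = min_delta
        self.best = float("inf")
        self.bad_epochs = 0

    def step(self, val_loss):
        # An improvement must beat the best loss by at least min_delta.
        if val_loss < self.best - self.min_delta:
            self.best = val_loss
            self.bad_epochs = 0
        else:
            self.bad_epochs += 1
        return self.bad_epochs >= self.patience  # True -> stop training
```

In a training loop this is called once per epoch after validation, breaking out of the loop when `step()` returns `True`.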
|
| 21 |
+
### Resource Details
|
| 22 |
+
|
| 23 |
+
| Resource | Description | Link |
|
| 24 |
+
|----------|-------------|------|
|
| 25 |
+
| Trained Vision Transformer Model | Access to a pre-trained Vision Transformer model fine-tuned on synthetic spectrograms for chirp localization | [HuggingFace Model Hub](https://huggingface.co/nubahador/Fine_Tuned_Transformer_Model_for_Chirp_Localization/tree/main) |
|
| 26 |
+
| Synthetic Spectrogram Dataset | Download link for 100,000 synthetic spectrograms with corresponding labels for chirp localization | [HuggingFace Dataset Hub](https://huggingface.co/datasets/nubahador/ChirpLoc100K___A_Synthetic_Spectrogram_Dataset_for_Chirp_Localization/tree/main) |
|
| 27 |
+
| PyTorch Implementation | Repository containing the PyTorch code for fine-tuning the Vision Transformer on spectrograms | [Implementation GitHub Repository](https://github.com/nbahador/Train_Spectrogram_Transformer) |
|
| 28 |
+
| Synthetic Chirp Generator | Python package for generating synthetic chirp spectrograms (images with corresponding labels) | [Dataset GitHub Repository](https://github.com/nbahador/chirp_spectrogram_generator) |
|
| 29 |
+
|
| 30 |
+
---
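The labeled examples behind this dataset pair a signal containing a linear chirp with its (start time, start frequency, end frequency) targets. A self-contained NumPy sketch of that kind of generation, under assumed defaults (sample rate, duration, and function name are illustrative; the actual generator package's API may differ):

```python
import numpy as np

def synth_chirp(start_ms, f0_khz, f1_khz, dur_ms=200.0, fs=44_100.0, total_ms=1_000.0):
    """Place one linear chirp inside a longer silent signal and return it
    together with the (start time, start freq, end freq) regression label."""
    x = np.zeros(int(fs * total_ms / 1000.0))
    n0 = int(fs * start_ms / 1000.0)         # sample index of chirp onset
    n = int(fs * dur_ms / 1000.0)            # chirp length in samples
    t = np.arange(n) / fs                    # time axis inside the chirp (s)
    f0, f1 = f0_khz * 1e3, f1_khz * 1e3      # kHz -> Hz
    T = n / fs
    # Linear sweep f(t) = f0 + (f1 - f0) * t / T, integrated to phase.
    phase = 2 * np.pi * (f0 * t + 0.5 * (f1 - f0) / T * t ** 2)
    x[n0:n0 + n] = np.cos(phase)
    return x, (start_ms, f0_khz, f1_khz)     # signal + label triple
```

A time-frequency transform (e.g. an STFT) of `x` then yields the spectrogram image that the ViT regresses the label triple from.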
<div style="display: flex; flex-wrap: wrap; gap: 15px; margin-top: 15px;">
<div style="flex: 1; min-width: 200px; background: white; border-radius: 8px; padding: 15px; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
<h4 style="margin-top: 0; color: #5f6368;">🧑‍💻 Curated by</h4>