BiologgingSolutions
/

OceanBEATs

Audio Classification

sound-event-detection

underwater-acoustics

self-supervised-learning

Model card Files Files and versions

OceanBEATs / README.md

alohajazz's picture

Update README.md

743acb5 verified about 1 month ago

|

history blame contribute delete

3.07 kB

	---
	license: cc-by-nc-4.0
	tags:
	- audio
	- sound-event-detection
	- underwater-acoustics
	- self-supervised-learning
	- beats
	datasets:
	- world-dapt
	language:
	- en
	metrics:
	- f1
	library_name: torch
	pipeline_tag: audio-classification
	---

	# OceanBEATs

	OceanBEATs is a foundation model for underwater acoustic monitoring, adapted from [BEATs](https://github.com/microsoft/unilm/tree/master/beats) via Domain-Adaptive Pretraining (DAPT) on approximately 4,400 hours of global ocean soundscapes (World-DAPT corpus).

	This model serves as the "ears" for underwater soundscapes described in our paper: "A stethoscope for the ocean: Unknownness-aware monitoring under false-positives-per-hour constraints in underwater soundscapes" (Under Review, Scientific Reports).

	## Model Details

	- Model Type: Audio Transformer (BEATs architecture)
	- Pretraining: Masked Audio Modeling + DAPT (SimCLR/InfoNCE) on underwater data
	- Input: 16kHz mono audio waveform
	- Backbone: BEATs AS-2M (iter3+)

	## Available Files

	This repository hosts the pretrained weights required to reproduce the results in our paper.

	1. `beats_dapt_topup_encoder.pt`
	* The core encoder (backbone) adapted to underwater acoustics.
	* Use this for feature extraction, unknown detection (CCED2), or fine-tuning on new marine datasets.

	2. `sed_head_56_topup_ep8.pt`
	* A 56-class Sound Event Detection (SED) head trained on coastal/lagoon data (Okinawa, Japan).
	* Detects fish, mammals, vessels, and environmental sounds.

	## Usage

	These weights are designed to be used with the official code repository:

	GitHub Repository: [alohajazz/openworld-soundscape-cced2-dgpu](https://github.com/alohajazz/openworld-soundscape-cced2-dgpu)

	Please download the `.pt` files and place them in the `weights/` directory of the cloned GitHub repository.

	```bash
	# Example directory structure after download
	openworld-soundscape-cced2-dgpu/
	└── weights/
	├── beats_dapt_topup_encoder.pt
	├── sed_head_56_topup_ep8.pt
	└── cced2/ ...
	```

	## License & Data Availability

	License: CC BY-NC 4.0 (Creative Commons Attribution-NonCommercial 4.0 International)

	These weights are released for non-commercial research purposes only.
	Commercial use is strictly prohibited without prior permission from the authors.

	> Note: The source code for using these models is released under the MIT License at the GitHub repository linked above.

	## Citation

	If you use this model in your research, please cite our paper:

	```bibtex
	@article{noda2026stethoscope,
	title={A stethoscope for the ocean: Unknownness-aware monitoring under false-positives-per-hour constraints in underwater soundscapes},
	author={Noda, Takuji and Koizumi, Takuya},
	journal={Scientific Reports},
	note={Under Review},
	year={2026}
	}
	```

	## Acknowledgements

	The base model architecture is based on BEATs (Microsoft). We acknowledge the creators of the BEATs model and the various open-source ocean acoustic datasets (SanctSound, ONC, PALAOA, etc.) used for DAPT.