yyliu01
/

AuralSAM2

Model card Files Files and versions

AuralSAM2 / docs /before_start.md

yyliu01's picture

Upload folder using huggingface_hub

c6dfc69 verified 4 days ago

|

history blame contribute delete

1.65 kB

	# Before Start

	This document provides a concise workflow to run AuralSAM2 experiments.

	## ⚙️ Prepare environment and data

	Please complete all setup steps in [installation](./installation.md) first.

	## 🚀 Training

	Use the unified launcher script:

	```bash
	cd scripts
	./run_avs_train.sh <v1s\|v1m\|v2> [gpus]
	./run_ref_train.sh [gpus]
	```
	The experiments are implemented by 4 GPUs by default.

	## 🔍 Inference (example)

	```bash
	cd avs.code/v2.code
	python inference.py --gpus 1 --batch_size 1 --inference_ckpt /absolute/path/to/checkpoint.pth
	```

	## 📊 Training Logs (Reproducibility)

	Some examples of training details, please see [this wandb link](https://wandb.ai/pyedog1976/AVS-final-report/workspace?nw=nwuserpyedog1976).

	In details, after clicking the run (e.g., [v1m-hiera-l](https://wandb.ai/pyedog1976/AVS-final-report/runs/gzp5dmwi/logs?nw=nwuserpyedog1976)), you can checkout:

	1) <img src="https://user-images.githubusercontent.com/102338056/167979073-1c1b3144-8a72-4d8d-9084-31d7fdab3e9b.png" width="26" height="22"> overall information (e.g., command line, hardware information and training time).
	2) <img src="https://user-images.githubusercontent.com/102338056/167978940-8c1f3d79-d062-4e7b-b56e-30b97d273ae8.png" width="26" height="22"> training curves and validation visualisation.
	3) <img src="https://user-images.githubusercontent.com/102338056/167979238-4847430f-aa0b-483d-b735-8a10b43293a1.png" width="26" height="22"> output logs.


	## 💾 Checkpoints
	We release both checkpoints and training logs in this [Google Drive link](https://drive.google.com/drive/folders/1n0HaCHMn48KaImXvX2mu4qKHUQg4mo9R?usp=sharing).