---
base_model:
- coml/BabyHuBERT
---

# VTC 2.1

This repository contains only the model weights.
For more information on how to use the model, see the [LAAC-LSCP/VTC](https://github.com/LAAC-LSCP/VTC) repository.

---

## Citation

To cite this work, please use the following BibTeX entry.

```bibtex
@misc{charlot2025babyhubertmultilingualselfsupervisedlearning,
    title={BabyHuBERT: Multilingual Self-Supervised Learning for Segmenting Speakers in Child-Centered Long-Form Recordings}, 
    author={Théo Charlot and Tarek Kunze and Maxime Poli and Alejandrina Cristia and Emmanuel Dupoux and Marvin Lavechin},
    year={2025},
    eprint={2509.15001},
    archivePrefix={arXiv},
    primaryClass={eess.AS},
    url={https://arxiv.org/abs/2509.15001}, 
}
```

---

## Changelog

To retrieve a specific version:
```bash
git clone --branch v2.1 --single-branch https://huggingface.co/coml/VTC-2
```

- `10265c5`: VTC 2.1 - tag: `v2.1`
- `91e67b5`: VTC 2.0 - tag: `v2.0`
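
The tags listed above can be passed to the same `git clone` command. As a minimal sketch, the shell function below wraps that command and prints the short hash of the checked-out commit, so it can be compared against the changelog. The function name `fetch_vtc`, its argument order, and the default destination directory are hypothetical and are not part of this repository or of the VTC tooling.

```bash
# Hypothetical helper (not shipped with VTC): clone one tagged version
# of the weights and print the short hash of the commit that was checked out.
#   $1: tag to retrieve (default: v2.1)
#   $2: destination directory (default: VTC-<tag>)
#   $3: repository URL (default: the VTC-2 weights repository)
fetch_vtc() {
    local tag="${1:-v2.1}"
    local dest="${2:-VTC-$tag}"
    local repo="${3:-https://huggingface.co/coml/VTC-2}"

    # Same command as in the changelog, with the tag and destination made variable.
    git clone --quiet --branch "$tag" --single-branch "$repo" "$dest" || return 1

    # Print the short commit hash for comparison with the changelog entries.
    git -C "$dest" rev-parse --short HEAD
}
```

For example, `fetch_vtc v2.0` would clone the `v2.0` tag into `VTC-v2.0`. The printed hash should match the changelog entry for that tag, although Git may print more characters than the changelog shows.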