---
title: README
emoji: 📉
colorFrom: gray
colorTo: purple
sdk: static
pinned: false
---

# FormosanBank

FormosanBank is a machine-readable corpus and tooling ecosystem for Taiwan’s Indigenous Formosan languages. This Hugging Face organization hosts datasets and related resources for research, education, language revitalization, and speech/language technology.

## What’s here

- datasets and corpus releases
- text, metadata, and audio-linked resources
- materials for automatic speech recognition (ASR), machine translation (MT), and natural language processing (NLP)

## Links

- [Documentation](https://ai4commsci.gitbook.io/formosanbank)
- [GitHub](https://github.com/FormosanBank/FormosanBank)
- [Hugging Face guide](https://ai4commsci.gitbook.io/formosanbank/the-bank-architecture/developers/huggingface)

## Use and licensing

Licensing may vary by corpus. Please check each dataset card and the project documentation before reuse.

- [Terms of Use](https://ai4commsci.gitbook.io/formosanbank/additional-resources/terms-of-use)
- [Contributing](https://ai4commsci.gitbook.io/formosanbank/additional-resources/contributing-to-formosanbank)
- [Publications](https://ai4commsci.gitbook.io/formosanbank/additional-resources/publications)