---
license: apache-2.0
language:
- vi
- en
- zh
- ja
- ko
- fr
- de
- es
- th
- lo
- km
tags:
- language-detection
- language-identification
- vietnamese
- multilingual
library_name: underthesea
pipeline_tag: text-classification
metrics:
- accuracy
- f1
---

# Radar-1

Radar-1 is a language detection model developed by UnderTheSea NLP.

## Model Description

- **Model Type:** Language Detection (Text Classification)
- **Task:** Identify the language of input text
- **Language:** Multilingual
- **License:** Apache 2.0

## Supported Languages

| Code | Language |
|------|----------|
| vi | Vietnamese |
| en | English |
| zh | Chinese |
| ja | Japanese |
| ko | Korean |
| fr | French |
| de | German |
| es | Spanish |
| th | Thai |
| lo | Lao |
| km | Khmer |

## Installation

```bash
pip install underthesea
```

## Usage

```python
from underthesea import lang_detect

text = "Xin chào, tôi là người Việt Nam"
language = lang_detect(text)
print(language)  # vi
```

## API

```python
from radar import RadarLangDetector, detect

# Quick detection
lang = detect("Hello world")
print(lang)  # en

# With confidence scores
detector = RadarLangDetector.load("models/radar-1")
result = detector.predict("Xin chào Việt Nam")
print(result.lang)   # vi
print(result.score)  # 0.98
```

## Training

```bash
python src/train.py
```

## Technical Report

See [TECHNICAL_REPORT.md](TECHNICAL_REPORT.md) for detailed methodology and evaluation.