---
title: EN-VI-JA Triplet Dataset Viewer
emoji: 🌐
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: "4.44.0"
app_file: app.py
pinned: false
license: cc-by-4.0
datasets:
- sotalab/en-vi-ja-300k-triplets
---

# SOTA Lab
SOTA Lab is a research lab focused on quality, core values, and the development of core technologies in Software, Hardware, and Robotics.
We focus on building foundational technologies and high-quality datasets that push the boundaries of what's possible. Our work spans:
- **Software** - Machine learning, natural language processing, and data engineering
- **Hardware** - Embedded systems and computing infrastructure
- **Robotics** - Intelligent systems and automation
## AI/ML Research
Our AI and machine learning efforts focus on:
- **Multilingual NLP** - Building parallel corpora and translation datasets across multiple languages
- **Data Quality** - Developing pipelines for cleaning, filtering, and validating large-scale datasets
- **Model Training** - Creating high-quality training data for language models and translation systems
- **Open Datasets** - Publishing curated datasets for the research community
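
As a rough illustration of the kind of cleaning and filtering step a parallel-corpus pipeline might include, here is a minimal sketch for EN-VI-JA triplets. It is not the lab's actual pipeline: the function names `clean_triplet` and `clean_corpus`, the `max_len_ratio` parameter, and its default value are all hypothetical and chosen only for this example.

```python
import unicodedata


def clean_triplet(en, vi, ja, max_len_ratio=3.0):
    """Return a normalized (en, vi, ja) tuple, or None if a check fails.

    Hypothetical helper: the checks and the threshold are illustrative only.
    """
    # Collapse runs of whitespace and apply Unicode NFC normalization.
    texts = [unicodedata.normalize("NFC", " ".join(t.split())) for t in (en, vi, ja)]

    # Drop triplets in which any side is empty after normalization.
    if any(not t for t in texts):
        return None

    # Drop triplets whose character lengths differ too much (a crude misalignment signal).
    lengths = [len(t) for t in texts]
    if max(lengths) / min(lengths) > max_len_ratio:
        return None

    return tuple(texts)


def clean_corpus(rows):
    """Clean every triplet and remove exact duplicates, preserving input order."""
    seen = set()
    kept = []
    for en, vi, ja in rows:
        cleaned = clean_triplet(en, vi, ja)
        if cleaned is None or cleaned in seen:
            continue
        seen.add(cleaned)
        kept.append(cleaned)
    return kept
```

A character-length ratio is a blunt filter across scripts as different as English and Japanese, so any real threshold would need tuning against the data.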
We believe in open research and in contributing to the global community through open-source projects and publicly available datasets.