barom2 committed on
Commit b552efb · verified · 1 Parent(s): 4b793c7

Update README.md

Files changed (1)
  1. README.md +114 -6
README.md CHANGED
@@ -1,10 +1,118 @@
  ---
- title: README
- emoji: 🐒
  colorFrom: blue
- colorTo: pink
- sdk: static
- pinned: false
  ---
 
- Edit this `README.md` markdown file to author your organization card.

  ---
+ title: DATUMO
+ emoji: ⭐
  colorFrom: blue
+ colorTo: indigo
  ---
 
+ <div align="center">
+
+ <img src="https://cdn-avatars.huggingface.co/v1/production/uploads/63aa4990769a10efc403771c/-hPclrsYl0IW6kqD2DWBL.png" width="140" alt="DATUMO logo"/>
+
+ # ⭐ DATUMO
+ ### *The Data-centric AI Company*
+
+ **Built by [Selectstar](https://selectstar.ai/): data infrastructure for trustworthy AI**
+
+ [![Website](https://img.shields.io/badge/🌐_Website-selectstar.ai-4f46e5?style=for-the-badge)](https://selectstar.ai/)
+ [![Blog](https://img.shields.io/badge/📰_Blog-Read-0ea5e9?style=for-the-badge)](https://selectstar.ai/blog/)
+ [![LinkedIn](https://img.shields.io/badge/LinkedIn-0077B5?style=for-the-badge&logo=linkedin&logoColor=white)](https://kr.linkedin.com/company/datumo-usa)
+ [![Contact](https://img.shields.io/badge/Contact-Us-EA4335?style=for-the-badge&logo=gmail&logoColor=white)](https://selectstar.ai/contact/)
+
+ </div>
+
+ ---
+
+ ## 👋 About Us
+
+ We're **Selectstar**, a Korean AI company building the **data foundation for trustworthy AI**.
+ Since 2018, we've partnered with AI teams across the entire data value chain, from dataset design and construction to **LLM reliability evaluation and red-teaming**.
+
+ Our flagship **Datumo Platform** is Korea's first end-to-end AI trust evaluation solution, unifying dataset preparation, automated evaluation, red-teaming, and improvement analytics in a single pipeline.
+
+ > 🇰🇷 **Hello, we are Selectstar.**
+ > We are a **Data-centric AI company** that works with you at every stage of AI development, from data design and construction to LLM reliability verification.
+ > On this page, we open-source the datasets and models we use in our research and production work.
+
+ ---
+
+ ## 🎯 What We Do
+
+ <table>
+ <tr>
+ <td width="33%" valign="top">
+
+ ### 🗂️ Data Construction
+ Training data design &amp; build
+ Pre-training data licensing
+ RAG knowledge pipelines
+ Crowdsourced at scale
+
+ </td>
+ <td width="33%" valign="top">
+
+ ### 🛡️ AI Trust &amp; Safety
+ LLM red-teaming
+ Reliability benchmarks
+ Safety evaluation datasets
+ Guardrail testing
+
+ </td>
+ <td width="33%" valign="top">
+
+ ### 📊 Datumo Platform
+ Automated LLM evaluation
+ Dashboard analytics
+ **45 days → 45 minutes**
+ End-to-end eval pipeline
+
+ </td>
+ </tr>
+ </table>
+
+ ---
+
+ ## 📚 Featured Collections
+
+ ### 🛡️ [Safety-Data](https://huggingface.co/collections/datumo/safety-data)
+ Curated by our **AI Safety team**: Korean-language safety and reliability benchmarks for LLM evaluation.
+
+ - 🔸 [**KorSET**](https://huggingface.co/datasets/datumo/KorSET): Korean Safety Evaluation Toolkit
+ - 🔸 [**KorNAT**](https://huggingface.co/datasets/datumo/KorNAT): Korea's first LLM reliability / national-alignment benchmark
+
+ ### 📦 [Data-Data](https://huggingface.co/collections/datumo/data-data)
+ Research outputs from our **Data team**.
+
+ - 🔸 [**CAC-CoT**](https://huggingface.co/datumo/CAC-CoT): 7B Chain-of-Thought feature extraction model
+ - 🔸 [**CAC-CoT dataset**](https://huggingface.co/datasets/datumo/CAC-CoT): accompanying training data
+
+ > 💡 Follow us to be notified when new datasets and models are released.
+
+ ---
+
+ ## 🏆 Milestones
+
+ - 🇰🇷 **K-AI Company**: Selected for Korea's Sovereign AI Foundation Model Project *(SKT Consortium, data lead)*
+ - 🏅 **Forbes Korea "2025 AI 50"**
+ - 🏅 **Forbes "30 Under 30 Asia"**: Enterprise Technology
+ - 🚀 **Datumo Eval**: Korea's first automated LLM reliability evaluation platform (2025)
+ - 📈 **200M+ annotations** · **287+ enterprise clients** · **250K+ crowdworkers**
+ - Co-built landmark Korean benchmarks including **KLUE** and **KorQuAD 2.0**
+ - 🔬 Publications at **NeurIPS · EMNLP · CVPR**
+
+ ---
+
+ ## 🤝 Connect
+
+ | | |
+ |---|---|
+ | 🌐 Website | [selectstar.ai](https://selectstar.ai/) |
+ | 📰 Blog | [selectstar.ai/blog](https://selectstar.ai/blog/) |
+ | 💼 Enterprise inquiries | [Contact form](https://selectstar.ai/contact/) |
+ | 💬 Community | Join the [discussion tab](https://huggingface.co/spaces/datumo/README/discussions) |
+
+ ---
+
+ <div align="center">
+ <sub>⭐ Building the data foundation for trustworthy AI &middot; Made with care in Seoul 🇰🇷</sub>
+ </div>