combe4259 commited on
Commit
2975847
Β·
verified Β·
1 Parent(s): 7c0da7b

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +74 -0
README.md ADDED
@@ -0,0 +1,74 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language: ko
3
+ license: apache-2.0
4
+ tags:
5
+ - sql
6
+ - text-to-sql
7
+ - nl2sql
8
+ - financial-domain
9
+ - pytorch
10
+ datasets:
11
+ - custom
12
+ metrics:
13
+ - accuracy
14
+ - f1
15
+ ---
16
+ ## Colab Notebook
17
+
18
+
19
+ [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1vaGZTZ7y0SYLarCX0QemkUernLyohswz?usp=sharing)
20
+
21
+
22
+ # NHSQLNL: 금육 μžμ—°μ–΄ β†’ SQL λ³€ν™˜ λͺ¨λΈ
23
+
24
+ `NHSQLNL`은 ν•œκ΅­μ–΄ 금육 μžμ—°μ–΄ 질의λ₯Ό SQL 쿼리둜 λ³€ν™˜ν•˜λŠ” **Text-to-SQL (NL2SQL)** λͺ¨λΈμž…λ‹ˆλ‹€.
25
+ 은행 및 금육ꢌ 도메인 질의λ₯Ό λ°μ΄ν„°λ² μ΄μŠ€ 질의(SQL)둜 μžλ™ λ³€ν™˜ν•˜μ—¬, 고객 질의 응닡 μ‹œμŠ€ν…œ 및 금육 데이터 뢄석에 ν™œμš©ν•  수 μžˆμŠ΅λ‹ˆλ‹€.
26
+
27
+ ---
28
+
29
+ ## μ£Όμš” κΈ°λŠ₯ (Features)
30
+
31
+ - ν•œκ΅­μ–΄ 금육 도메인 μžμ—°μ–΄ μž…λ ₯을 SQL 쿼리둜 λ³€ν™˜
32
+ - 사전 μ •μ˜λœ μŠ€ν‚€λ§ˆμ— 맞좘 μ•ˆμ „ν•œ SQL 생성
33
+ - PyTorch 및 Hugging Face `transformers` 기반
34
+
35
+ ---
36
+
37
+ ## μ‚¬μš© 방법 (How to Use)
38
+
39
+ ```python
40
+ from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
41
+
42
+ # λͺ¨λΈ λ‘œλ“œ
43
+ MODEL_PATH = "combe4259/NHSQLNL"
44
+ tokenizer = AutoTokenizer.from_pretrained(MODEL_PATH)
45
+ model = AutoModelForSeq2SeqLM.from_pretrained(MODEL_PATH)
46
+
47
+ # μž…λ ₯ 질의
48
+ query = "2023년에 κ°œμ„€λœ 예금 κ³„μ’Œ 수λ₯Ό μ•Œλ €μ€˜"
49
+
50
+ inputs = tokenizer(query, return_tensors="pt")
51
+
52
+ # SQL 예츑
53
+ outputs = model.generate(**inputs, max_length=128)
54
+ sql = tokenizer.decode(outputs[0], skip_special_tokens=True)
55
+
56
+ print("μž…λ ₯:", query)
57
+ print("μƒμ„±λœ SQL:", sql)
58
+
59
+
60
+ ---
61
+
62
+ ## ν•™μŠ΅ 데이터 (Training Data)
63
+
64
+ - 자체 κ΅¬μΆ•ν•œ 금육 도메인 **μžμ—°μ–΄ ↔ SQL λ§€ν•‘ 데이터셋** μ‚¬μš©
65
+ - 데이터 μ „μ²˜λ¦¬: SQL μŠ€ν‚€λ§ˆ μ •κ·œν™” 및 ν† ν¬λ‚˜μ΄μ € 기반 μž…λ ₯ λ³€ν™˜
66
+
67
+ ---
68
+ ---
69
+
70
+ ## ν™œμš© κ°€λŠ₯ λΆ„μ•Ό (Applications)
71
+
72
+ - 금육ꢌ 챗봇 및 상담 μžλ™ν™”
73
+ - μžμ—°μ–΄ 기반 데이터 쑰회 및 리포트 생성
74
+ - λΉ„μ „λ¬Έκ°€ λŒ€μƒ SQL ν•™μŠ΅/μ—°μŠ΅ 도ꡬ