bin778 commited on
Commit
7bcd136
ยท
verified ยท
1 Parent(s): e568291

docs: Write README.md

Browse files
Files changed (1) hide show
  1. README.md +66 -3
README.md CHANGED
@@ -1,3 +1,66 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ language: ko
4
+ tags:
5
+ - regression
6
+ - pytorch
7
+ - xgboost
8
+ - sports-car
9
+ ---
10
+
11
+ # ์Šคํฌ์ธ ์นด ๊ฐ€๊ฒฉ ๋ฐ ์„ฑ๋Šฅ ์˜ˆ์ธก ๋ชจ๋ธ
12
+
13
+ ์ด ๋ชจ๋ธ์€ ์Šคํฌ์ธ ์นด์˜ ๋‹ค์–‘ํ•œ ์ŠคํŽ™(์ œ์กฐ์‚ฌ, ์—ฐ์‹, ์—”์ง„ ํฌ๊ธฐ ๋“ฑ)์„ ๊ธฐ๋ฐ˜์œผ๋กœ **๊ฐ€๊ฒฉ, ๋งˆ๋ ฅ, ์ œ๋กœ๋ฐฑ**์„ ์˜ˆ์ธกํ•˜๋Š” ๋”ฅ๋Ÿฌ๋‹ ๋ฐ ๋จธ์‹ ๋Ÿฌ๋‹ ๋ชจ๋ธ์„ ํฌํ•จํ•˜๊ณ  ์žˆ๋‹ค.
14
+
15
+ ## ํ”„๋กœ์ ํŠธ ๊ฐœ์š”
16
+
17
+ ๋‹ค์–‘ํ•œ ์Šคํฌ์ธ ์นด ๋ฐ์ดํ„ฐ๋ฅผ ๋ถ„์„ํ•˜๊ณ , ์ตœ์ ์˜ ์˜ˆ์ธก ๋ชจ๋ธ์„ ์ฐพ๊ธฐ ์œ„ํ•ด ๋‹ค์Œ๊ณผ ๊ฐ™์€ ๊ณผ์ •์„ ๊ฑฐ์ณค๋‹ค.
18
+ 1. ๋ฐ์ดํ„ฐ ์ „์ฒ˜๋ฆฌ ๋ฐ ํ”ผ์ฒ˜ ์—”์ง€๋‹ˆ์–ด๋ง (GroupBy ํ™œ์šฉ)
19
+ 2. **๋”ฅ๋Ÿฌ๋‹(TensorFlow/Keras)** ๋ฐ **๋จธ์‹ ๋Ÿฌ๋‹(XGBoost)** ๋ชจ๋ธ ๊ตฌ์ถ•
20
+ 3. ํ•˜์ดํผํŒŒ๋ผ๋ฏธํ„ฐ ํŠœ๋‹์„ ํ†ตํ•œ ๋ชจ๋ธ ์ตœ์ ํ™”
21
+ 4. ๋‘ ๋ชจ๋ธ์˜ ์„ฑ๋Šฅ(MSE) ๋น„๊ต ๋ฐ ์ตœ์ข… ๋ชจ๋ธ ์„ ์ •
22
+
23
+ ## ๋ชจ๋ธ (Models)
24
+
25
+ ์ด ํ”„๋กœ์ ํŠธ๋Š” ๋‘ ๊ฐ€์ง€ ์ตœ์ ํ™”๋œ ๋ชจ๋ธ์„ ์ œ๊ณตํ•œ๋‹ค.
26
+
27
+ | ๋ชจ๋ธ ์ข…๋ฅ˜ | ํŒŒ์ผ๋ช… | ์ฃผ์š” ํŠน์ง• |
28
+ | :--- | :--- | :--- |
29
+ | **๋”ฅ๋Ÿฌ๋‹ (Keras)** | `best_model.keras` | ReLU ํ™œ์„ฑํ™” ํ•จ์ˆ˜์™€ Dropout์„ ์‚ฌ์šฉํ•œ 3-Layer ์‹ ๊ฒฝ๋ง |
30
+ | **๋จธ์‹ ๋Ÿฌ๋‹ (XGBoost)**| `xgboost-model.skops`| ํ•˜์ดํผํŒŒ๋ผ๋ฏธํ„ฐ ํŠœ๋‹์œผ๋กœ ์ตœ์ ํ™”๋œ Gradient Boosting ๋ชจ๋ธ |
31
+
32
+ ### ๋ชจ๋ธ ๊ตฌ์กฐ (๋”ฅ๋Ÿฌ๋‹)
33
+ ![๋ชจ๋ธ ๊ตฌ์กฐ](model.png)
34
+
35
+ ## ๐Ÿ“Š ๋ฐ์ดํ„ฐ์…‹ (Dataset)
36
+
37
+ - **๋ฐ์ดํ„ฐ ์ถœ์ฒ˜**: [Sports Car Price Dataset on Kaggle](https://www.kaggle.com/datasets/kikun1234/sports-car-prices-dataset) (์˜ˆ์‹œ ๋งํฌ)
38
+ - **ํƒ€๊ฒŸ ๋ณ€์ˆ˜ (์˜ˆ์ธก ๋Œ€์ƒ)**: `๊ฐ€๊ฒฉ(์›ํ™”)`, `๋งˆ๋ ฅ`, `์ œ๋กœ๋ฐฑ (0-100km)`
39
+ - **์ฃผ์š” ํ”ผ์ฒ˜**: `์ œ์กฐ์‚ฌ`, `๋ชจ๋ธ`, `์—ฐ์‹`, `์—”์ง„ ํฌ๊ธฐ`, `ํ† ํฌ` ๋“ฑ
40
+
41
+ ## ๐Ÿ› ๏ธ ์‚ฌ์šฉ ๋ฐฉ๋ฒ•
42
+
43
+ ์ด ๋ชจ๋ธ์„ ๋ถˆ๋Ÿฌ์™€ ์‚ฌ์šฉํ•˜๋ ค๋ฉด `tensorflow`, `xgboost`, `scikit-learn`, `skops` ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ๊ฐ€ ํ•„์š”ํ•˜๋‹ค.
44
+
45
+ **XGBoost ๋ชจ๋ธ ๋ถˆ๋Ÿฌ์˜ค๊ธฐ ๋ฐ ์˜ˆ์ธก**
46
+ ```python
47
+ import skops.io as sio
48
+
49
+ # ์ €์žฅ์†Œ์—์„œ ๋ชจ๋ธ์„ ์ง์ ‘ ๋ถˆ๋Ÿฌ์˜ฌ ์ˆ˜ ์žˆ๋‹ค (๋˜๋Š” ๋‹ค์šด๋กœ๋“œ ํ›„)
50
+ # loaded_model = sio.load("hf://your-hf-username/your-repo-name/xgboost-model.skops")
51
+ loaded_model = sio.load("xgboost-model.skops")
52
+
53
+ # ์˜ˆ์ธกํ•  ๋ฐ์ดํ„ฐ๋ฅผ ์ค€๋น„ํ•œ๋‹ค (์ „์ฒ˜๋ฆฌ ๋ฐ ์Šค์ผ€์ผ๋ง ํ•„์š”)
54
+ # preprocessed_data = ...
55
+ # prediction = loaded_model.predict(preprocessed_data)
56
+ # print(prediction)
57
+ ```
58
+
59
+ ## ๐Ÿ“ˆ ์ตœ์ข… ์„ฑ๋Šฅ
60
+
61
+ ํ•˜์ดํผํŒŒ๋ผ๋ฏธํ„ฐ ํŠœ๋‹ ํ›„, ๋‘ ๋ชจ๋ธ์˜ ํ…Œ์ŠคํŠธ ๋ฐ์ดํ„ฐ์…‹์— ๋Œ€ํ•œ **ํ‰๊ท  ์ œ๊ณฑ ์˜ค์ฐจ(MSE)**๋Š” ๋‹ค์Œ๊ณผ ๊ฐ™๋‹ค.
62
+
63
+ - **(ํŠœ๋‹) ๋”ฅ๋Ÿฌ๋‹ ๋ชจ๋ธ MSE**: `0.010617`
64
+ - **(ํŠœ๋‹) XGBoost ๋ชจ๋ธ MSE**: `0.010617`
65
+
66
+ ๋‘ ๋ชจ๋ธ์ด ๊ฑฐ์˜ ๋™์ผํ•œ ์ตœ๊ณ  ์„ฑ๋Šฅ์„ ๊ธฐ๋กํ–ˆ์œผ๋ฉฐ, ์ด๋Š” ๋ฐ์ดํ„ฐ์˜ ํŠน์„ฑ์„ ๊ฐ๊ธฐ ๋‹ค๋ฅธ ๋ฐฉ์‹์œผ๋กœ ์™„๋ฒฝํ•˜๊ฒŒ ํ•™์Šตํ–ˆ์Œ์„ ์‹œ์‚ฌํ•œ๋‹ค.