ewhk9887 commited on
Commit
aaa3265
ยท
verified ยท
1 Parent(s): deb2d19

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +46 -6
README.md CHANGED
@@ -10,14 +10,54 @@ tags:
10
  license: apache-2.0
11
  language:
12
  - en
 
13
  ---
14
 
15
- # Uploaded model
16
 
17
- - **Developed by:** ewhk9887
18
- - **License:** apache-2.0
19
- - **Finetuned from model :** unsloth/qwen2.5-coder-14b-instruct-bnb-4bit
20
 
21
- This qwen2 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
22
 
23
- [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
 
10
  license: apache-2.0
11
  language:
12
  - en
13
+ - ko
14
  ---
15
 
16
+ # Qwen2.5 Korean Code Review LLM
17
 
18
+ [![Unsloth](https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png)](https://github.com/unslothai/unsloth)
 
 
19
 
20
+ ## Overview
21
+ This model is a fine-tuned version of [`unsloth/qwen2.5-coder-14b-instruct-bnb-4bit`](https://huggingface.co/unsloth/qwen2.5-coder-14b-instruct-bnb-4bit). It is optimized for Korean-language code reviews and programming education.
22
+
23
+ The model was trained using [ewhk9887/korean_code_reviews_from_github](https://huggingface.co/datasets/ewhk9887/korean_code_reviews_from_github), a dataset consisting of Korean code reviews collected from GitHub. The fine-tuning process was done using [Unsloth](https://github.com/unslothai/unsloth) and Hugging Face's `transformers` and `trl` libraries, enabling a **2x faster** training process.
24
+
25
+ ## ๋ชจ๋ธ ๊ฐœ์š”
26
+ ์ด ๋ชจ๋ธ์€ [`unsloth/qwen2.5-coder-14b-instruct-bnb-4bit`](https://huggingface.co/unsloth/qwen2.5-coder-14b-instruct-bnb-4bit)๋ฅผ ํŒŒ์ธํŠœ๋‹ํ•œ ๋ฒ„์ „์œผ๋กœ, **ํ•œ๊ตญ์–ด ์ฝ”๋“œ ๋ฆฌ๋ทฐ ๋ฐ ์ฝ”๋”ฉ ํ•™์Šต**์„ ์œ„ํ•œ ์ตœ์ ํ™”๋ฅผ ๊ฑฐ์ณค์Šต๋‹ˆ๋‹ค.
27
+
28
+ [GitHub์—์„œ ์ˆ˜์ง‘๋œ ์ฝ”๋“œ ๋ฆฌ๋ทฐ ๋ฐ์ดํ„ฐ์…‹](https://huggingface.co/datasets/ewhk9887/korean_code_reviews_from_github)์„ ์‚ฌ์šฉํ•˜์—ฌ ํ•™์Šตํ–ˆ์œผ๋ฉฐ, [Unsloth](https://github.com/unslothai/unsloth) ๋ฐ Hugging Face์˜ `transformers`, `trl` ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ๋ฅผ ํ™œ์šฉํ•˜์—ฌ **2๋ฐฐ ๋น ๋ฅธ** ํ•™์Šต์„ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ–ˆ์Šต๋‹ˆ๋‹ค.
29
+
30
+ ## Features / ํŠน์ง•
31
+ - **Korean Code Review Support**: Designed specifically for analyzing and reviewing code in Korean.
32
+ - **Efficient Fine-Tuning**: Utilized `bnb-4bit` quantization and Unsloth for optimized performance.
33
+ - **Bilingual Support**: Can process both Korean and English inputs.
34
+ - **Transformer-based Model**: Leverages Qwen2.5's strong coding capabilities.
35
+
36
+ - **ํ•œ๊ตญ์–ด ์ฝ”๋“œ ๋ฆฌ๋ทฐ ์ตœ์ ํ™”**: ์ฝ”๋“œ ๋ฆฌ๋ทฐ๋ฅผ ํ•œ๊ตญ์–ด๋กœ ๋ถ„์„ํ•˜๊ณ  ์ž‘์„ฑํ•˜๋Š” ๋ฐ ์ตœ์ ํ™”๋˜์—ˆ์Šต๋‹ˆ๋‹ค.
37
+ - **ํšจ์œจ์ ์ธ ํŒŒ์ธํŠœ๋‹**: `bnb-4bit` ์–‘์žํ™” ๋ฐ Unsloth ๊ธฐ์ˆ ์„ ํ™œ์šฉํ•˜์—ฌ ๋น ๋ฅธ ํ•™์Šต์ด ๊ฐ€๋Šฅํ–ˆ์Šต๋‹ˆ๋‹ค.
38
+ - **ํ•œ์˜ ์ง€์›**: ํ•œ๊ตญ์–ด์™€ ์˜์–ด ์ž…๋ ฅ์„ ๋ชจ๋‘ ์ฒ˜๋ฆฌํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
39
+ - **๊ฐ•๋ ฅํ•œ ํŠธ๋žœ์Šคํฌ๋จธ ๊ธฐ๋ฐ˜**: Qwen2.5 ๋ชจ๋ธ์„ ํ™œ์šฉํ•œ ์ฝ”๋“œ ๋ถ„์„ ์„ฑ๋Šฅ.
40
+
41
+ ## Usage / ์‚ฌ์šฉ ๋ฐฉ๋ฒ•
42
+ ```python
43
+ from transformers import AutoModelForCausalLM, AutoTokenizer
44
+ import torch
45
+
46
+ model_name = "ewhk9887/qwen2.5-korean-code-review"
47
+ tokenizer = AutoTokenizer.from_pretrained(model_name)
48
+ model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.float16, device_map="auto")
49
+
50
+ inputs = tokenizer("์ฝ”๋“œ๋ฅผ ๋ฆฌ๋ทฐํ•ด ์ฃผ์„ธ์š”: def add(a, b): return a + b", return_tensors="pt").to("cuda")
51
+ outputs = model.generate(**inputs, max_new_tokens=100)
52
+ print(tokenizer.decode(outputs[0], skip_special_tokens=True))
53
+ ```
54
+
55
+ ## Developer / ๊ฐœ๋ฐœ์ž
56
+ - **Name**: ์€์€์ˆ˜ (Eunsoo Max Eun)
57
+ - **License**: Apache-2.0
58
+
59
+ ## Acknowledgments / ์ฐธ๊ณ  ์ž๋ฃŒ
60
+ - [Unsloth](https://github.com/unslothai/unsloth)
61
+ - [Hugging Face TRL](https://huggingface.co/docs/trl)
62
+ - [Dataset Used](https://huggingface.co/datasets/ewhk9887/korean_code_reviews_from_github)
63