prismdata's picture
Add tokenizer and update model
34811aa verified
# Korean GPT
ํ•œ๊ตญ์–ด GPT ๋ชจ๋ธ์ž…๋‹ˆ๋‹ค.
## ์‚ฌ์šฉ๋ฒ•
```python
from transformers import AutoTokenizer, AutoModelForCausalLM
tokenizer = AutoTokenizer.from_pretrained(
"oz1115/korean-gpt-quick-test",
trust_remote_code=True
)
model = AutoModelForCausalLM.from_pretrained(
"oz1115/korean-gpt-quick-test",
trust_remote_code=True
)
inputs = tokenizer("์•ˆ๋…•ํ•˜์„ธ์š”", return_tensors="pt")
outputs = model.generate(**inputs, max_length=50)
print(tokenizer.decode(outputs[0]))
```
## ๋ชจ๋ธ ์ •๋ณด
- Vocabulary: 32,000
- Hidden Size: 512
- Layers: 8
- Attention Heads: 8