iris-7b / README.md
davidkim205's picture
Update README.md
6587a8a verified
|
raw
history blame
5.18 kB
metadata
library_name: transformers
license: apache-2.0
language:
  - en
  - ko
pipeline_tag: translation

iris

image/jpeg

Iris is a model for Korean-English sentence translation based on deep learning.

It is used to translate Korean sentences into English or English sentences into Korean by utilizing advanced natural language processing technology. The model is trained to understand the grammar, vocabulary, and context of each language and generate appropriate translations. Iris provides efficient and accurate translation and can be used in a variety of applications.

Model Details

  • Model Developers : davidkim(changyeon kim)
  • Repository : will be updated soon.
  • base mode : mistralai/Mistral-7B-v0.1
  • dataset : translation_v3_346k

usage

from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

repo = "davidkim205/iris-7b"
model = AutoModelForCausalLM.from_pretrained(repo, torch_dtype=torch.bfloat16, device_map='auto')
tokenizer = AutoTokenizer.from_pretrained(repo)

def generate(prompt):
    encoding = tokenizer(
        prompt,
        return_tensors='pt',
        return_token_type_ids=False
    ).to("cuda")
    gen_tokens = model.generate(
        **encoding,
        max_new_tokens=2048,
        temperature=1.0,
        num_beams=5,
    )
    prompt_end_size = encoding.input_ids.shape[1]
    result = tokenizer.decode(gen_tokens[0, prompt_end_size:])

    return result


def translate_ko2en(text):
    prompt = f"[INST] 다음 문장을 영어로 번역하세요.{text} [/INST]"
    return generate(prompt)


def translate_en2ko(text):
    prompt = f"[INST] 다음 문장을 한글로 번역하세요.{text} [/INST]"
    return generate(prompt)


def main():
    while True:
        text = input('>')
        en_text = translate_ko2en(text)
        ko_text = translate_en2ko(en_text)
        print('en_text', en_text)
        print('ko_text', ko_text)

if __name__ == "__main__":
    main()

output

$ python iris_test.py 
Downloading shards: 100%|█████████████████████████████████████████████████████████████████████████████████████| 3/3 [00:00<00:00,  4.72it/s]
Loading checkpoint shards: 100%|██████████████████████████████████████████████████████████████████████████████| 3/3 [00:02<00:00,  1.07it/s]
>아이리스는 딥러닝을 기반으로 한 한-영어 문장 번역을 위한 모델이다.
en_text Iris is a model for Korean-to-English sentence translation based on deep learning.</s>
ko_text 아이리스는 딥러닝을 기반으로 한 한국어-영어 문장 번역을 위한 모델이다.</s>

template

ko -> en

[INST] 다음 문장을 영어로 번역하세요.{text} [/INST]

en -> ko

"[INST] 다음 문장을 한글로 번역하세요.{text} [/INST]"

dataset info : translation_v3_346k

dataset name ratio size
sharegpt-deepl-ko-translation 33.86% 117346/346559
aihub-MTPE 5.77% 20000/346559
aihub-techsci2 5.77% 20000/346559
aihub-expertise 5.77% 20000/346559
aihub-humanities 5.77% 20000/346559
aihub-MT-new-corpus 5.77% 20000/346559
aihub-socialsci 5.77% 20000/346559
aihub-parallel-translation 5.77% 20000/346559
aihub-food 5.77% 20000/346559
aihub-techsci 5.77% 20000/346559
aihub-broadcast-content 5.77% 20000/346559
para_pat 5.74% 19898/346559
aihub-patent 2.69% 9315/346559

Evaluation

will be updated soon.

https://github.com/davidkim205/translation

TYPE Model BLEU SBLEU Duplicate Length Exceeds
HuggingFace facebook/nllb-200-distilled-1.3B 0.26 0.30 1 3
HuggingFace jbochi/madlad400-10b-mt 0.29 0.38 3 6
HuggingFace Unbabel/TowerInstruct-7B-v0.1 0.32 0.39 1 9
HuggingFace squarelike/Gugugo-koen-7B-V1.1 0.32 0.36 1 3
Cloud deepl 0.39 0.45 0 1
Cloud azure 0.40 0.49 0 3
Cloud google 0.40 0.49 0 2
HuggingFace davidkim205/iris-7b(ours) 0.40 0.43 0 3