BataAI — Монгол хэлний AI

Phi-3-mini-4k-instruct загварыг монгол хэлний датасет дээр fine-tune хийсэн.

Ашиглах

from transformers import AutoTokenizer, AutoModelForCausalLM
import torch

model_id = "battsengel4567/BataAiNew0.4V"
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
    trust_remote_code=True,
)

system = "Чи бол BataAI — монгол хэлээр туслах чадварлаг хиймэл оюун ухаан."

def chat(user_input):
    text = "<|system|>\n" + system + "<|end|>\n<|user|>\n" + user_input + "<|end|>\n<|assistant|>\n"
    inputs = tokenizer(text, return_tensors="pt").to(model.device)
    with torch.no_grad():
        out = model.generate(
            **inputs,
            max_new_tokens=200,
            temperature=0.7,
            do_sample=True,
            repetition_penalty=1.1,
            pad_token_id=tokenizer.eos_token_id,
        )
    return tokenizer.decode(out[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True)

print(chat("Чи хэн бэ?"))

Мэдээлэл

  • Суурь: Phi-3-mini-4k-instruct (3.8B)
  • Датасет: 30,527 монгол бичлэг
  • Хэл: Монгол, Англи
Downloads last month
5
Safetensors
Model size
4B params
Tensor type
F16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for battsengel4567/BataAiNew0.4V

Finetuned
(853)
this model
Quantizations
1 model