dmitry315 commited on
Commit
7a040ed
·
verified ·
1 Parent(s): 0f64af7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +60 -3
README.md CHANGED
@@ -1,3 +1,60 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ datasets:
4
+ - dmitry315/fineweb2-modern-greece-sample
5
+ language:
6
+ - el
7
+ pipeline_tag: text-generation
8
+ ---
9
+
10
+ # ELlama1
11
+
12
+ Серия LLM обученных на греческом языке
13
+
14
+ # ELlama1-0.7b
15
+
16
+ Модель в основе которой лежит Qwen (да-да не удивляейтесь).
17
+
18
+ ELlama1-0.7b - pretrain модель, обученная на семпле из fineweb2: [fineweb2-modern-greece-sample](https://huggingface.co/datasets/dmitry315/fineweb2-modern-greece-sample).
19
+
20
+ # Quick Start
21
+
22
+ ## Hugging face
23
+ ```
24
+ import torch
25
+ from transformers import AutoModelForCausalLM, PreTrainedTokenizerFast
26
+
27
+ model_path = "dmitry315/ELlama1-0.7b"
28
+ model = AutoModelForCausalLM.from_pretrained(model_path, torch_dtype=torch.float16, trust_remote_code=True)
29
+ tokenizer = PreTrainedTokenizerFast.from_pretrained(model_path, trust_remote_code=True)
30
+
31
+ text = "Γεια σας , δεν ξερω τιποτα για τον Ηροδοτο , μπορειτε να μου πειτε γι ' αυτον ;"
32
+
33
+ with torch.no_grad():
34
+ inputs = tokenizer(
35
+ text,
36
+ return_tensors="pt",
37
+ padding=True,
38
+ truncation=True,
39
+ max_length=128
40
+ ).to(device)
41
+ outputs = model.generate(
42
+ inputs.input_ids,
43
+ max_length=128,
44
+ temperature=args.temperature,
45
+ top_p=args.top_p,
46
+ do_sample=True,
47
+ pad_token_id=tokenizer.eos_token_id,
48
+ num_return_sequences=1
49
+ )
50
+ generated_text = tokenizer.decode(
51
+ outputs[0],
52
+ skip_special_tokens=True
53
+ )
54
+
55
+ print(generated_text)
56
+ ```
57
+
58
+ # Github
59
+
60
+ Код обучения: [ELlama](https://github.com/Dmitry315/ELlama)