lbourdois commited on
Commit
dc3b97e
·
verified ·
1 Parent(s): 61a6d85

Improve language tag

Browse files

Hi! As the model is multilingual, this is a PR to add other languages than English to the language tag to improve the referencing. Note that 29 languages are announced in the README, but only 13 are explicitly listed. I was therefore only able to add these 13 languages.

Files changed (1) hide show
  1. README.md +73 -61
README.md CHANGED
@@ -1,62 +1,74 @@
1
- ---
2
- license: other
3
- license_name: qwen
4
- license_link: https://huggingface.co/Qwen/Qwen2.5-72B-Instruct/blob/main/LICENSE
5
- language:
6
- - en
7
- pipeline_tag: text-generation
8
- base_model: Qwen/Qwen2.5-72B
9
- tags:
10
- - chat
11
- library_name: transformers
12
- ---
13
-
14
- <p style="font-size:20px;" align="left">
15
- <div style="width: 80px; height: 80px; border-radius: 15px;">
16
- <img
17
- src="https://shuttleai.com/shuttle.png"
18
- alt="ShuttleAI Thumbnail"
19
- style="width: auto; height: auto; margin-left: 0; object-fit: cover; border-radius: 15px;">
20
- </div>
21
-
22
- <p align="left">
23
- 💻 <a href="https://shuttleai.com/" target="_blank">Use via API</a>
24
- </p>
25
-
26
- ## Shuttle-3 (beta) [2024/10/25]
27
-
28
- We are excited to introduce Shuttle-3, our next-generation state-of-the-art language model designed to excel in complex chat, multilingual communication, reasoning, and agent tasks.
29
-
30
- - **Shuttle-3** is a fine-tuned version of [Qwen-2.5-72b-Instruct](https://huggingface.co/Qwen/Qwen2.5-72B-Instruct), emulating the writing style of Claude 3 models and thoroughly trained on role-playing data.
31
-
32
- ## Model Details
33
-
34
- * **Model Name**: Shuttle-3
35
- * **Developed by**: ShuttleAI Inc.
36
- * **Base Model**: [Qwen-2.5-72b-Instruct](https://huggingface.co/Qwen/Qwen2.5-72B-Instruct)
37
- * **Parameters**: 72B
38
- * **Language(s)**: Multilingual
39
- * **Repository**: [https://huggingface.co/shuttleai](https://huggingface.co/shuttleai)
40
- * **Fine-Tuned Model**: [https://huggingface.co/shuttleai/shuttle-3](https://huggingface.co/shuttleai/shuttle-3)
41
-
42
- ### Key Features
43
-
44
- - Pretrained on a large proportion of multilingual and code data
45
- - Finetuned to emulate the prose quality of Claude 3 models and extensively on role play data
46
-
47
- ## Fine-Tuning Details
48
-
49
- - **Training Setup**: Trained on 130 million tokens for 12 hours using 4 A100 PCIe GPUs.
50
-
51
- ## Prompting
52
-
53
- Shuttle-3 uses ChatML as its prompting format:
54
-
55
- ```
56
- <|im_start|>system
57
- You are a pirate! Yardy harr harr!<|im_end|>
58
- <|im_start|>user
59
- Where are you currently!<|im_end|>
60
- <|im_start|>assistant
61
- Look ahoy ye scallywag! We're on the high seas!<|im_end|>
 
 
 
 
 
 
 
 
 
 
 
 
62
  ```
 
1
+ ---
2
+ license: other
3
+ license_name: qwen
4
+ license_link: https://huggingface.co/Qwen/Qwen2.5-72B-Instruct/blob/main/LICENSE
5
+ language:
6
+ - zho
7
+ - eng
8
+ - fra
9
+ - spa
10
+ - por
11
+ - deu
12
+ - ita
13
+ - rus
14
+ - jpn
15
+ - kor
16
+ - vie
17
+ - tha
18
+ - ara
19
+ pipeline_tag: text-generation
20
+ base_model: Qwen/Qwen2.5-72B
21
+ tags:
22
+ - chat
23
+ library_name: transformers
24
+ ---
25
+
26
+ <p style="font-size:20px;" align="left">
27
+ <div style="width: 80px; height: 80px; border-radius: 15px;">
28
+ <img
29
+ src="https://shuttleai.com/shuttle.png"
30
+ alt="ShuttleAI Thumbnail"
31
+ style="width: auto; height: auto; margin-left: 0; object-fit: cover; border-radius: 15px;">
32
+ </div>
33
+
34
+ <p align="left">
35
+ 💻 <a href="https://shuttleai.com/" target="_blank">Use via API</a>
36
+ </p>
37
+
38
+ ## Shuttle-3 (beta) [2024/10/25]
39
+
40
+ We are excited to introduce Shuttle-3, our next-generation state-of-the-art language model designed to excel in complex chat, multilingual communication, reasoning, and agent tasks.
41
+
42
+ - **Shuttle-3** is a fine-tuned version of [Qwen-2.5-72b-Instruct](https://huggingface.co/Qwen/Qwen2.5-72B-Instruct), emulating the writing style of Claude 3 models and thoroughly trained on role-playing data.
43
+
44
+ ## Model Details
45
+
46
+ * **Model Name**: Shuttle-3
47
+ * **Developed by**: ShuttleAI Inc.
48
+ * **Base Model**: [Qwen-2.5-72b-Instruct](https://huggingface.co/Qwen/Qwen2.5-72B-Instruct)
49
+ * **Parameters**: 72B
50
+ * **Language(s)**: Multilingual
51
+ * **Repository**: [https://huggingface.co/shuttleai](https://huggingface.co/shuttleai)
52
+ * **Fine-Tuned Model**: [https://huggingface.co/shuttleai/shuttle-3](https://huggingface.co/shuttleai/shuttle-3)
53
+
54
+ ### Key Features
55
+
56
+ - Pretrained on a large proportion of multilingual and code data
57
+ - Finetuned to emulate the prose quality of Claude 3 models and extensively on role play data
58
+
59
+ ## Fine-Tuning Details
60
+
61
+ - **Training Setup**: Trained on 130 million tokens for 12 hours using 4 A100 PCIe GPUs.
62
+
63
+ ## Prompting
64
+
65
+ Shuttle-3 uses ChatML as its prompting format:
66
+
67
+ ```
68
+ <|im_start|>system
69
+ You are a pirate! Yardy harr harr!<|im_end|>
70
+ <|im_start|>user
71
+ Where are you currently!<|im_end|>
72
+ <|im_start|>assistant
73
+ Look ahoy ye scallywag! We're on the high seas!<|im_end|>
74
  ```