Text Generation
Transformers
Safetensors
English
mistral
Trained with AutoTrain
text-generation-inference
Sirclavin committed on
Commit
1f6ebdc
·
1 Parent(s): 4e11be5

Update README.md

  1. README.md +123 -53
README.md CHANGED
@@ -1,70 +1,140 @@
1
  ---
2
- license: apache-2.0
3
  tags:
4
- - autotrain
5
- - text-generation
6
  base_model: Locutusque/TinyMistral-248M
7
  datasets:
8
- - OpenAssistant/oasst_top1_2023-08-25
 
 
9
  widget:
10
- - text: |-
11
- <|im_start|>user
12
- Write the specs of a game about trolls and warriors in a fantasy world.<|im_end|>
13
- <|im_start|>assistant
14
- The game is an adventure game that takes place on a planet, where players must explore their unique abilities to survive. Players can use different strategies such as collecting items or trading them for gold or silver coins, but they also need to learn how to deal with obstacles and find new ways to escape.<|im_end|>
15
- <|im_start|>user
16
- Could you tell me something curious about the Earth?<|im_end|>
17
- <|im_start|>assistant
18
- The planet is a large, rocky world with an atmosphere of 10 billion years old and a surface area around 25 million miles (36 million kilometers) wide.<|im_end|>
19
- <|im_start|>user
20
- What are some potential applications for quantum computing?<|im_end|>
21
- <|im_start|>assistant
22
  inference:
23
  parameters:
24
- max_new_tokens: 64
25
- repetition_penalty: 1.18
26
  ---
27
 
28
- # Locutusque's TinyMistral-248M trained on OpenAssistant TOP-1 Conversation Threads
29
 
30
- - Base model: [Locutusque/TinyMistral-248M](https://huggingface.co/Locutusque/TinyMistral-248M/blob/90b89d18fdf27937dc04ab8a9b543c5af2991c7f/README.md)
31
- - Dataset: [OpenAssistant/oasst_top1_2023-08-25](https://huggingface.co/datasets/OpenAssistant/oasst_top1_2023-08-25)
32
 
33
- ## Recommended Prompt Format
34
 
35
  ```
36
  <|im_start|>user
37
  {message}<|im_end|>
38
  <|im_start|>assistant
39
- ```
40
-
41
- ## How it was trained
42
-
43
- ```ipython
44
- %pip install autotrain-advanced
45
-
46
- !autotrain setup
47
-
48
- !autotrain llm \
49
- --train \
50
- --trainer "sft" \
51
- --model './TinyMistral-248M/' \
52
- --model_max_length 4096 \
53
- --block-size 1024 \
54
- --project-name 'trained-model' \
55
- --data-path "OpenAssistant/oasst_top1_2023-08-25" \
56
- --train_split "train" \
57
- --valid_split "test" \
58
- --text-column "text" \
59
- --lr 1e-5 \
60
- --train_batch_size 2 \
61
- --epochs 5 \
62
- --evaluation_strategy "steps" \
63
- --save-strategy "steps" \
64
- --save-total-limit 2 \
65
- --warmup-ratio 0.05 \
66
- --weight-decay 0.0 \
67
- --gradient-accumulation 8 \
68
- --logging-steps 10 \
69
- --scheduler "constant"
70
  ```
 
1
  ---
 
2
  tags:
3
+ - autotrain
4
+ - text-generation
5
  base_model: Locutusque/TinyMistral-248M
6
  datasets:
7
+ - CrabfishAI/ReasoningQuestions
8
+ - OpenAssistant/oasst_top1_2023-08-25
9
+ - nRuaif/OpenOrca-GPT3.5
10
  widget:
11
+ - text: |-
12
+ <|im_start|>user
13
+ Find me a list of some nice places to visit around the world.<|im_end|>
14
+
15
+
16
+
17
+
18
+ <|im_start|>assistant
19
+ - text: |-
20
+ <|im_start|>user
21
+ Tell me a story about some magical place.<|im_end|>
22
+
23
+
24
+
25
+
26
+ <|im_start|>assistant
27
+
28
+ - text: |-
29
+ <|im_start|>user
30
+ Write a python code to print numbers from 1 to 100 in sequence.<|im_end|>
31
+
32
+
33
+
34
+
35
+ <|im_start|>assistant
36
+ - text: |-
37
+ <|im_start|>user
38
+ What is biology?<|im_end|>
39
+
40
+
41
+
42
+
43
+ <|im_start|>assistant
44
+ - text: >-
45
+ <|im_start|>user
46
+ I love taking photos of nature. Can you come up with a 3-day itinerary
47
+ visiting the most photogenic places in Iceland?<|im_end|>
48
+
49
+
50
+
51
+
52
+ <|im_start|>assistant
53
  inference:
54
  parameters:
55
+ max_new_tokens: 32
56
+ repetition_penalty: 1.15
57
+ do_sample: true
58
+ temperature: 0.5
59
+ top_p: 0.5
60
+ license: apache-2.0
61
+ language:
62
+ - en
63
  ---
64
 
 
65
 
66
+ # InstructWise 470M - A virtual assistant.
67
+ Note: We'll be releasing more versions of InstructWise soon, with the goal of making memory-efficient models while maintaining performance. Thank you!
68
+
69
+ Introduction: InstructWise is a series of models created to act as helpful virtual assistants while maintaining memory efficiency.
70
+ ## Credits
71
+ - **Base Model:** Locutusque/TinyMistral-248M
72
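+ - **Datasets Used:** CrabfishAI/ReasoningQuestions, OpenAssistant/oasst_top1_2023-08-25, and nRuaif/OpenOrca-GPT3.5 (as listed in the metadata above)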
73
+ - **License:** Apache License 2.0
74
+
75
+ ## Features
76
+ - **Maintaining performance while being memory efficient:** RAM usage: 7.1 GB; VRAM usage: 0.6 GB (approximate).
77
+ - **Acts as a helpful virtual assistant:** InstructWise serves as a versatile assistant whose key strength is giving instructive, detailed responses to user prompts.
78
+ - **Coding:** The model can also handle coding tasks.
79
+ - **Assisting capabilities:** Can assist with a wide range of tasks.
80
 
81
+ ## Uses
82
 
83
+ InstructWise finds application in various domains, including:
84
+
85
+ - **Assistance in Writing:** Aid authors, bloggers, and students in drafting articles and essays.
86
+ - **Chatbot Development:** Power conversational agents with human-like responses.
87
+ - **Prototyping and Idea Generation:** Facilitate brainstorming sessions for product development.
88
+ - **Personal Assistant Applications:** Assist users in drafting emails and messages.
89
+ and many more.
90
+
91
+ ## Direct Use Cases
92
+
93
+ InstructWise can be directly employed for:
94
+
95
+ 1. **Educational Support:**
96
+ - Assist users in learning new topics with detailed explanations and step-by-step instructions.
97
+
98
+ 2. **Content Creation:**
99
+ - Generate creative content based on prompts, aiding content creators in the writing process.
100
+
101
+ 3. **Code Assistance:**
102
+ - Provide guidance on coding queries, improve code documentation, and generate code snippets for developers.
103
+
104
+ 4. **Interactive Conversations:**
105
+ - Enhance chatbots or virtual assistants with informative and helpful responses for users.
106
+
107
+ 5. **Q&A Platforms:**
108
+ - Power question and answer platforms, offering detailed and insightful answers on various topics.
109
+
110
+ 6. **Technical Writing Support:**
111
+ - Assist writers and technical communicators with suggestions for clarity and informativeness.
112
+
113
+ 7. **Idea Expansion:**
114
+ - Facilitate the expansion and development of ideas by providing detailed insights and suggestions.
115
+
116
+
117
+ ## Limitations
118
+
119
+ 1. **Content Quality**: The model's output may vary in quality, and there's a possibility it might generate content that is nonsensical, irrelevant, or grammatically incorrect.
120
+
121
+ 2. **Bias and Sensitivity**: The model is trained on diverse data, but it may inadvertently exhibit biases or generate content that is sensitive or inappropriate. Exercise caution and review generated text before use.
122
+
123
+ 3. **Inappropriate Language**: The model might generate text that includes offensive language or inappropriate content. Be mindful of this, especially in applications where maintaining a respectful and inclusive tone is essential.
124
+
125
+
126
+
127
+ ## Disclaimer
128
+
129
+ - **Use with Caution**: This model is a tool that should be used with caution. Always review and validate the generated text before incorporating it into any application or publication.
130
+
131
+ - **Not for Critical Applications**: Avoid using the model for critical applications where accuracy and reliability are paramount. The model is intended for creative and exploratory purposes.
132
+
133
+ - **Ongoing Improvement**: The model may be updated or fine-tuned for better performance. Stay informed about updates and consider using the latest version for improved results.
134
+
135
+ ## Recommended Prompt Format
136
  ```
137
  <|im_start|>user
138
  {message}<|im_end|>
139
  <|im_start|>assistant
140
  ```
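
The prompt format and the `inference.parameters` values in the updated README can be put together in a short Python sketch. This is an illustration rather than the author's code: `build_prompt` and `GENERATION_KWARGS` are hypothetical names, and the choice to end the prompt right after `<|im_start|>assistant` (with no trailing newline) is an assumption based on how the widget examples are written.

```python
from typing import Dict, List

# Sampling settings copied from the `inference.parameters` block of the new README.
GENERATION_KWARGS = {
    "max_new_tokens": 32,
    "repetition_penalty": 1.15,
    "do_sample": True,
    "temperature": 0.5,
    "top_p": 0.5,
}


def build_prompt(messages: List[Dict[str, str]]) -> str:
    """Render a conversation in the README's prompt format (hypothetical helper)."""
    parts = []
    for message in messages:
        # Each completed turn: role tag, newline, content, end tag, newline.
        parts.append(
            f"<|im_start|>{message['role']}\n{message['content']}<|im_end|>\n"
        )
    # Finish with an open assistant turn so the model writes the reply.
    parts.append("<|im_start|>assistant")
    return "".join(parts)


prompt = build_prompt([{"role": "user", "content": "What is biology?"}])
print(prompt)
```

With the `transformers` library installed, the resulting string and settings could be passed to a text-generation pipeline, for example `pipeline("text-generation", model=MODEL_ID)(prompt, **GENERATION_KWARGS)`, where `MODEL_ID` is a placeholder because the repository name does not appear in this diff.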