spikecodes commited on
Commit
e703839
·
verified ·
1 Parent(s): 6f3f75b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +119 -1
README.md CHANGED
@@ -8,4 +8,122 @@ pipeline_tag: text2text-generation
8
  tags:
9
  - code
10
  - legal
11
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
8
  tags:
9
  - code
10
  - legal
11
+ library_name: peft
12
+ base_model: mistralai/Mistral-7B-v0.1
13
+ ---
14
+ # Model Card for 911 Operator Assistant
15
+
16
+ This model is a fine-tuned version of Mistral-7B-v0.1, designed to assist 911 operators in handling emergency calls professionally and efficiently.
17
+
18
+ ## Model Details
19
+
20
+ ### Model Description
21
+
22
+ - **Developed by:** The model was developed using the dispatch.ipynb notebook
23
+ - **Model type:** Fine-tuned Large Language Model
24
+ - **Language(s) (NLP):** English
25
+ - **License:** MIT
26
+ - **Finetuned from model:** mistralai/Mistral-7B-v0.1
27
+
28
+ ## Uses
29
+
30
+ ### Direct Use
31
+
32
+ This model is intended to be used as an assistant for 911 operators, helping them respond to emergency calls quickly and professionally.
33
+
34
+ ### Out-of-Scope Use
35
+
36
+ This model should not be used as a replacement for trained 911 operators or emergency responders. It is meant to assist, not replace, human judgment in emergency situations.
37
+
38
+ ## Bias, Risks, and Limitations
39
+
40
+ The model may have biases based on the training data used. It should not be relied upon for making critical decisions in emergency situations without human oversight.
41
+
42
+ ### Recommendations
43
+
44
+ Users should always verify the model's outputs and use them in conjunction with established emergency response protocols.
45
+
46
+ ## How to Get Started with the Model
47
+
48
+ Use the following code to initialize the model:
49
+
50
+ ```python
51
+ from peft import PeftModel
52
+ import torch
53
+ from transformers import AutoModelForCausalLM, AutoTokenizer
54
+
55
+ BASE_MODEL = "mistralai/Mistral-7B-v0.1"
56
+ LORA_CHECKPOINT = "./lora_adapters/checkpoint-200/"
57
+
58
+ model, tokenizer = setup_model_and_tokenizer(BASE_MODEL)
59
+ model = PeftModel.from_pretrained(model, LORA_CHECKPOINT)
60
+ model.to(torch.device("xpu" if torch.xpu.is_available() else "cpu"))
61
+ ```
62
+
63
+ ## Training Details
64
+
65
+ ### Training Data
66
+
67
+ The model was fine-tuned on a dataset of 911 call transcripts, using the "spikecodes/911-call-transcripts" dataset.
68
+
69
+ ### Training Procedure
70
+
71
+ #### Training Hyperparameters
72
+
73
+ - **Batch size:** 4
74
+ - **Learning rate:** 2e-5
75
+ - **Epochs:** 7.62 (based on max_steps)
76
+ - **Max steps:** 200
77
+ - **Warmup steps:** 20
78
+ - **Weight decay:** Not specified
79
+ - **Gradient accumulation steps:** 4
80
+ - **Training regime:** BFloat16 mixed precision
81
+
82
+ #### Speeds, Sizes, Times
83
+
84
+ - **Training time:** Approximately 800.64 seconds (13.34 minutes)
85
+
86
+ ## Evaluation
87
+
88
+ ### Testing Data, Factors & Metrics
89
+
90
+ #### Testing Data
91
+
92
+ The model was evaluated on a validation set derived from the same dataset used for training.
93
+
94
+ ## Environmental Impact
95
+
96
+ - **Hardware Type:** Intel(R) Data Center GPU Max 1100
97
+ - **Hours used:** Approximately 0.22 hours (13.34 minutes)
98
+
99
+ ## Technical Specifications
100
+
101
+ ### Model Architecture and Objective
102
+
103
+ The model uses the Mistral-7B architecture with LoRA (Low-Rank Adaptation) for efficient fine-tuning.
104
+
105
+ ### Compute Infrastructure
106
+
107
+ #### Hardware
108
+
109
+ Intel(R) Data Center GPU Max 1100
110
+
111
+ #### Software
112
+
113
+ - Python 3.9.18
114
+ - PyTorch 2.1.0.post0+cxx11.abi
115
+ - Transformers library
116
+ - PEFT library
117
+ - Intel Extension for PyTorch
118
+
119
+ ## Model Card Authors
120
+
121
+ https://github.com/spikecodes
122
+
123
+ ## Model Card Contact
124
+
125
+ For more information, please email me (using the contact button on my website: https://spike.codes) and refer to the repositories of the used libraries and base model.
126
+
127
+ ### Framework versions
128
+
129
+ - PEFT 0.11.1