PanicButtonPressed commited on
Commit
047c466
·
verified ·
1 Parent(s): a5479e2

Upload README.md

Browse files
Files changed (1) hide show
  1. README.md +58 -3
README.md CHANGED
@@ -1,3 +1,58 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # NoxtuaCompliance
2
+
3
+ Noxtua-Compliance-70B-V1 is a specialized large language model designed for legal compliance applications. It is finetuned from the Llama-3-70B-Instruct model using a custom legal cases dataset to understand more complex contexts and achieve precise results when analyzing complex legal issues.
4
+
5
+ ## Model details
6
+
7
+ Model Name: Noxtua-Compliance-70B-V1
8
+
9
+ Base Model: Llama-3-70B-Instruct
10
+
11
+ Parameter Count: 70 billion
12
+
13
+ ## Run with vllm
14
+
15
+ ```bash
16
+ docker run --runtime nvidia --gpus=all -v ~/.cache/huggingface:/root/.cache/huggingface -p 8000:8000 --ipc=host vllm/vllm-openai:v0.6.6.post1 --model ACATECH/ncos --tensor-parallel-size=2 --disable-log-requests --max-model-len 120000 --gpu-memory-utilization 0.95
17
+ ```
18
+
19
+ ## Use with transformers
20
+
21
+ See the snippet below for usage with Transformers:
22
+
23
+ ```python
24
+ import torch
25
+ import transformers
26
+
27
+ model_id = "ACATECH/ncos"
28
+ tokenizer = transformers.AutoTokenizer.from_pretrained(model_id)
29
+ tokenizer.pad_token_id = tokenizer.eos_token_id
30
+
31
+ pipeline = transformers.pipeline(
32
+ "text-generation",
33
+ model=model_id,
34
+ tokenizer=tokenizer,
35
+ max_new_tokens=1024,
36
+ torch_dtype = torch.float16,
37
+ device_map="auto",
38
+ trust_remote_code=True
39
+ )
40
+
41
+ messages = [
42
+ {"role": "system", "content": "You are an intelligent AI assistant in the legal domain called Noxtua NCOS from the company Xayn. You will assist the user with care, respect and professionalism. Always answer in the same language as the question. Freely use legal jargon."},
43
+ {"role": "user", "content": "Carry out an entire authority check of the following text."},
44
+ ]
45
+
46
+ print(pipeline(messages))
47
+ ```
48
+
49
+ Please consider setting temperature = 0 to get consistent outputs.
50
+
51
+ ### Framework versions
52
+
53
+ - Transformers 4.47.1
54
+ - Pytorch 2.5.1+cu121
55
+
56
+ ## Recommended Hardware
57
+
58
+ Running this model requires 2 or more 80GB GPUs, e.g. NVIDIA A100, with at least 150GB of free disk space.