enochlev committed
Commit c19cdc1 · verified · 1 Parent(s): 30747ee

Clean up README: remove patch notes, fix prompt display

Files changed (1)
  1. README.md +1 -8
README.md CHANGED
@@ -18,13 +18,6 @@ Modern safetensors conversion of [xinrongzhang2022/MiniCPM-duplex](https://huggi
  to `model.safetensors`, enabling memory-mapped loading and compatibility with current
  versions of Transformers.
 
- ## Compatibility patch
-
- The original `modeling_minicpm.py` was written against an older Transformers Cache API.
- The copy here is patched for `transformers >= 4.38`: removed `DynamicCache` methods
- `seen_tokens`, `get_max_length()`, and `get_usable_length()` are replaced with
- `get_seq_length()`.
-
  ## Usage
 
  ```python
@@ -40,7 +33,7 @@ model = AutoModelForCausalLM.from_pretrained(
  device_map="auto",
  )
 
- prompt = "<\u7528\u6237>Hello, what can you do?<AI>"
+ prompt = "<用户>Hello, what can you do?<AI>"
  inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
  out = model.generate(**inputs, max_new_tokens=256)
  print(tokenizer.decode(out[0], skip_special_tokens=True))
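
The prompt change appears to be display-only. As a minimal sketch, assuming the README snippet is pasted into a Python source file, the escaped and literal spellings are the same string, so the tokenizer would receive identical input either way. The variable names `escaped`, `literal`, and `same` are illustrative and not part of the README.

```python
# Old README spelling: the two CJK characters written as \u escapes.
escaped = "<\u7528\u6237>Hello, what can you do?<AI>"

# New README spelling: the same characters written literally.
literal = "<用户>Hello, what can you do?<AI>"

# Python resolves \uXXXX escapes when it parses the string literal,
# so both names refer to equal strings.
same = escaped == literal
print(same)

# Code points of the two characters inside the user tag.
print([hex(ord(ch)) for ch in "用户"])
```

The difference only matters for a reader looking at the rendered README, where the escaped form hides the actual characters of the user tag.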