petkopetkov commited on
Commit
6ff85cb
·
verified ·
1 Parent(s): bb793a7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -6
README.md CHANGED
@@ -9,9 +9,7 @@ library_name: transformers
9
 
10
  Correct conversion to Hugging Face format of the https://huggingface.co/AntonV/mamba2-370m-hf model. This fixes the error that occurs during saving the weights:
11
 
12
- ```python
13
- RuntimeError: The weights trying to be saved contained shared tensors [{'backbone.embeddings.weight', 'lm_head.weight'}] that are mismatching the transformers base configuration. Try saving using `safe_serialization=False` or remove this tensor sharing.
14
- ```
15
 
16
  ## Usage
17
  ```python
@@ -27,9 +25,7 @@ print(tokenizer.batch_decode(out))
27
 
28
  This doesn't fix the Mamba2 cache error during evaluation in SFTTrainer:
29
 
30
- ```python
31
- TypeError: Unsupported types (<class 'transformers.models.mamba2.modeling_mamba2.Mamba2Cache'>) passed to `_pad_across_processes`. Only nested list/tuple/dicts of objects that are valid for `is_torch_tensor` should be passed.
32
- ```
33
 
34
  A temporary fix is disabling the usage of the cache:
35
 
 
9
 
10
  Correct conversion to Hugging Face format of the https://huggingface.co/AntonV/mamba2-370m-hf model. This fixes the error that occurs during saving the weights:
11
 
12
+ **RuntimeError: The weights trying to be saved contained shared tensors [{'backbone.embeddings.weight', 'lm_head.weight'}] that are mismatching the transformers base configuration. Try saving using `safe_serialization=False` or remove this tensor sharing.**
 
 
13
 
14
  ## Usage
15
  ```python
 
25
 
26
  This doesn't fix the Mamba2 cache error during evaluation in SFTTrainer:
27
 
28
+ **TypeError: Unsupported types (<class 'transformers.models.mamba2.modeling_mamba2.Mamba2Cache'>) passed to `_pad_across_processes`. Only nested list/tuple/dicts of objects that are valid for `is_torch_tensor` should be passed.**
 
 
29
 
30
  A temporary fix is disabling the usage of the cache:
31