Eclipse-Senpai commited on
Commit
d137cc3
·
verified ·
1 Parent(s): 7f614be

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -6
README.md CHANGED
@@ -62,8 +62,6 @@ GGUF builds for `llama.cpp`, LM Studio, and Ollama are available at [KeyLM-75M-I
62
 
63
  ## How to Use
64
 
65
- KeyLM ships its own modeling code, so load it with `trust_remote_code=True`. It requires `transformers>=4.51`.
66
-
67
  ```python
68
  import torch
69
  from transformers import AutoModelForCausalLM, AutoTokenizer
@@ -85,8 +83,6 @@ outputs = model.generate(
85
  print(tokenizer.decode(outputs[0][inputs.shape[1]:], skip_special_tokens=True))
86
  ```
87
 
88
- The model uses a plain `User:` / `Assistant:` chat format, applied automatically by `apply_chat_template`. Assistant turns end with `</s>`.
89
-
90
  ## Evaluation
91
 
92
  ### Instruction following (IFEval)
@@ -103,7 +99,7 @@ KeyLM beats the original SmolLM-135M-Instruct at roughly half the size and a fra
103
 
104
  ### Base vs Instruct
105
 
106
- The base and instruction-tuned checkpoints across all benchmarks. Commonsense and knowledge tasks are zero-shot via `lm_eval` (accuracy; ARC and HellaSwag length-normalized); IFEval is the 4-metric average. Bold marks the stronger version per row.
107
 
108
  | Benchmark | KeyLM-75M (base) | KeyLM-75M-Instruct | Random |
109
  |---|---|---|---|
@@ -133,7 +129,7 @@ KeyLM was pretrained from random initialization on approximately 18B tokens, dra
133
 
134
  ### Post-training
135
 
136
- Instruction tuning used `smol-smoltalk`, `ultrachat_200k`, and several `smoltalk2` splits (magpie, persona instruction-following, science, OpenHermes, system chats, summarization), with assistant-only loss masking, plus a set of custom synthetic instruction-following examples. A final personality tuning pass produced the released checkpoint.
137
 
138
  ## Limitations
139
 
 
62
 
63
  ## How to Use
64
 
 
 
65
  ```python
66
  import torch
67
  from transformers import AutoModelForCausalLM, AutoTokenizer
 
83
  print(tokenizer.decode(outputs[0][inputs.shape[1]:], skip_special_tokens=True))
84
  ```
85
 
 
 
86
  ## Evaluation
87
 
88
  ### Instruction following (IFEval)
 
99
 
100
  ### Base vs Instruct
101
 
102
+ The base and instruction-tuned checkpoints across all benchmarks. Commonsense and knowledge tasks are zero-shot via `lm_eval` (accuracy; ARC and HellaSwag length-normalized); IFEval is the 4-metric average.
103
 
104
  | Benchmark | KeyLM-75M (base) | KeyLM-75M-Instruct | Random |
105
  |---|---|---|---|
 
129
 
130
  ### Post-training
131
 
132
+ Instruction tuning used `smol-smoltalk`, `ultrachat_200k`, and several `smoltalk2` splits (magpie, persona instruction-following, science, OpenHermes, system chats, summarization), with assistant-only loss masking, plus a set of custom synthetic instruction-following examples.
133
 
134
  ## Limitations
135