ByteDance-Seed/Seed-Coder-8B-Instruct


yuyuzhang committed on Apr 30, 2025

Commit 14dbba3 (verified) · 1 Parent(s): d7cbb90

Update README.md

Files changed (1)

README.md +13 -11


@@ -36,27 +36,29 @@ pip install -U transformers accelerate
 Here is a simple example demonstrating how to load the model and generate code using the Hugging Face `pipeline` API:
 ```python
-import transformers
 import torch
 model_id = "ByteDance-Seed/Seed-Coder-8B-Instruct"
-pipeline = transformers.pipeline(
-    "text-generation",
-    model=model_id,
-    model_kwargs={"torch_dtype": torch.bfloat16},
-    device_map="auto",
-)
 messages = [
     {"role": "user", "content": "Write a quick sort algorithm."},
 ]
-outputs = pipeline(
     messages,
-    max_new_tokens=512,
-)
-print(outputs[0]["generated_text"][-1]["content"])
 ```
 ## Evaluation

 Here is a simple example demonstrating how to load the model and generate code using the Hugging Face `pipeline` API:
 ```python
+from transformers import AutoTokenizer, AutoModelForCausalLM
 import torch
 model_id = "ByteDance-Seed/Seed-Coder-8B-Instruct"
+tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
+model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16, device_map="auto", trust_remote_code=True)
 messages = [
     {"role": "user", "content": "Write a quick sort algorithm."},
 ]
+input_ids = tokenizer.apply_chat_template(
     messages,
+    tokenize=True,
+    return_tensors="pt",
+    add_generation_prompt=True,
+).to(model.device)
+outputs = model.generate(input_ids, max_new_tokens=512)
+response = tokenizer.decode(outputs[0][input_ids.shape[-1]:], skip_special_tokens=True)
+print(response)
 ```
 ## Evaluation
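
Note on the added lines: the new snippet decodes `outputs[0][input_ids.shape[-1]:]` rather than `outputs[0]`, because for decoder-only models `generate` returns the prompt tokens followed by the newly generated ones. Below is a minimal sketch of that slicing using plain Python lists in place of tensors. The token ids and the `strip_prompt` helper are hypothetical and serve only as illustration; they are not part of the README or of `transformers`.

```python
def strip_prompt(sequence, prompt_length):
    """Return only the tokens that follow the prompt (hypothetical helper)."""
    return sequence[prompt_length:]


# Hypothetical token ids standing in for the chat-templated prompt.
prompt_ids = [101, 7, 8, 9]

# A decoder-only generate() call returns the prompt followed by the new tokens.
generated = prompt_ids + [42, 43, 2]

# Same idea as outputs[0][input_ids.shape[-1]:] in the diff above.
new_tokens = strip_prompt(generated, len(prompt_ids))
print(new_tokens)
```

Without the slice, the decoded string would also contain the prompt text, which is likely why the commit slices before calling `tokenizer.decode`.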