We release the trained checkpoints from our paper (S3-CoT: Self-Sampled Succinct Reasoning Enables Efficient Chain-of-Thought LLMs).

Model Checkpoints:

Base Model	Our Trained Model	Link
DeepSeek-R1-Distill-Qwen-7B	S3-CoT-DeepSeek-R1-Distill-Qwen-7B	https://huggingface.co/yrdu/S3-CoT-DeepSeek-R1-Distill-Qwen-7B
Qwen2.5-7B-Instruct	S3-CoT-Qwen2.5-7B-Instruct	https://huggingface.co/yrdu/S3-CoT-Qwen2.5-7B-Instruct
Llama-3.1-8B-Instruct	S3-CoT-Llama-3.1-8B-Instruct	https://huggingface.co/yrdu/S3-CoT-Llama-3.1-8B-Instruct
Qwen3-4B-Thinking-2507	S3-CoT-Qwen3-4B-Thinking-2507	https://huggingface.co/yrdu/S3-CoT-Qwen3-4B-Thinking-2507

Inference Template: We provide the System 1 and System 2 templates used in our training. Each template is appended to the end of the user prompt. At inference time, use the System 1 template to reproduce the results reported in our paper.

from transformers import AutoModelForCausalLM, AutoTokenizer

system1_template = " Please provide as a brief reasoning process as possible, and put your final answer within \\boxed{}"
system2_template = " Please reason step by step, and put your final answer within \\boxed{}"

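# Any checkpoint from the table above works here; this is the DeepSeek-R1-Distill-Qwen-7B variant.
model_name = "yrdu/S3-CoT-DeepSeek-R1-Distill-Qwen-7B"
tokenizer = AutoTokenizer.from_pretrained(model_name)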
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    device_map="auto",
    torch_dtype="bfloat16",
)
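# Placeholders: supply your own generation settings (keyword arguments for model.generate) and question.
generation_config = {}  # must be a dict, because it is unpacked with ** below
prompt = "xxx"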

messages = [
    {"role": "user", "content": prompt+system1_template}
]
text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True
)

model_inputs = tokenizer([text], return_tensors="pt").to(model.device)
generated_ids = model.generate(
    **model_inputs,
    **generation_config,
)

output_ids = generated_ids[0][len(model_inputs.input_ids[0]):].tolist()
output_content = tokenizer.decode(output_ids, skip_special_tokens=True)
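
Both templates ask the model to put its final answer within \boxed{}, so the answer can be pulled out of output_content after decoding. The helper below is a minimal sketch of one way to do that, not the evaluation code used in the paper; the function name extract_boxed_answer is hypothetical, and taking the last \boxed{} occurrence is an assumption about where the final answer appears.

```python
from typing import Optional


def extract_boxed_answer(text: str) -> Optional[str]:
    """Return the contents of the last \\boxed{...} in text, or None if absent."""
    marker = "\\boxed{"
    start = text.rfind(marker)  # assumption: the last occurrence is the final answer
    if start == -1:
        return None
    i = start + len(marker)
    depth = 1  # we are already inside the opening brace of \boxed{
    chars = []
    while i < len(text):
        ch = text[i]
        if ch == "{":
            depth += 1
        elif ch == "}":
            depth -= 1
            if depth == 0:
                # Matching close brace found; nested braces are preserved.
                return "".join(chars).strip()
        chars.append(ch)
        i += 1
    return None  # braces never balanced, e.g. the generation was truncated
```

With the inference code above, this would be called as extract_boxed_answer(output_content). A return value of None usually means the generation stopped before the model wrote its final answer.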

Paper: S3-CoT: Self-Sampled Succinct Reasoning Enables Efficient Chain-of-Thought LLMs (2602.01982)