qwen3-4b-code-reasoning / README.md

ertghiu256

Update README.md

9f1b1c1 verified 10 months ago

preview code

raw

history blame

797 Bytes

metadata

license: apache-2.0
datasets:
  - nvidia/OpenCodeReasoning
  - vicgalle/creative-rubrics-gpt-4.5-o3-R1
base_model:
  - unsloth/Qwen3-4B
tags:
  - unsloth
  - trl
  - sft
  - code
pipeline_tag: text-generation

Qwen 3 Code Reasoning

A small Qwen 3 4 billion parameter model trained on nvidia/OpenCodeReasoning for coding tasks. For Coding, it is recommended to be in thinking mode.

Strengths

Code generation
Logical question answering

Drawbacks

Heavy overthinking
Context overflow

Recommended Usage:

vllm
transformers

GGUF VERSION

gguf