bdsaglam's picture
Upload folder using huggingface_hub
c9a2f21 verified
metadata
license: other
license_name: nvidia-open-model-license
license_link: https://huggingface.co/nvidia/Nemotron-Cascade-14B-Thinking/blob/main/LICENSE
base_model: nvidia/Nemotron-Cascade-14B-Thinking
tags:
  - qwen3
  - thinking
  - rl-training

Nemotron-Cascade-14B-Thinking (Modified Chat Template)

This is a modified version of nvidia/Nemotron-Cascade-14B-Thinking with a fixed chat template for RL training compatibility.

Changes

The original Nemotron chat template strips <think> sections from messages when processing inputs. This violates the increasing context requirement for multi-turn RL training (see verifiers documentation).

This version uses a simplified chat template (based on willcb/Qwen3-14B) that preserves thinking tokens in the conversation history, making it suitable for RL training with tools like verifiers.

Model Details

  • Base Model: nvidia/Nemotron-Cascade-14B-Thinking
  • Architecture: Qwen3ForCausalLM
  • Model Type: qwen3
  • Parameters: 14B

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("bdsaglam/Nemotron-Cascade-14B-Thinking")
tokenizer = AutoTokenizer.from_pretrained("bdsaglam/Nemotron-Cascade-14B-Thinking")

License

This model inherits the NVIDIA Open Model License from the base model.