Llama3.1-Nemo-Thinker

Llama3.1-Nemo-Thinker is a fine-tuned version of nvidia/Llama-3.1-Nemotron-Nano-8B-v1. It was trained on a subset of the agentlans/train-of-thought dataset to improve step-by-step reasoning and control over internal “thinking” behavior. The goal is to make the model more stable and easier to guide when reasoning through complex tasks.

Features

  • More reliable output: Handles a wider range of sampling settings while keeping responses coherent.
  • Switchable reasoning mode: Toggle between “thinking” and “non-thinking” (default) modes using system prompts.
  • Better math and logic: Inherits strong reasoning and math abilities from the Nemotron base and improved with DeepSeek and OpenAI o1 derived data.
  • Business-friendly licensing: Uses a more permissive commercial license than Llama 3.1 8B.

Limitations

  • Looping risk: May repeat itself when “thinking”; use a repetition penalty to reduce this.
  • Reduced original capabilities: Some strengths of the base model may be slightly weaker.
  • Shorter context window: Supports up to 2048 tokens during finetuning.
  • Single-turn focus: Not designed for multi-turn conversations or non-English text.

Example

Click here for an example in thinking mode

System prompt: Think using <think> tags and write your final answer in <response> tags.

Input: The current time is 16:18. Find the angle between the hour and minute hands on an analogue clock.

Output:

<think>Okay, so I need to find the angle between the hour and minute hands at 16:18. Let me think about how to approach this. I remember that there are formulas for calculating the positions of the hour and minute hands based on the time. Let me try to recall them.

First, the minute hand. Since there are 60 minutes on a clock, each minute, the minute hand moves 360 degrees divided by 60, which is 6 degrees per minute. So, for each minute past the hour, the minute hand moves 6 degrees. At 18 minutes past the hour, the minute hand would be at 18 * 6 = 108 degrees from the top (12 o'clock position).

Now, the hour hand. The hour hand moves much slower. It takes 12 hours to complete a full circle, which is 360 degrees. So, per hour, the hour hand moves 360/12 = 30 degrees per hour. But since it's also moving as the minutes pass, we need to account for that. So, each minute, the hour hand moves 30 degrees per hour divided by 60 minutes, which is 0.5 degrees per minute. 

At 16:18, the hour is 4 (since 16 - 12 = 4), so the hour hand is at 4 * 30 = 120 degrees from the top. But we also need to add the movement due to the 18 minutes. So, 18 minutes * 0.5 degrees per minute = 9 degrees. Therefore, the total position of the hour hand is 120 + 9 = 129 degrees.

Now, the minute hand is at 108 degrees, and the hour hand is at 129 degrees. To find the angle between them, we subtract the smaller position from the larger one: 129 - 108 = 21 degrees. But wait, the clock is a circle, so the angle could also be 360 - 21 = 339 degrees. However, we take the smaller angle, which is 21 degrees.

Wait, let me double-check. The formula I remember is |30H - 5.5M|, where H is the hour and M is the minutes. Let me apply that. Here, H is 4 (since 16 is 4 PM), and M is 18. So, 30*4 = 120, 5.5*18 = 99. The difference is |120 - 99| = 21 degrees. That matches my previous calculation. Then, since 21 degrees is less than 180, that's the angle. So, the answer should be 21 degrees.

But let me verify once more. Maybe I made a mistake in the hour hand position. The hour hand at 4 o'clock is 120 degrees. Each minute adds 0.5 degrees, so 18 minutes would add 9 degrees, making it 129 degrees. The minute hand at 18 minutes is 18*6 = 108 degrees. The difference is 21 degrees. Yes, that seems correct.

Alternatively, if I think about how many minutes past 4 o'clock it is, which is 18 minutes. The minute hand moves 6 degrees per minute, so 108 degrees. The hour hand moves 0.5 degrees per minute, so 4*30 + 18*0.5 = 120 + 9 = 129 degrees. The difference is 21 degrees. That's consistent.

Another way to check: The minute hand at 18 minutes is at 108 degrees. The hour hand at 4:18 is 4 + 18/60 hours, which is 4.3 hours. Multiply by 30 degrees per hour: 4.3 * 30 = 129 degrees. The difference is 21 degrees. Yep, same result.

So, all methods point to 21 degrees. Therefore, the angle between the hour and minute hands at 16:18 is 21 degrees.</think><response>The angle between the hour and minute hands at 16:18 is **21 degrees**. 

**Step-by-Step Explanation:**
1. **Minute Hand Position:**  
   Each minute, the minute hand moves \(6^\circ\) (since \(360^\circ/60 = 6^\circ\)).  
   At 18 minutes: \(18 \times 6 = 108^\circ\).

2. **Hour Hand Position:**  
   - The hour hand moves \(30^\circ\) per hour (clockwise).  
   - At 4 o'clock: \(4 \times 30 = 120^\circ\).  
   - It also moves \(0.5^\circ\) per minute (to account for the 18 minutes past 4): \(18 \times 0.5 = 9^\circ\).  
   - Total hour hand position: \(120 + 9 = 129^\circ\).

3. **Angle Between Them:**  
   Subtract the smaller position from the larger:  
   \(|129^\circ - 108^\circ| = 21^\circ\).  
   (Since \(21^\circ &lt; 180^\circ\), this is the smaller angle.)

**Final Answer:** \(\boxed{21}\) degrees.</response>

Training

  • Method: QLoRA fine-tuning (4-bit quantization)
  • Framework: LLaMA Factory with the Liger kernel
  • Data used: 20 000 samples from the 100K train-of-thought split
  • Epochs: 1
  • NEFTune: alpha = 5
  • Sequence packing: Enabled
  • LoRA settings: rank = 32, alpha = 64, with rslora enabled

See Also

agentlans/Llama3.1-deep-o1 which is based on a merged model, not limited to Nemotron Nano.

Downloads last month
13
Safetensors
Model size
8B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for agentlans/Llama3.1-Nemo-Thinker

Finetuned
(9)
this model
Quantizations
2 models

Dataset used to train agentlans/Llama3.1-Nemo-Thinker