Reinforcement Learning
Safetensors
qwen3
paulwilczewski's picture
Creating model card
1699056 verified
|
raw
history blame
1.57 kB
metadata
license: other
datasets:
  - LightningRodLabs/future-as-label-paper-training-dataset
base_model:
  - Qwen/Qwen3-32B
pipeline_tag: reinforcement-learning

Model Card

Model Description

This model is a fine-tuned derivative of Qwen3-32B, trained using reinforcement learning on the future-as-label dataset to improve forecasting and long-horizon reasoning behavior.

  • Developed by: LightningRod Labs
  • Model type: Large language model (decoder-only transformer)
  • Language(s): English (primarily)
  • License: Qwen3 License
  • Finetuned from: Qwen/Qwen3-32B

Model Sources

Uses

Direct Use

Research and applied use cases involving:

  • Binary event prediction
  • Real-world forecasting
  • Synthetic data generation

Out-of-Scope Use

The model is not intended for:

  • Safety-critical or regulated domains
  • Deployment without additional evaluation
  • Use cases restricted by the Qwen3 license terms

Training Data

The model was fine-tuned on:

  • LightningRodLabs/future-as-label-paper-training-dataset

Limitations

As a fine-tuned derivative model, behavior may differ from the base Qwen3 model and may exhibit hallucinations or reasoning errors.

License and Attribution

This model is a fine-tuned derivative of Qwen3-32B.

The model weights are released under the Qwen3 License. All original license terms, conditions, and attribution requirements apply.

See the original Qwen3 license for full details.

Model Card Contact

https://www.lightningrod.ai/