Overview

Inspired by a similar model, this model addresses the same challenge: providing an efficient way to generate questions from raw data. It is highly versatile and requires no specific data formatting; it is even robust enough to handle noisy or low-quality OCR text.

Designed for:

  • Golden dataset generation
  • RAG benchmarking
  • Generating HyDE indices for QnA systems
  • Evaluation corpus bootstrapping
  • Retriever quality testing
  • Graph RAG generation

The model is trained to generate unstructured output consisting of a single atomic question. Due to its small scale, it may struggle to produce correctly formatted structured data (e.g., JSON).

Suggested prompt template

Given the text below, extract ONE question grounded strictly in a single atomic fact.

<text>
<your_text_here>
</text>

Return ONLY the question:

Uploaded finetuned model

  • Developed by: Catlilface
  • License: apache-2.0
  • Finetuned from model : Catlilface/Qwen3.5-0.8B-interrogator

This qwen3_5 model was trained 2x faster with Unsloth and Huggingface's TRL library.

Downloads last month
31
Safetensors
Model size
0.9B params
Tensor type
F32
·
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for catlilface/Qwen3.5-0.8B-interrogator

Unable to build the model tree, the base model loops to the model itself. Learn more.

Datasets used to train catlilface/Qwen3.5-0.8B-interrogator