File size: 1,799 Bytes
868a03d |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 |
---
license: mit
task_categories:
- question-answering
- code-understanding
language:
- en
pretty_name: MiniTorch GraphQA
tags:
- code reasoning
- graph-based QA
- symbolic reasoning
- minitorch
- trm
size_categories:
- n<1K
source_datasets:
- original
---
# MiniTorch GraphQA
A dataset of 100 developer-style questions over the [MiniTorch](https://github.com/atgctg/minitorch) codebase, designed for **structured, graph-based code reasoning**. Each question maps to a precise function or class in the source code and is intended for use with **symbolic reasoning models** like the Tiny Recursion Model (TRM).
## Intended Use
This dataset is designed to evaluate **lightweight, recursive reasoning agents** that operate over **retrieved subgraphs of code** (e.g., 10-node neighborhoods). It is **not** for code generation or language modeling.
## Data Format
Each line in `questions.jsonl` is a JSON object:
```json
{
"id": "minitorch-001",
"question": "Which function computes the derivative of scalar addition?",
"answer_node": "minitorch.scalar.Scalar.add_back",
"module": 1,
"reasoning_type": "autodiff"
}
id: unique identifier
question: natural language developer question
answer_node: fully qualified name of the ground-truth function/class
module: MiniTorch module number (0–6)
reasoning_type: category of reasoning required
License
MIT License.
Citation
If you use this dataset, please cite:
@dataset{minitorch_graphqa_2025,
author = {Sarosh Quraishi},
title = {MiniTorch GraphQA: A Symbolic Reasoning Benchmark for Code},
year = {2025},
publisher = {Hugging Face},
url = {https://huggingface.co/datasets/saroshq/minitorch-graphqa}
}
Acknowledgements
MiniTorch: https://github.com/atgctg/minitorch
|