anonym-ous
/

tempgraphrag-artifacts

temporal-reasoning

knowledge-graph

retrieval-augmented-generation

Model card Files Files and versions

tempgraphrag-artifacts / README.md

anonym-ous's picture

fix card title: Prompting (not Supervision)

5507e74 verified about 1 month ago

|

History Blame Contribute Delete

2.75 kB

	---
	license: other
	language:
	- en
	tags:
	- temporal-reasoning
	- knowledge-graph
	- graphrag
	- retrieval-augmented-generation
	- lora
	- peft
	pretty_name: Temporal-Aware GraphRAG artifacts (anonymous)
	---

	# Temporal-Aware GraphRAG — anonymous data archive

	Anonymous double-blind artifact accompanying the EMNLP submission *"The Lever Is
	the Prompt: Retrieval-Conditioned Prompting in Temporal-Aware GraphRAG."* This
	is the data + trained-adapter half; the code is in the companion anonymous
	repository linked from the paper. See `DATASHEET.md` for provenance.

	> No author, affiliation, or identifying information is included. Please do not
	> attempt to de-anonymize.

	## Contents
	```
	adapters/ LoRA adapters (inference-ready: adapter_model.safetensors +
	adapter_config.json + tokenizer + chat_template). 14 policies:
	sft-v3{,-seed1337,-seed7} Qwen3-8B headline (3 seeds)
	sft-llama31{,-seed1337,-seed7} Llama-3.1-8B cross-arch (3 seeds)
	sft-mistral{,-seed1337,-seed7} Mistral-7B cross-arch (3 seeds)
	sft-multitq{,-seed1337,-seed7} MultiTQ cross-benchmark (3 seeds)
	sft-multitq-{llama,mistral} MultiTQ cross-arch
	eval/ Per-question predictions + gold for every reported run (74 JSONs):
	TempBench 3-seed, 3-hop ablation, empty/shuffled-evidence,
	data-scale (1k/2k), full-test LLM-judge runs, MultiTQ, baselines.
	benchmark/ benchmark_labelled.jsonl + labels.tsv (the benchmark of [anon-bench],
	provided as a read-only input; see PROVENANCE.txt).
	sft_data/ Retrieval-conditioned SFT corpora (CoT, terse, MultiTQ).
	logs/ Training / evaluation logs (negative-result evidence).
	```

	## Mapping to the code repository
	Place these so the code repo's scripts find them:
	```
	adapters/<name>/ -> checkpoints/<name>/final/
	eval/*.json -> outputs/eval/
	benchmark/* -> outputs/benchmark/
	sft_data/*.jsonl -> outputs/
	```

	## Notes
	- Adapters are inference-only (training state / optimizer checkpoints removed).
	- Three contaminated judge runs (errored API calls) are excluded; all reported
	judge-EM numbers come from the clean full-test runs included here.

	## License
	Mixed; the `license: other` tag reflects this:
	- Data (eval outputs, benchmark, SFT corpora): derived from Wikidata (CC0) via
	TGB 2.0 — permissive.
	- LoRA adapters: derivative works of their base models and governed by those
	licenses — Qwen3-8B (Apache-2.0), Llama-3.1-8B-Instruct (Llama 3.1 Community
	License), Mistral-7B-Instruct-v0.3 (Apache-2.0). Use of the Llama-based adapters
	is subject to the Llama 3.1 Community License.