Add files using upload-large-folder tool

1145a14 verified 1 day ago

4.68 kB

	---
	license: cc-by-nc-4.0
	base_model: Qwen/Qwen2.5-7B-Instruct
	language:
	- en
	- zh
	pipeline_tag: text-generation
	library_name: peft
	tags:
	- hypernetwork
	- hyper-lora
	- lora
	- role-play
	- character-impersonation
	- persona
	- dialogue
	- phase-tree
	datasets:
	- IAAR-Shanghai/phase_tree_data
	---

	# PHASE-Tree Models

	Released model checkpoints for the PHASE-Tree project
	(Psychology-grounded Hierarchical Attribute-Structured Evolving Tree).

	## Download

	The PHASE-Tree codebase expects these checkpoints under
	`PHASE-Tree/phase_tree_models/`. The recommended way to obtain a working copy
	is therefore:

	```bash
	# From the repository root (i.e. inside the cloned PHASE-Tree project):
	cd PHASE-Tree
	hf download IAAR-Shanghai/phase_tree_models --local-dir phase_tree_models
	```

	This places every file under `PHASE-Tree/phase_tree_models/`, matching the
	relative paths used by every script in the codebase (e.g.
	`phase_tree_models/sft/hyper_lora/hypermod.pt`).

	Alternative methods:

	- `git clone https://huggingface.co/IAAR-Shanghai/phase_tree_models`
	(run from the `PHASE-Tree/` root; clones into `phase_tree_models/`
	automatically).
	- Programmatic via `huggingface_hub.snapshot_download(...)` with
	`local_dir="phase_tree_models"`.

	This release contains the single recommended checkpoint for each of the
	two stages in the PHASE-Tree training pipeline. During development we ran a
	larger ablation grid (six hyper-LoRA SFT runs covering warm-start vs
	cold-start initialisation, two learning rates, and trainable vs frozen
	hypernet output heads, plus a separate One-PEFT-Per-User / OPPU baseline
	sweep). Only the checkpoints reported in the paper are bundled here; the
	ablations are kept locally for reproducibility but are not part of the
	release.

	## Layout

	\| Path \| Description \|
	\|------\|-------------\|
	\| `phase_tree_pretrained/` \| Hypernetwork pretrained on the PHASE-Tree character corpus. Used as the warm-start initialisation for the SFT run below. \|
	\| `sft/hyper_lora/` \| The anchor hyper-LoRA SFT run (warm-start, lr=5e-6, trainable heads). This is the checkpoint reported in the PHASE-Tree paper. \|

	Each leaf folder is self-describing via its own `README.md`.

	## Recommended Checkpoint

	For character-conditioned generation, load:

	```
	sft/hyper_lora/hypermod.pt
	```

	The pretrained hypermod (`phase_tree_pretrained/hypermod.pt`) is the upstream
	warm-start dependency of this anchor run, not an independently usable
	inference model. It is included so the training pipeline can be reproduced
	end-to-end.

	```
	phase_tree_pretrained/hypermod.pt ──warm-start──▶ sft/hyper_lora/hypermod.pt
	(pretraining stage) (anchor SFT, recommended)
	```

	## Why a single SFT checkpoint?

	Six hyper-LoRA SFT runs were trained during development, varying
	initialisation, learning rate, and whether the hypernet output heads are
	trainable. The released `sft/hyper_lora/` is the cell selected by the
	LLM-as-judge `character` and `semantic` ratings together with
	Qwen3-Embedding-4B response-vs-reference cosine similarity on a held-out
	evaluation set; the other five cells are ablations and are not bundled.

	Per-step intermediate checkpoints (`it_5000` … `it_40000`) and the full
	post-hoc evaluation artefacts (`eval_ckpt_judge_scores/`,
	`eval_ckpt_val_loss/`) are likewise not bundled. To regenerate them you
	would need to re-run training (`src/scripts/train_phase_tree_qwen_7b.sh`)
	followed by the evaluation scripts under `src/scripts/`.

	## Base Model

	All checkpoints are trained on top of
	[`Qwen/Qwen2.5-7B-Instruct`](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct).
	The hyper-LoRA and pretrained-hypermod checkpoints additionally use
	[`Qwen/Qwen3-Embedding-4B`](https://huggingface.co/Qwen/Qwen3-Embedding-4B)
	as the task-embedding encoder.

	## Loading

	\| Checkpoint \| Loader \|
	\|------------\|--------\|
	\| Pretrained hypermod (`phase_tree_pretrained/`) \| `hyper_llm_modulator.hyper_modulator.load_hypermod_checkpoint(...)` \|
	\| Hyper-LoRA SFT (`sft/hyper_lora/`) \| `hyper_llm_modulator.hyper_modulator.load_hypermod_checkpoint(...)` \|

	The hypermod loader expects a checkpoint directory layout identical to the
	one used here (`hypermod.pt` + sibling `args.yaml` + `adapter_config.json`).
	It reads the architecture from `args.yaml` automatically; no extra
	configuration is required at inference time.

	## Intended Use

	These checkpoints are released as research artefacts for evaluating
	personalised and hypernetwork-based approaches to character-grounded dialogue
	generation. They are not intended for production user-facing
	applications without additional safety filtering.