	# Attribution — ulises-c/SocratTeachLLM

	## Original Model

	This is a copy of the model published by the KELE research team:

	> Original: [yuanpan/SocratTeachLLM](https://huggingface.co/yuanpan/SocratTeachLLM)

	The model weights, the LoRA fine-tuning, and all training methodology are the intellectual property of the KELE research team, as described in:

	> Peng, Yuan et al. "KELE: A Multi-Agent Framework for Structured Socratic Teaching with Large Language Models."
	> Findings of the Association for Computational Linguistics: EMNLP 2025.
	> https://aclanthology.org/2025.findings-emnlp.888/

	## Base Model

	Fine-tuned from [THUDM/glm-4-9b-chat](https://huggingface.co/THUDM/glm-4-9b-chat) (GLM4-9B-Chat), a base model released by Zhipu AI.

	## Training Data

	Trained on SocratDataset: 6,803 multi-turn Socratic dialogues in Chinese covering elementary school science. The dialogues were constructed from the CSQ dataset (Liu et al., 2025) using the KELE B.3/B.4 pipeline.

	- Chinese dataset: https://huggingface.co/datasets/ulises-c/SocratDataset
	- English translation: https://huggingface.co/datasets/ulises-c/SocratDataset-EN
	- Original distribution (JSON): https://github.com/yuanpan1020/KELE

	## This Copy

	- Copied by: Ulises Chavarria
	- Context: CSEN 346 (Natural Language Processing), Santa Clara University
	- Reason: The original `yuanpan/SocratTeachLLM` had a minimal README. This copy adds a detailed README with architecture specs, usage examples, the full Table 1 benchmark results from the paper, and cross-references to the associated datasets and evaluation code. No weights were modified.
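
The statement that no weights were modified can be checked independently. The following is a minimal sketch, assuming both repositories have already been downloaded to local directories and that the weight files use the `.safetensors` extension (an assumption; pass a different `pattern` otherwise). The function names and directory layout are hypothetical and are not part of either repository.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def compare_weight_files(
    original_dir: Path,
    copy_dir: Path,
    pattern: str = "*.safetensors",  # assumed weight-file extension
) -> dict[str, bool]:
    """Map each weight file in original_dir to True if the copy is byte-identical.

    A file that is missing from copy_dir is reported as False.
    """
    results: dict[str, bool] = {}
    for original_file in sorted(original_dir.glob(pattern)):
        copied_file = copy_dir / original_file.name
        results[original_file.name] = (
            copied_file.is_file()
            and sha256_of(copied_file) == sha256_of(original_file)
        )
    return results
```

Calling `compare_weight_files(Path("original"), Path("copy"))` on two local downloads (hypothetical paths) returns one entry per weight file; every value should be `True` if the copy is faithful.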

	Repository for downstream evaluation and extension work:
	https://github.com/ulises-c/csen-346

	## License

	Apache 2.0 — consistent with the original model and the GLM4-9B-Chat base model.