rahul3613
/

lean_byt5_aug

theorem-proving

Model card Files Files and versions

lean_byt5_aug / README.md

rahul3613's picture

Added paper details

992f05b verified 3 days ago

|

history blame contribute delete

1.21 kB

	---
	license: apache-2.0
	language:
	- en
	base_model:
	- google/byt5-small
	tags:
	- theorem-proving
	- lean
	- byt5
	- mathematics
	- code
	---

	# Lean-ByT5
	Lean-ByT5 is a ByT5-small model fine-tuned on the augmented Lean-Mathlib data for Lean theorem proving.

	The model performs sequence-to-sequence generation, generating Lean tactics for a given
	statement goal.

	## Base Model
	- google/byt5-small (Apache License 2.0)

	## Training
	- PyTorch Lightning

	## Related Paper & Citation
	For more details on the training methodology, data augmentation strategy, and dynamic sampling method used for Lean theorem proving, please refer to our paper:

	Enhancing Neural Theorem Proving through Data Augmentation and Dynamic Sampling Method
	Rahul Vishwakarma, Subhankar Mishra
	arXiv:2312.14188 (cs.AI, cs.LG, cs.LO)

	Paper: https://arxiv.org/abs/2312.14188

	If you use this model or build upon this work, please cite:
	```bibtex
	@article{vishwakarma2023enhancing,
	title={Enhancing Neural Theorem Proving through Data Augmentation and Dynamic Sampling Method},
	author={Vishwakarma, Rahul and Mishra, Subhankar},
	journal={arXiv preprint arXiv:2312.14188},
	year={2023},
	primaryClass={cs.AI}
	}