rahul3613
/

lean_byt5_aug

theorem-proving

Model card Files Files and versions

rahul3613 commited on 3 days ago

Commit

992f05b

·

verified ·

1 Parent(s): 6989033

Added paper details

Files changed (1) hide show

README.md +20 -2

README.md CHANGED Viewed

@@ -13,7 +13,6 @@ tags:
 ---
 # Lean-ByT5
 **Lean-ByT5** is a ByT5-small model fine-tuned on the augmented Lean-Mathlib data for **Lean theorem proving**.
 The model performs **sequence-to-sequence generation**, generating Lean tactics for a given
@@ -23,4 +22,23 @@ statement goal.
 - google/byt5-small (Apache License 2.0)
 ## Training
-- PyTorch Lightning

 ---
 # Lean-ByT5
 **Lean-ByT5** is a ByT5-small model fine-tuned on the augmented Lean-Mathlib data for **Lean theorem proving**.
 The model performs **sequence-to-sequence generation**, generating Lean tactics for a given
 - google/byt5-small (Apache License 2.0)
 ## Training
+- PyTorch Lightning
+## Related Paper & Citation
+For more details on the training methodology, data augmentation strategy, and dynamic sampling method used for Lean theorem proving, please refer to our paper:
+**Enhancing Neural Theorem Proving through Data Augmentation and Dynamic Sampling Method**
+Rahul Vishwakarma, Subhankar Mishra
+*arXiv:2312.14188 (cs.AI, cs.LG, cs.LO)*
+**Paper:** https://arxiv.org/abs/2312.14188
+If you use this model or build upon this work, please cite:
+```bibtex
+@article{vishwakarma2023enhancing,
+  title={Enhancing Neural Theorem Proving through Data Augmentation and Dynamic Sampling Method},
+  author={Vishwakarma, Rahul and Mishra, Subhankar},
+  journal={arXiv preprint arXiv:2312.14188},
+  year={2023},
+  primaryClass={cs.AI}
+}