rahul3613 commited on
Commit
992f05b
·
verified ·
1 Parent(s): 6989033

Added paper details

Browse files
Files changed (1) hide show
  1. README.md +20 -2
README.md CHANGED
@@ -13,7 +13,6 @@ tags:
13
  ---
14
 
15
  # Lean-ByT5
16
-
17
  **Lean-ByT5** is a ByT5-small model fine-tuned on the augmented Lean-Mathlib data for **Lean theorem proving**.
18
 
19
  The model performs **sequence-to-sequence generation**, generating Lean tactics for a given
@@ -23,4 +22,23 @@ statement goal.
23
  - google/byt5-small (Apache License 2.0)
24
 
25
  ## Training
26
- - PyTorch Lightning
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
13
  ---
14
 
15
  # Lean-ByT5
 
16
  **Lean-ByT5** is a ByT5-small model fine-tuned on the augmented Lean-Mathlib data for **Lean theorem proving**.
17
 
18
  The model performs **sequence-to-sequence generation**, generating Lean tactics for a given
 
22
  - google/byt5-small (Apache License 2.0)
23
 
24
  ## Training
25
+ - PyTorch Lightning
26
+
27
+ ## Related Paper & Citation
28
+ For more details on the training methodology, data augmentation strategy, and dynamic sampling method used for Lean theorem proving, please refer to our paper:
29
+
30
+ **Enhancing Neural Theorem Proving through Data Augmentation and Dynamic Sampling Method**
31
+ Rahul Vishwakarma, Subhankar Mishra
32
+ *arXiv:2312.14188 (cs.AI, cs.LG, cs.LO)*
33
+
34
+ **Paper:** https://arxiv.org/abs/2312.14188
35
+
36
+ If you use this model or build upon this work, please cite:
37
+ ```bibtex
38
+ @article{vishwakarma2023enhancing,
39
+ title={Enhancing Neural Theorem Proving through Data Augmentation and Dynamic Sampling Method},
40
+ author={Vishwakarma, Rahul and Mishra, Subhankar},
41
+ journal={arXiv preprint arXiv:2312.14188},
42
+ year={2023},
43
+ primaryClass={cs.AI}
44
+ }