File size: 1,205 Bytes
a589665
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
992f05b
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
---
license: apache-2.0
language:
- en
base_model:
- google/byt5-small
tags:
- theorem-proving
- lean
- byt5
- mathematics
- code
---

# Lean-ByT5
**Lean-ByT5** is a ByT5-small model fine-tuned on the augmented Lean-Mathlib data for **Lean theorem proving**.

The model performs **sequence-to-sequence generation**, generating Lean tactics for a given
statement goal.

## Base Model
- google/byt5-small (Apache License 2.0)

## Training
- PyTorch Lightning

## Related Paper & Citation
For more details on the training methodology, data augmentation strategy, and dynamic sampling method used for Lean theorem proving, please refer to our paper:

**Enhancing Neural Theorem Proving through Data Augmentation and Dynamic Sampling Method**  
Rahul Vishwakarma, Subhankar Mishra  
*arXiv:2312.14188 (cs.AI, cs.LG, cs.LO)*

**Paper:** https://arxiv.org/abs/2312.14188

If you use this model or build upon this work, please cite:
```bibtex
@article{vishwakarma2023enhancing,
  title={Enhancing Neural Theorem Proving through Data Augmentation and Dynamic Sampling Method},
  author={Vishwakarma, Rahul and Mishra, Subhankar},
  journal={arXiv preprint arXiv:2312.14188},
  year={2023},
  primaryClass={cs.AI}
}