---
tags:
- model_hub_mixin
- pytorch_model_hub_mixin
license: mit
datasets:
- Marcus2112/minipile_density-proportioned
language:
- en
---

This model has been pushed to the Hub using the [PyTorchModelHubMixin](https://huggingface.co/docs/huggingface_hub/package_reference/mixins#huggingface_hub.PyTorchModelHubMixin) integration.<br>
It is part of [github.com/MK2112/nn-zero-to-hero-notes](https://github.com/MK2112/nn-zero-to-hero-notes).<br>
Refer to [this file](https://github.com/MK2112/nn-zero-to-hero-notes/blob/main/N007%20-%20GPT%20From%20Scratch/N007_GPT_Solved_Exercise_Finetune.py) for the model implementation and training setup.