nikhil07prakash/float-7b

Tags: Text Generation, text-generation-inference

nikhil07prakash committed on Feb 2, 2024

Commit 06d94e1 · verified · 1 parent: faa4824

Update README.md

Updated model card details.

Files changed (1)

README.md +41 -0


@@ -1,3 +1,44 @@
 ---
 license: mit
 ---

+# Model Card for float-7b
+<!-- Provide a quick summary of what the model is/does. -->
+This model is a fine-tuned version of [Llama-7B](https://huggingface.co/huggyllama/llama-7b), trained on synthetically generated arithmetic tasks with standard fine-tuning. It was introduced in the paper [Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking](https://openreview.net/forum?id=8sKcAWOf2D). It is very similar to [Goat-7B](https://github.com/liutiedong/goat), except that it was fine-tuned without LoRA.
+## Model Details
+### Model Description
+<!-- Provide a longer summary of what this model is. -->
+- **Developed by:** [Nikhil Prakash](https://nix07.github.io/)
+- **Model type:** Autoregressive Decoder-only Language Model
+- **License:** MIT License
+- **Fine-tuned from model:** [Llama-7B](https://huggingface.co/huggyllama/llama-7b)
+### Model Sources
+<!-- Provide the basic links for the model. -->
+- **Repository:** TODO
+- **Paper:** [Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking](https://openreview.net/forum?id=8sKcAWOf2D)
+## How to Get Started with the Model
+Use the code below to get started with the model.
+```python
+from transformers import AutoModelForCausalLM
+model = AutoModelForCausalLM.from_pretrained("nikhil07prakash/float-7b")  # keeps the LM head needed for text generation
+```
+## Citation
+<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
+**BibTeX:**
+TODO
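
The "How to Get Started" snippet in the model card loads the weights but stops short of generating text. The following is a minimal sketch of a full round trip, not the author's documented usage. It assumes that the repository ships tokenizer files (if it does not, the base model's tokenizer ID could be passed as `tokenizer_id` instead) and that a plain `a + b =` prompt is a reasonable input; the helper names `build_prompt` and `generate_answer` and the prompt format are hypothetical and do not come from the model card or the paper.

```python
def build_prompt(a, b, op="+"):
    """Format an arithmetic question as plain text.

    The "a op b =" layout is an assumption made for illustration;
    the model card does not document the prompt format used in training.
    """
    return f"{a} {op} {b} ="


def generate_answer(prompt, model_id="nikhil07prakash/float-7b",
                    tokenizer_id=None, max_new_tokens=16):
    """Greedy-decode a continuation of `prompt` and return only the new text."""
    # Imported lazily so build_prompt stays usable without the heavy dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Assumption: the tokenizer lives in the same repository as the weights.
    tokenizer = AutoTokenizer.from_pretrained(tokenizer_id or model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)
    model.eval()

    inputs = tokenizer(prompt, return_tensors="pt")
    with torch.no_grad():
        output_ids = model.generate(
            **inputs,
            max_new_tokens=max_new_tokens,
            do_sample=False,  # greedy decoding for deterministic arithmetic output
        )

    # Drop the prompt tokens so only the generated continuation is decoded.
    new_tokens = output_ids[0, inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)


# Example call (downloads the full 7B checkpoint on first use):
# print(generate_answer(build_prompt(123, 456)))
```

Greedy decoding is chosen here because arithmetic has a single correct answer, so sampling would only add noise; loading a 7B-parameter model at default precision needs substantial memory.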