2nji
/

makebelieve

Text Generation

text-generation-inference

Model card Files Files and versions

makebelieve / README.md

2nji's picture

specify cluster on g5k

ee90e72 about 2 years ago

|

history blame contribute delete

1.11 kB

	---
	license: apache-2.0
	---
	# Finetuning llama 2 on our celebrity news dataset located [here](https://huggingface.co/datasets/2nji/makebelieve-480)

	Disclaimer: This is still work in progress as we need to preprocess our celebrity news dataset to match Llama 2's prompt format as described [here](https://huggingface.co/blog/llama2#how-to-prompt-llama-2)

	## Reserve GPU on g5k

	Log into your Grid5000 account using ssh and run the following code in the terminal

	```script
	oarsub -p "cluster='graffiti'" -l gpu=1 -I -q production
	```

	Wait till GPUs are available and assigned to you, if you need more information about g5k, you can refer to [here](https://www.grid5000.fr/w/Getting_Started)

	### Create a virtual environment

	- Installing PIP

	```script
	pip install virtualenv
	```

	- Creating environment

	```script
	virtualenv venv
	```

	- Activating environment

	```script
	source venv/bin/activate
	```

	## Install requirements file

	```script
	pip install -r requirements.txt
	```

	## Running the script to finetune Llama-2-7b-chat-hf and push to huggingface model repository

	```script
	python makebelieve.py
	```