vikp
/

instruct_rater

Text Classification

Model card Files Files and versions

instruct_rater / README.md

vikp's picture

Update README.md

6346687 over 2 years ago

|

950 Bytes

	---
	license: cc-by-4.0
	---

	This model judges if a given output is sufficient to recreate a given instruction.

	It's useful for filtering data to train a reverse instruct model. It could also have applications around determining if an output/instruction pair is linked, or around quality filtering data (data where the instruction can be recreated from the output might be higher quality).

	The model is a binary classifier trained on top of Python 410m with 100k examples for 1 epoch. The final validation loss is .35. You can see an example of a dataset filtered with this model [here](https://huggingface.co/datasets/vikp/reverse_instruct).

	To use it, pass in this prompt format:

	```
	Output

	{output}

	Instruction

	{instruction}
	```

	Output should be the output from a model, and instruction should be the instruction that generated the output. The model will return a 0-1 score indicating how effectively the instruction can be recreated.