KAKA22
/

CodeRM-8B

+---
+license: llama3.1
+datasets:
+- KAKA22/CodeRM-UnitTest
+language:
+- en
+base_model:
+- meta-llama/Llama-3.1-8B-Instruct
+pipeline_tag: text-generation
+tags:
+- code
+- llama
+---
+# Model Description
+CodeRM-8B is a small yet powerful model designed to enable efficient and high-quality unit test generation.
+It is trained based on Llama3.1-8B-Instruct on a dataset of 60k high-quality synthetic Python unit tests.
+These unit tests are derived from two well-regarded code instruction tuning datasets:
+[CodeFeedback-Filtered-Instruction](https://huggingface.co/datasets/m-a-p/CodeFeedback-Filtered-Instruction) and the
+training set of [TACO](https://huggingface.co/datasets/BAAI/TACO).
+The training dataset used for unit test generation is openly available under
+[CodeRM-UnitTest](https://huggingface.co/datasets/KAKA22/CodeRM-UnitTest).
+For further information and details of training, refer to our paper:
+"Dynamic Scaling of Unit Tests for Code Reward Modeling" available on arXiv.
+# Prompt Format
+```
+Below is a question and it's corresponding code answer. Please write test cases to check the correctness of the code answer. You need to use the unittest library in Python and create a test class for testing.
+### question
+{question}
+### code solution
+{code in function format}
+Please add detailed comments to the test cases you write. You do not need to test the function's ability to throw exceptions.
+```