Commit ·
5ec7191
1
Parent(s): 86cf4e3
Update README.md
Browse files
README.md
CHANGED
|
@@ -1,4 +1,4 @@
|
|
| 1 |
-
Small dummy deberta-v3-type Model useable for Unit/Integration tests. Suitable for CPU only machines, see [H2O LLM Studio](https://github.com/h2oai/h2o-llmstudio/blob/main/tests/integration/test_integration.py) for an example integration test.
|
| 2 |
|
| 3 |
Model was created as follows:
|
| 4 |
```python
|
|
|
|
| 1 |
+
Small dummy deberta-v3-type Reward Model useable for Unit/Integration tests for RLHF. Suitable for CPU only machines, see [H2O LLM Studio](https://github.com/h2oai/h2o-llmstudio/blob/main/tests/integration/test_integration.py) for an example integration test.
|
| 2 |
|
| 3 |
Model was created as follows:
|
| 4 |
```python
|