CatoG commited on
Commit
81a070f
Β·
unverified Β·
1 Parent(s): 425fb1f

Update README with detailed application description

Browse files
Files changed (1) hide show
  1. README.md +6 -7
README.md CHANGED
@@ -1,10 +1,3 @@
1
- A test / demo application playground for DPO Preference Tuning on different LLM models.
2
- Running on Huggingspace:
3
- https://huggingface.co/spaces/CatoG/DPO_Demo
4
-
5
- Allows for LLM model selection, preference tuning of LLM responses, model response tuning with LoRA and Direct Preference Optimization (DPO).
6
- Tuned model / policies can be downloaded for further use.
7
-
8
  ---
9
  title: DPO Demo
10
  emoji: πŸ“š
@@ -16,5 +9,11 @@ app_file: app.py
16
  pinned: false
17
  short_description: Testing DPO for finetuning models
18
  ---
 
 
 
 
 
 
19
 
20
  Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
 
 
 
 
 
 
 
1
  ---
2
  title: DPO Demo
3
  emoji: πŸ“š
 
9
  pinned: false
10
  short_description: Testing DPO for finetuning models
11
  ---
12
+ A test / demo application playground for DPO Preference Tuning on different LLM models.
13
+ Running on Huggingspace:
14
+ https://huggingface.co/spaces/CatoG/DPO_Demo
15
+
16
+ Allows for LLM model selection, preference tuning of LLM responses, model response tuning with LoRA and Direct Preference Optimization (DPO).
17
+ Tuned model / policies can be downloaded for further use.
18
 
19
  Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference