dmgcsilva commited on
Commit
ac0f1db
verified
1 Parent(s): 79041a8

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +19 -3
README.md CHANGED
@@ -4,7 +4,23 @@ language:
4
  - en
5
  ---
6
 
7
- This is under construction.
8
 
9
- This is PlanLLM, a model trained to, given a recipe or a DIY task, help the user complete it in a conversational setting, answering questions and clarifying any doubts.
10
- This was trained based on a Vicuna with SFT and DPO training.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
4
  - en
5
  ---
6
 
7
+ # PlanLLM
8
 
9
+ ### Model Details
10
+
11
+ PlanLLM is a conversational assistant trained to assist users in completing a recipe from beginning to end and be able to answer any related or relevant requests that the user might have.
12
+ The model was also tested with DIY Tasks and performed similarly.
13
+
14
+ #### Training
15
+
16
+ PlanLLM was trained by fine-tuning a [Vicuna](https://huggingface.co/lmsys/vicuna-7b-v1.1) model on synthetic dialogue between users and an assistant about a given recipe.
17
+ The model was first trained using SFT and then using Direct Preference Optimization (DPO).
18
+
19
+ #### License
20
+
21
+ It's the same as Vicuna. A non-commercial Apache 2.0 license.
22
+
23
+ #### Paper
24
+
25
+ "Plan-Grounded Large Language Models for Dual Goal Conversational Settings" (Accepted at EACL 2024)
26
+ Diogo Gl贸ria-Silva, Rafael Ferreira, Diogo Tavares, David Semedo, Jo茫o Magalh茫es