Bopalv committed on
Commit dfa5826 · verified · 1 Parent(s): dff2191

Upload DPO-Training/sample_preference_data.jsonl with huggingface_hub

DPO-Training/sample_preference_data.jsonl ADDED
@@ -0,0 +1,5 @@
+ {"prompt": "What is the capital of France?", "chosen": "The capital of France is Paris.", "rejected": "I don't know."}
+ {"prompt": "Explain quantum computing in simple terms.", "chosen": "Quantum computing uses quantum bits (qubits) that can exist in multiple states at once, allowing it to solve certain problems much faster than classical computers.", "rejected": "Quantum computing is complicated and I can't explain it."}
+ {"prompt": "How do I make a cup of tea?", "chosen": "To make a cup of tea: 1. Boil water 2. Add tea bag or loose tea to cup 3. Pour hot water over tea 4. Steep for 3-5 minutes 5. Remove tea bag and enjoy!", "rejected": "Just add water to tea."}
+ {"prompt": "What is machine learning?", "chosen": "Machine learning is a type of artificial intelligence where computers learn from data to make predictions or decisions without being explicitly programmed for each task.", "rejected": "Machine learning is when computers learn."}
+ {"prompt": "What are the benefits of exercise?", "chosen": "Regular exercise offers many benefits: improves cardiovascular health, strengthens muscles, helps maintain healthy weight, boosts mood, reduces stress, and improves sleep quality.", "rejected": "Exercise is good for you."}
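
Each added line is one JSON object with three string fields: "prompt", "chosen", and "rejected". A minimal sketch of how such records could be parsed and sanity-checked before training follows. The helper name parse_preference_lines and its validation rules are assumptions made for illustration; they are not part of this commit or of any library. The two inline records are copied from the file above.

```python
import json

# Field names taken from the records in sample_preference_data.jsonl.
REQUIRED_KEYS = ("prompt", "chosen", "rejected")


def parse_preference_lines(lines):
    """Parse JSONL lines into validated preference records (hypothetical helper)."""
    records = []
    for line_no, line in enumerate(lines, start=1):
        line = line.strip()
        if not line:
            continue  # tolerate blank lines between records
        record = json.loads(line)
        for key in REQUIRED_KEYS:
            value = record.get(key)
            if not isinstance(value, str) or not value.strip():
                raise ValueError(f"line {line_no}: missing or empty field {key!r}")
        # Assumed rule: a pair whose two responses match carries no preference.
        if record["chosen"] == record["rejected"]:
            raise ValueError(f"line {line_no}: chosen and rejected are identical")
        records.append(record)
    return records


# Two records copied from the dataset, used here as inline sample input.
SAMPLE = """\
{"prompt": "What is the capital of France?", "chosen": "The capital of France is Paris.", "rejected": "I don't know."}
{"prompt": "What is machine learning?", "chosen": "Machine learning is a type of artificial intelligence where computers learn from data to make predictions or decisions without being explicitly programmed for each task.", "rejected": "Machine learning is when computers learn."}
""".splitlines()

records = parse_preference_lines(SAMPLE)
print(len(records), "valid preference pairs")
```

To check the full file instead of the inline sample, pass an open file handle, for example parse_preference_lines(open("DPO-Training/sample_preference_data.jsonl", encoding="utf-8")).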