Update README.md
README.md
CHANGED
@@ -14,39 +14,3 @@ short_description: Medical datasets for healthcare model training.
 
 This Medical dataset is crafted as a versatile resource for enthusiasts of data science, machine learning, and data analysis. It replicates the characteristics of real-world healthcare data, offering users a platform to practice, refine, and showcase their data manipulation and analytical skills within the healthcare domain.
 
-## **Potential Uses**
-- Building and testing predictive models specific to healthcare.
-- Practicing techniques for data cleaning, transformation, and analysis.
-- Designing visualizations to uncover insights into healthcare trends.
-- Learning and teaching data science and machine learning concepts in a healthcare setting.
-
-## **Acknowledgments**
-- This dataset is entirely synthetic, created with a focus on respecting healthcare data privacy and security. It contains no real patient information and complies with privacy regulations.
-- The goal is to support advancements in data science and healthcare analytics while inspiring innovative ideas.
-
-## **Dataset Contents**
-
-### **Evaluation Medical Instruction Datasets**
-Contains datasets used for evaluating medical instruction models:
-- `evaluation-medical-instruction-dataset.json`
-- `medmcqa-train-instruction-dataset.json`
-- `medial-train-instruction-dataset.json`
-- `pubmedqa-train-instruction-train.json`
-
-### **General Medical Instruction Datasets**
-Contains general medical instruction datasets:
-- `general-medical-instruction-dataset.json`
-- `GenMedGPT-5k.json`
-- `HealthCareMagic-100k.json`
-- `medical_meadow_wikidoc_medical_flashcards.json`
-- `medical_meadow_wikidoc_patient_info.json`
-- `medicationqa.json`
-
-### **Medical Preference Data**
-- `medical-preference-data.json`: Contains data related to medical preferences.
-
-### **Medical Pretraining Datasets**
-Contains datasets used for pretraining medical models.
-
-## **Usage**
-These datasets can be used to train and evaluate models for medical question answering and other related tasks. Each dataset is provided in **JSON** and **TXT** formats and contains various medical instructions, inputs, and outputs.
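The Usage paragraph in the removed README text says each dataset contains medical instructions, inputs, and outputs. A minimal sketch of reading one such JSON file is given here for orientation. It assumes, without confirmation from the README, that a file is a JSON array of objects with string fields named `instruction`, `input`, and `output`. The helper names `load_instruction_records`, `format_prompt`, and `demo`, and the sample file name, are hypothetical.

```python
import json
import tempfile
from pathlib import Path

# Assumption (not confirmed by the README): every record is an object
# carrying these three string fields.
REQUIRED_KEYS = ("instruction", "input", "output")


def load_instruction_records(path):
    """Read a JSON array and keep only records that have all required keys."""
    with open(path, encoding="utf-8") as fh:
        data = json.load(fh)
    if not isinstance(data, list):
        raise ValueError("expected a JSON array of records")
    return [
        record
        for record in data
        if isinstance(record, dict) and all(key in record for key in REQUIRED_KEYS)
    ]


def format_prompt(record):
    """Join the instruction with its input; the input may be empty."""
    instruction = record["instruction"].strip()
    extra = record["input"].strip()
    return f"{instruction}\n\n{extra}" if extra else instruction


def demo():
    """Write a tiny placeholder file and load it, so the sketch runs standalone."""
    sample = [
        {
            "instruction": "Summarize the note.",
            "input": "Placeholder clinical note.",
            "output": "Placeholder summary.",
        },
        {"instruction": "Record with missing fields"},  # dropped by the loader
    ]
    with tempfile.TemporaryDirectory() as tmp:
        path = Path(tmp) / "sample-instruction-dataset.json"  # hypothetical name
        path.write_text(json.dumps(sample), encoding="utf-8")
        return load_instruction_records(path)
```

To try it against a real file from the listing, pass that file's path to `load_instruction_records`. If the actual schema differs from the assumed one, adjust `REQUIRED_KEYS` accordingly.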