Commit
·
c201e40
1
Parent(s):
6746e8d
update
- README.md +10 -0
- german_credit_risk/code/raw/gcrisk.ipynb +1 -1
README.md
CHANGED
|
@@ -10,4 +10,14 @@ pinned: false
|
|
| 10 |
license: apache-2.0
|
| 11 |
---
|
| 12 |
|
| 13 |
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
|
|
|
|
| 10 |
license: apache-2.0
|
| 11 |
---
|
| 12 |
|
| 13 |
+
This is a case study on creditworthiness classification covering the whole process of XXX, from data cleansing to deploying the model in a simulated application for bank managers. To that end, I analyzed and prepared the dataset for machine learning models. The models applied are Logistic Regression, Decision Tree Classifier, and Random Forest Classifier, all available in Python's scikit-learn (sklearn) library. To streamline the workflow and improve model results, I used a custom pipeline for fitting the models and GridSearchCV for hyperparameter optimization. The app was built with Gradio.
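As a rough illustration of the pipeline plus GridSearchCV approach described above, here is a minimal sketch. It is not the notebook's actual code: the synthetic dataset, the `candidates` dictionary, the scaling step, and every grid value are assumptions chosen only to keep the example small and self-contained.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in data (assumption): the real project uses the German credit CSV.
X, y = make_classification(n_samples=300, n_features=8, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Hypothetical candidate models and parameter grids; the values are illustrative only.
# The "model__" prefix routes each parameter to the pipeline step named "model".
candidates = {
    "logistic_regression": (
        LogisticRegression(max_iter=1000),
        {"model__C": [0.1, 1.0, 10.0]},
    ),
    "decision_tree": (
        DecisionTreeClassifier(random_state=0),
        {"model__max_depth": [3, 5, None]},
    ),
    "random_forest": (
        RandomForestClassifier(random_state=0),
        {"model__n_estimators": [50, 100], "model__max_depth": [3, None]},
    ),
}

results = {}
for name, (estimator, grid) in candidates.items():
    # Same preprocessing for every model, so only the final estimator changes.
    pipeline = Pipeline([("scale", StandardScaler()), ("model", estimator)])
    search = GridSearchCV(pipeline, param_grid=grid, cv=3, scoring="accuracy")
    search.fit(X_train, y_train)
    results[name] = {
        "best_params": search.best_params_,
        "cv_score": search.best_score_,
        "test_score": search.score(X_test, y_test),
    }

for name, summary in results.items():
    print(name, summary)
```

Wrapping preprocessing and the estimator in one `Pipeline` means the scaler is refit inside each cross-validation fold, which avoids leaking validation data into the preprocessing step.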
|
| 14 |
+
|
| 15 |
+
The data is stored in `german_credit_risk/data/raw`; it was originally obtained from Kaggle and can be downloaded [HERE](https://www.kaggle.com/datasets/mpwolke/cusersmarildownloadsgermancsv).
|
| 16 |
+
|
| 17 |
+
I used previous work from Pennsylvania State University as a reference for many parts of the code; you can find it [HERE](https://online.stat.psu.edu/stat508/resource/analysis/gcd). Also, as this is a case study, the code steps are commented in English.
|
| 18 |
+
|
| 19 |
+
It is worth mentioning that the feature selection process was carefully performed according to the regulations of the Central Bank of Brazil, as I'm Brazilian.
|
| 20 |
+
|
| 21 |
+
---
|
| 22 |
+
|
| 23 |
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
|
german_credit_risk/code/raw/gcrisk.ipynb
CHANGED
|
@@ -49,7 +49,7 @@
|
|
| 49 |
"\n",
|
| 50 |
"The data was obtained from Kaggle and can be accessed [HERE](https://www.kaggle.com/datasets/mpwolke/cusersmarildownloadsgermancsv).\n",
|
| 51 |
"\n",
|
| 52 |
-
"I used a previous work from Pennsylvania State University as a reference in many parts of the code, which can be accessed[HERE](https://online.stat.psu.edu/stat508/resource/analysis/gcd). Also, as this is a case study, the code steps are commented.\n",
|
| 53 |
"\n",
|
| 54 |
"It is worth mentioning that the feature selection process was carefully performed according to the regulations of the Central Bank of Brazil, as I'm Brazilian."
|
| 55 |
]
|
|
|
|
| 49 |
"\n",
|
| 50 |
"The data was obtained from Kaggle and can be accessed [HERE](https://www.kaggle.com/datasets/mpwolke/cusersmarildownloadsgermancsv).\n",
|
| 51 |
"\n",
|
| 52 |
+
"I used a previous work from Pennsylvania State University as a reference in many parts of the code, which can be accessed [HERE](https://online.stat.psu.edu/stat508/resource/analysis/gcd). Also, as this is a case study, the code steps are commented.\n",
|
| 53 |
"\n",
|
| 54 |
"It is worth mentioning that the feature selection process was carefully performed according to the regulations of the Central Bank of Brazil, as I'm Brazilian."
|
| 55 |
]
|