marcilioduarte commited on
Commit
c201e40
·
1 Parent(s): 6746e8d
README.md CHANGED
@@ -10,4 +10,14 @@ pinned: false
10
  license: apache-2.0
11
  ---
12
 
 
 
 
 
 
 
 
 
 
 
13
  Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
10
  license: apache-2.0
11
  ---
12
 
13
+ This is a case study about creditworthiness classification where I did the whole process of XXX, from the data cleansing part until the deployment of the model in an application simulated for the banks managers. To achieve this goal, I analyzed and prepared the dataset for machine learning models. The applied models are: Logistic Regression, Decision Tree Classifier, and Random Forest Classifier, which are available in Python's sklearn library. To optimize the workflow and model results, I applied a personalized pipeline for model application and GridSearchCV for parameter optimization. The app development was made using gradio app.
14
+
15
+ The data is uploaded in path: german_credit_risk/data/raw, but it was first obtained from Kaggle and can be obtained [HERE](https://www.kaggle.com/datasets/mpwolke/cusersmarildownloadsgermancsv).
16
+
17
+ I used a previous work from Pennsylvania State University as a reference in many parts of the code, you can find it [HERE](https://online.stat.psu.edu/stat508/resource/analysis/gcd). Also, as this is a case study, the code steps are commented in english.
18
+
19
+ It is worth mentioning that the feature selection process was carefully performed according to the regulations of the Central Bank of Brazil, as I'm Brazilian.
20
+
21
+ ---
22
+
23
  Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
german_credit_risk/code/raw/gcrisk.ipynb CHANGED
@@ -49,7 +49,7 @@
49
  "\n",
50
  "The data was obtained from Kaggle and can be accessed [HERE](https://www.kaggle.com/datasets/mpwolke/cusersmarildownloadsgermancsv).\n",
51
  "\n",
52
- "I used a previous work from Pennsylvania State University as a reference in many parts of the code, which can be accessed[HERE](https://online.stat.psu.edu/stat508/resource/analysis/gcd). Also, as this is a case study, the code steps are commented.\n",
53
  "\n",
54
  "It is worth mentioning that the feature selection process was carefully performed according to the regulations of the Central Bank of Brazil, as I'm Brazilian."
55
  ]
 
49
  "\n",
50
  "The data was obtained from Kaggle and can be accessed [HERE](https://www.kaggle.com/datasets/mpwolke/cusersmarildownloadsgermancsv).\n",
51
  "\n",
52
+ "I used a previous work from Pennsylvania State University as a reference in many parts of the code, which can be accessed [HERE](https://online.stat.psu.edu/stat508/resource/analysis/gcd). Also, as this is a case study, the code steps are commented.\n",
53
  "\n",
54
  "It is worth mentioning that the feature selection process was carefully performed according to the regulations of the Central Bank of Brazil, as I'm Brazilian."
55
  ]