saifhmb
/

fraud-detection-model

Tabular Classification

Model card Files Files and versions

saifhmb commited on Jun 27, 2024

Commit

4c58b40

·

verified ·

1 Parent(s): 4a137c5

Update README.md

Files changed (1) hide show

README.md +13 -25

README.md CHANGED Viewed

@@ -4,6 +4,7 @@ tags:
 - sklearn
 - skops
 - tabular-classification
 model_format: pickle
 model_file: skops-ise057qg.pkl
 widget:
@@ -27,16 +28,22 @@ widget:
 ---
 # Model description
-[More Information Needed]
 ## Intended uses & limitations
-[More Information Needed]
 ## Training Procedure
-[More Information Needed]
 ### Hyperparameters
@@ -97,26 +104,7 @@ widget:
 ![Confusion Matrix](confusion_matrix.png)
-# How to Get Started with the Model
-[More Information Needed]
 # Model Card Authors
-This model card is written by following authors:
-[More Information Needed]
-# Model Card Contact
-You can contact the model card authors through following channels:
-[More Information Needed]
-# Citation
-Below you can find information related to citation.
-**BibTeX:**
-```
-[More Information Needed]
-```

 - sklearn
 - skops
 - tabular-classification
+- finance
 model_format: pickle
 model_file: skops-ise057qg.pkl
 widget:
 ---
 # Model description
+This is a Gaussian Naive Bayes model trained on a synthetic dataset, containining a large variety of transaction types representing normal activities as well as
+abnormal/fraudulent activities generated by J.P. Morgan AI Research. The model predicts whether a transaction is normal or fraudulent.
 ## Intended uses & limitations
+Terms of use for the J.P. Morgan AI Research synthetic dataset limits sharing of the dataset
 ## Training Procedure
+The data preprocessing steps applied include the following:
+- Dropping high cardinality features or no variance features. This include Transaction ID, Sender ID, Sender Account, Beneficiary ID, Beneficiary Account, Sender LOB, Sender Sector and Time
+- Transforming and Encoding categorical features namely: Sender Country, Beneficiary Country, Transaction Type, and the target variable, Label
+- Applying feature scaling on all features
+- Splitting the dataset into training/test set using 85/15 split ratio
+- Handling imbalanced dataset using imblearn framework and applying RandomUnderSampler method to eliminate noise
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/6662300a0ad8c45a1ce59190/BEi0CfOfJ2ytxD5VoN4IM.png)
 ### Hyperparameters
 ![Confusion Matrix](confusion_matrix.png)
 # Model Card Authors
+This model card is written by following authors: Seifullah Bello