Spaces: JAYASREESS/duckdb

JAYASREESS committed on Jan 28

Commit 794fb5d (verified) · 1 Parent(s): b531b77

Upload 2 files

Files changed (2)

README.md ADDED

+---
+title: Credit Card Fraud Detection with DuckDB
+emoji: 💳
+colorFrom: blue
+colorTo: purple
+sdk: gradio
+sdk_version: "4.44.0"
+python_version: "3.10"
+app_file: app.py
+pinned: false
+---
+# Credit Card Fraud Detection with DuckDB and Medallion Architecture
+This project demonstrates an end-to-end pipeline for credit card fraud detection. It uses DuckDB to process data in a Medallion Architecture (Bronze, Silver, Gold) and trains a Random Forest model to identify fraudulent transactions.
+## Project Structure
+- `data/`: Contains the raw CSV datasets (`fraudTrain.csv`, `fraudTest.csv`).
+- `src/`: Contains the Python scripts for the data pipeline and model training.
+  - `bronze.py`: Ingests raw data into the bronze layer.
+  - `silver.py`: Cleans and transforms data for the silver layer.
+  - `gold.py`: Creates aggregated features for the gold (analytics) layer.
+  - `train.py`: Trains a `RandomForestClassifier` on the gold data and saves the model.
+- `models/`: Directory where the trained model is saved.
+- `requirements.txt`: Lists the required Python packages.
+## How to Run
+1. **Install dependencies:**
+```bash
+pip install -r requirements.txt
+```

app.py ADDED

+import gradio as gr
+def status():
+    return "Credit Card Fraud Detection Pipeline is ready. Run training using src/train.py"
+gr.Interface(fn=status, inputs=[], outputs="text").launch()