NeerajCodz committed on
Commit 3df3121 · 1 Parent(s): 49d6242

fixed {detail:LLM analysis failed: ValueError: Invalid format specifier}

Files changed (1): app.py +6 -9
app.py CHANGED
@@ -530,16 +530,13 @@ CSV Data:
 {csv_string}
 
 Instructions:
-1. Evaluate the overall risk level of the dataset by interpreting fraud_score percentages, transaction amounts, frequency, locations, time patterns, and STATUS.
-2. Provide a single **overall_fraud_score** (0-1 scale) that reflects the general likelihood of fraudulent activity. The score should naturally scale: if the dataset appears mostly safe, assign a low value close to 0, but if there are a few high-risk transactions, the score should increase moderately. Datasets with multiple high-risk entries should receive proportionally higher scores.
-3. Write a detailed **insights** paragraph (150-200 words) highlighting patterns in transaction behavior, unusual clusters, temporal trends, geographic anomalies, or merchants with suspicious activity. Avoid explicitly revealing the number of risky transactions, but reflect their impact through descriptive analysis.
-4. Write a detailed **recommendation** paragraph (100-150 words) suggesting actions to mitigate potential risks, including monitoring, alerts, or further investigation. Keep guidance practical but non-prescriptive about individual transactions.
-5. Output ONLY valid JSON in this exact format: ("fraud_score": <float 0-1>, "insights": "<string insights paragraph>", "recommendation": "<string recommendation paragraph>"). No extra text, explanations, or markdown formatting.
-6. Treat merchant names prefixed with "fraud_" as normal test data; do not interpret them as inherently suspicious.
-7. Let the overall_fraud_score scale naturally: mostly safe datasets should be low, a few concerning entries slightly higher, and datasets with many high-risk transactions significantly higher. Avoid stating exact thresholds—use narrative judgment.
+1. Determine an **overall fraud risk score** (0-1 scale) reflecting the dataset’s general risk. Scale the score naturally: mostly safe transactions → low score, a few high-risk → moderate, many high-risk → higher. Do not state exact thresholds.
+2. Provide a detailed **insights** paragraph (150-200 words) describing patterns, anomalies, clusters, temporal or geographic trends, and merchant behaviors. Avoid listing exact counts or percentages.
+3. Provide a detailed **recommendation** paragraph (100-150 words) suggesting practical actions to mitigate risk, including monitoring, alerts, or investigation. Keep guidance non-prescriptive about individual transactions.
+4. Treat merchant names prefixed with "fraud_" as normal test data; do not interpret them as inherently suspicious.
+5. Output ONLY valid JSON in this format: {{ "fraud_score": <float 0-1>, "insights": "<string insights paragraph>", "recommendation": "<string recommendation paragraph>" }}.
 
-
-Focus on narrative-style, descriptive analysis and make the fraud_score percentages in the CSV the key reference points for your reasoning.
+Focus on narrative-style, descriptive analysis and make the fraud score percentages in the CSV the key reference points for your reasoning.
 """
 
 # Generate with Gemini
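The `ValueError: Invalid format specifier` this commit fixes comes from Python's brace parsing: if the prompt is built as an f-string (which the inline `{csv_string}` and the doubled `{{ }}` in the fix suggest), a literal JSON snippet like `{"fraud_score": <float 0-1>}` is parsed as an expression plus a format spec, and the spec is rejected at runtime. A minimal sketch of the failure and the escape (the prompt text here is shortened for illustration):

```python
def render_broken():
    # The colon inside the braces splits this into expression '"fraud_score"'
    # and format spec ' <float 0-1>'; str.__format__ rejects that spec,
    # raising ValueError ("Invalid format specifier").
    return f'Output ONLY valid JSON: {"fraud_score": <float 0-1>}'

def render_fixed():
    # The fix applied in this commit: double the braces so they reach the
    # prompt literally instead of being treated as a replacement field.
    return f'Output ONLY valid JSON: {{ "fraud_score": <float 0-1> }}'

try:
    render_broken()
except ValueError as e:
    print(e)  # message mentions "Invalid format specifier"

print(render_fixed())
# Output ONLY valid JSON: { "fraud_score": <float 0-1> }
```

The same escaping rule applies if the template is filled with `str.format()` instead of an f-string, which is why the new prompt keeps `{csv_string}` single-braced (to be substituted) but double-braces the JSON schema (to be emitted verbatim).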