Update README.md

9ece304 verified 5 months ago

5.11 kB

	---
	library_name: transformers
	metrics:
	- accuracy
	- precision
	- recall
	- f1
	license: apache-2.0
	base_model:
	- google-bert/bert-base-uncased
	pipeline_tag: text-classification
	---

	# Model Card for Model ID

	<!-- Provide a quick summary of what the model is/does. -->
	Model Summary

	1. `bert-log-anomaly-detection` is a BERT-based NLP model fine-tuned for single SQL transaction log anomaly detection.

	2. The model classifies each database transaction log as either `Normal` or `Anomaly`, with the goal of supporting AI-powered fraud detection and cybersecurity monitoring systems.

	3. This model was developed as part of the _Samsung × KBTG Digital Fraud Cybersecurity Hackathon_ (Thailand) under the AI-Powered Fraud Detection & Prevention track.


	### Model Description

	<!-- Provide a longer summary of what this model is. -->
	This model analyzes individual SQL database transaction logs and detects abnormal patterns that may indicate fraudulent, malicious, or suspicious behavior.

	Demo: Hackathon prototype

	- Developed by: Aungruk Vanichanai, Napat Wanitwatthakorn, Thanakrit Sriphiphattana
	- Shared by: Aungruk Vanichanai
	- Model type: Transformer-based binary text classifier
	- Language(s) (NLP): English (SQL logs in text format)
	- License: Apache 2.0
	- Finetuned from model: google-bert/bert-base-uncased

	### Model Sources

	<!-- Provide the basic links for the model. -->

	- GitHub Repository: https://github.com/AungMoonLord/AI-Cybersecurity-Hackathon/tree/main/New%20Finetune%20Hackathon

	### How to Get Started with the Model

	<!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->

	## Step 1 (Setup)
	```python
	import torch
	from transformers import BertForSequenceClassification, BertTokenizer

	MODEL_PATH = "AungMoonLord/bert-log-anomaly-detection"

	model = BertForSequenceClassification.from_pretrained(MODEL_PATH)
	tokenizer = BertTokenizer.from_pretrained(MODEL_PATH)

	model.eval()
	```

	## Step 2 (Clean and Label Logs) — Optional, but may slightly improve accuracy, recall, and F1-score
	```python
	# Perfom log preprocessing
	def add_prefix_token(text): # log data must pass this code before training/inferencing
	# clean log
	text = text.replace("\t", " ")
	text = text.strip()
	# add token
	if text[0].isalpha() or text[3].isalpha():
	return "[SQL]\n" + text
	else:
	return "[LOG]\n" + text
	```

	## Step 3 (Create the Function for Log Classification)
	```python
	def predict_log(log_text):
	log_text = add_prefix_token(log_text)
	inputs = tokenizer(
	log_text,
	return_tensors="pt",
	truncation=True,
	padding=True, # for cases when the inference contains more than 1 log, i.e., batch size > 1
	max_length=128
	)

	with torch.no_grad():
	logits = model(**inputs).logits
	pred = torch.argmax(logits, dim=1).item()
	prob = torch.softmax(logits, dim=-1).tolist()[0]

	return "Normal" if pred == 1 else "Anomaly", prob
	```

	## Step 4 (Samples of Inferences)
	```python
	# Example 1
	text1 = "SELECT * FROM users WHERE id = 1 OR 1=1"
	print(predict_log(text1))

	# Example 2
	text2 = "2025-01-06 14:23:45 \| User: anonymous \| IP: 203.154.89.102 \| Duration: 0.05s SELECT * FROM users WHERE username = 'admin' OR '1'='1' -- ' AND password = 'x'"
	print(predict_log(text2))

	# Example 3
	text3 = "3051-06-22T07:20:02.296945Z 3 Query select e3mJKDCCY from 7Q8SpG8LLEWhrfpe4s5 where ph4d = 'a1S9hQa92uC1EAyJf2Y';"
	print(predict_log(text3))
	```

	### Application in Hackathon Project

	<!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->

	- Developed by Waris Sripatoomrak, this model integrates with an n8n workflow to automate fraud detection within financial transaction logs.


	### Out-of-Scope Use

	- Multi-log sequence anomaly detection

	- Non-textual anomaly detection



	## Training Data


	- SQL database transaction logs (1,611 samples) synthetically generated by ChatGPT, Qwen, DeepSeek, Grok, Gemini, and Claude

	- Each log labeled as either `Normal` or `Anomaly`

	- Data prepared for single-log classification


	## Evaluation

	#### Metrics

	##### - Training Set

	\| Metric \| Value \|
	\| --------------- \| ------ \|
	\| Accuracy \| 0.8950 \|
	\| Precision \| 0.8580 \|
	\| Recall \| 0.9026 \|
	\| F1-score \| 0.8797 \|
	\| Validation Loss \| 0.3279 \|


	##### - Test Set (Baseline — No Step 2 Preprocessing)
	\| Metric \| Value \|
	\| --------------- \| ------ \|
	\| Accuracy \| 0.6950 \|
	\| Precision \| 0.6639 \|
	\| Recall \| 0.7900 \|
	\| F1-score \| 0.7215 \|
	\| Validation Loss \| 0.6251 \|


	##### - Test Set (Full Pipeline — With Step 2 Preprocessing)
	\| Metric \| Value \|
	\| --------------- \| ------ \|
	\| Accuracy \| 0.7000 \|
	\| Precision \| 0.6613 \|
	\| Recall \| 0.8200 \|
	\| F1-score \| 0.7321 \|
	\| Validation Loss \| 0.6344 \|


	#### Summary

	The model demonstrates strong anomaly detection capability with high recall, making it suitable for fraud detection and cybersecurity use cases.