Codex12 commited on
Commit
e57b3c9
·
verified ·
1 Parent(s): 6b22b15

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +76 -0
README.md ADDED
@@ -0,0 +1,76 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ library_name: pytorch
4
+ tags:
5
+ - scam-detection
6
+ - multi-modal
7
+ - audio-classification
8
+ - text-classification
9
+ - fusion
10
+ - MiniLM
11
+ - vosk
12
+ ---
13
+
14
+ # MultiModal Scam Detection — Models & Dataset
15
+
16
+ **Hugging Face asset repository** for the [MultiModal Scam Detection](https://github.com/Codexx121/MultiModal_Scam_Detct) project.
17
+
18
+ This repo contains **trained model checkpoints, cached features, embeddings, and test audio** — too large for GitHub.
19
+
20
+ ## Contents
21
+
22
+ | Asset | Size | Description |
23
+ |-------|------|-------------|
24
+ | `audio_features/` | ~3.3 GB | Pre-computed MFCC features (2407 `.pt` files) for audio encoder training |
25
+ | `detection_checkpoints/` | ~1.1 GB | Fine-tuned MiniLM text classifier checkpoints (4 checkpoints) |
26
+ | `fusion_embeddings/` | ~14 MB | Pre-extracted audio + text embeddings + fusion dataset (`.npz`) |
27
+ | `test_samples/` | ~22 MB | Sample WAV files for testing inference |
28
+
29
+ ## Download
30
+
31
+ ### Via Python
32
+
33
+ ```python
34
+ from huggingface_hub import snapshot_download
35
+
36
+ snapshot_download("Codex12/MultiModal_Scam_Models-Dataset", repo_type="model")
37
+ ```
38
+
39
+ ### Via CLI
40
+
41
+ ```bash
42
+ huggingface-cli download Codex12/MultiModal_Scam_Models-Dataset --repo-type model --local-dir ./assets
43
+ ```
44
+
45
+ ### Via Git LFS (advanced)
46
+
47
+ ```bash
48
+ git lfs install
49
+ git clone https://huggingface.co/Codex12/MultiModal_Scam_Models-Dataset
50
+ ```
51
+
52
+ ## Usage
53
+
54
+ ```python
55
+ from huggingface_hub import hf_hub_download
56
+ import torch
57
+
58
+ # Download a checkpoint
59
+ checkpoint = hf_hub_download(
60
+ "Codex12/MultiModal_Scam_Models-Dataset",
61
+ "detection_checkpoints/best_model/model.safetensors",
62
+ repo_type="model"
63
+ )
64
+
65
+ # Download audio features
66
+ feature_path = hf_hub_download(
67
+ "Codex12/MultiModal_Scam_Models-Dataset",
68
+ "audio_features/legitimate_00001.pt",
69
+ repo_type="model"
70
+ )
71
+ ```
72
+
73
+ ## Related
74
+
75
+ - **GitHub (code)**: [Codexx121/MultiModal_Scam_Detct](https://github.com/Codexx121/MultiModal_Scam_Detct)
76
+ - **Pipeline**: Audio → MFCC → Conv2D Encoder (128-D) + ASR → MiniLM (384-D) → Fusion MLP → SCAM/LEGITIMATE