tjadamsdfw commited on
Commit
7e94401
·
verified ·
1 Parent(s): e701038

Upload folder using huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +81 -3
README.md CHANGED
@@ -1,3 +1,81 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language: en
3
+ license: apache-2.0
4
+ tags:
5
+ - security
6
+ - secrets-detection
7
+ - code-analysis
8
+ - onnx
9
+ library_name: onnxruntime
10
+ pipeline_tag: text-classification
11
+ ---
12
+
13
+ # Argus Nano
14
+
15
+ A purpose-built SLM for secrets detection that runs entirely on your machine.
16
+
17
+ ## Model Description
18
+
19
+ - Fine-tuned from CodeBERT for binary classification: REAL_SECRET vs BENIGN
20
+ - Designed to distinguish real API keys, tokens, and credentials from benign lookalikes (test values, hashes, UUIDs, placeholders)
21
+ - Context-aware: considers surrounding code, not just the string itself
22
+
23
+ ## Intended Use
24
+
25
+ - Pre-commit hooks
26
+ - CI/CD pipeline scanning
27
+ - IDE integration
28
+ - Any tool that needs to detect leaked secrets in code
29
+
30
+ ## How to Use
31
+
32
+ ### Python
33
+
34
+ ```python
35
+ from argus_nano import Scanner
36
+
37
+ scanner = Scanner()
38
+ results = scanner.scan_file("config.yml")
39
+ ```
40
+
41
+ ### Rust
42
+
43
+ ```rust
44
+ let scanner = argus_nano::Scanner::new(Default::default())?;
45
+ let results = scanner.scan_file("config.yml")?;
46
+ ```
47
+
48
+ ### CLI
49
+
50
+ ```bash
51
+ argus-nano scan ./my-repo
52
+ ```
53
+
54
+ ## Performance
55
+
56
+ Tested against a 1,180-file corpus (480 real secrets across 37 providers, 700 benign files).
57
+
58
+ | Metric | Value |
59
+ |--------|-------|
60
+ | Precision | 100.0% |
61
+ | Recall | 99.8% |
62
+ | F1 Score | 99.9% |
63
+ | False Positive Rate | 0.0% |
64
+ | Inference Speed | 11.0s / 1000 files |
65
+ | Model Size (quantized) | ~125MB |
66
+
67
+ ## Supported Providers
68
+
69
+ See [patterns/providers/](https://github.com/arc-commander/argus-nano/tree/main/patterns/providers) for the full list.
70
+
71
+ ## Limitations
72
+
73
+ - Optimized for standard provider key formats; custom/proprietary formats may require adding patterns
74
+ - Binary classification only (v1); multi-class planned for v2
75
+ - Designed for source code context; may be less accurate on log files or unstructured text
76
+
77
+ ## License
78
+
79
+ Apache 2.0
80
+
81
+ ## Made by the [Arc Commander](https://github.com/arc-commander) team