Sentence Similarity
sentence-transformers
Joblib
Safetensors
modernbert
security
intrusion-detection
behavior-analytics
intent-recognition
linux
kubernetes
audit-log
text-embeddings-inference
Instructions to use willchen0011/SecEBL with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- sentence-transformers
How to use willchen0011/SecEBL with sentence-transformers:
from sentence_transformers import SentenceTransformer model = SentenceTransformer("willchen0011/SecEBL") sentences = [ "That is a happy person", "That is a happy dog", "That is a very happy person", "Today is a sunny day" ] embeddings = model.encode(sentences) similarities = model.similarity(embeddings, embeddings) print(similarities.shape) # [4, 4] - Notebooks
- Google Colab
- Kaggle
File size: 1,085 Bytes
607b6c0 b883fb0 607b6c0 b883fb0 607b6c0 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 | # SecEBL Public Final Benchmark Examples
This directory contains publicly releasable example data for verifying the
release code path: a subset of the internal Linux final benchmark plus
normalized Kubernetes AuditLog examples.
**The full internal benchmark, training corpora, private pressure-stream rows,
and removed sensitive benchmark sessions are not distributed in this GitHub
repository.**
The `*_gold.rev20.jsonl` filenames are kept for compatibility. In this release,
they are the expected Rev20 behavior-tag labels for this public benchmark
subset.
- `linux/example_sessions.jsonl`: 10,520 Linux command-session rows from 531
public subset sessions.
- `linux/example_gold.rev20.jsonl`: matching Rev20 behavior tags for the same
10,520 rows.
- `k8s/example_sessions.jsonl`: 144 normalized Kubernetes AuditLog rows across
46 sessions.
- `k8s/example_gold.rev20.jsonl`: matching Rev20 behavior tags for the same 144
K8s rows, with 163 behavior-label instances and 27 unique behavior tags.
Session-level labels use English enums: `normal_operation` and `intrusion`.
|