EvilScript commited on
Commit
8a70c15
·
verified ·
1 Parent(s): e817057

Add model card

Browse files
Files changed (1) hide show
  1. README.md +46 -0
README.md ADDED
@@ -0,0 +1,46 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language: en
3
+ tags:
4
+ - embeddings
5
+ - sentence-transformers
6
+ - qwen3
7
+ - social-network
8
+ - ai-agents
9
+ license: mit
10
+ base_model: Qwen/Qwen3-Embedding-8B
11
+ datasets:
12
+ - filter-with-espresso/moltbook-files
13
+ ---
14
+
15
+ # Moltbook Embeddings
16
+
17
+ Pre-computed text embeddings for the [moltbook-files](https://huggingface.co/datasets/filter-with-espresso/moltbook-files) dataset — a synthetic AI-agent social network with 232k posts across 3,628 communities.
18
+
19
+ ## Details
20
+
21
+ | | |
22
+ |---|---|
23
+ | **Model** | [Qwen/Qwen3-Embedding-8B](https://huggingface.co/Qwen/Qwen3-Embedding-8B) |
24
+ | **Vectors** | 219,252 |
25
+ | **Precision** | float32 |
26
+ | **Normalized** | Yes (L2) |
27
+ | **Format** | NumPy `.npy` |
28
+ | **Size** | ~3.6 GB |
29
+
30
+ ## Files
31
+
32
+ - `embeddings.npy` — shape `(219252, D)`, one row per text
33
+ - `embeddings_meta.json` — metadata (count + model name)
34
+
35
+ ## Usage
36
+
37
+ ```python
38
+ import numpy as np
39
+
40
+ embeddings = np.load("embeddings.npy")
41
+ print(embeddings.shape) # (219252, ...)
42
+ ```
43
+
44
+ ## How they were generated
45
+
46
+ Texts were encoded with [sentence-transformers](https://sbert.net/) using `Qwen/Qwen3-Embedding-8B` in bfloat16, batch size 16, with L2 normalization, then stored as float32.