yusuffarhan commited on
Commit
c61ea3c
·
verified ·
1 Parent(s): ace70fe

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +45 -0
README.md ADDED
@@ -0,0 +1,45 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ tags:
4
+ - mneme
5
+ - memory
6
+ - weight-injection
7
+ - qwen
8
+ ---
9
+
10
+ # Mneme: Neural Episodic Weight Injection Encoder
11
+
12
+ Trained encoder for the Mneme memory system - injects facts directly into LLM weights.
13
+
14
+ ## Usage
15
+
16
+ ```bash
17
+ # Clone the repo
18
+ git clone https://github.com/Yusuffarhan13/Mneme-v1-mvp.git
19
+ cd Mneme-v1-mvp
20
+
21
+ # Download the encoder
22
+ pip install huggingface_hub
23
+ python -c "from huggingface_hub import hf_hub_download; hf_hub_download(repo_id='yusuffarhan/qwen-memory', filename='best_encoder.pt', local_dir='mneme_trained')"
24
+
25
+ # Run
26
+ python qwen.py --encoder mneme_trained/best_encoder.pt
27
+ ```
28
+
29
+ ## Training Config
30
+
31
+ - **Delta rank**: 16
32
+ - **Target layers**: [4, 8, 12, 16, 20, 24]
33
+ - **Encoder**: 768 hidden, 4 layers
34
+ - **Base model**: Qwen/Qwen3-4B
35
+
36
+ ## What This Does
37
+
38
+ Injects facts directly INTO model weights (no RAG, no prompt injection):
39
+
40
+ ```
41
+ /remember My name is Yusuf
42
+ /remember I work at Google
43
+ What is my name? → "Your name is Yusuf"
44
+ Where do I work? → "You work at Google"
45
+ ```