Rulga commited on
Commit
0edccd1
·
2 Parent(s): 0f93e9d 0d2d420

Resolve merge conflicts with Hugging Face dataset

Browse files
Files changed (4) hide show
  1. .gitattributes +30 -0
  2. README.md +40 -0
  3. chat_history/.gitkeep +0 -0
  4. vector_store/.gitkeep +0 -0
.gitattributes CHANGED
@@ -8,6 +8,11 @@
8
  *.h5 filter=lfs diff=lfs merge=lfs -text
9
  *.joblib filter=lfs diff=lfs merge=lfs -text
10
  *.lfs.* filter=lfs diff=lfs merge=lfs -text
 
 
 
 
 
11
  *.mlmodel filter=lfs diff=lfs merge=lfs -text
12
  *.model filter=lfs diff=lfs merge=lfs -text
13
  *.msgpack filter=lfs diff=lfs merge=lfs -text
@@ -33,3 +38,28 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
8
  *.h5 filter=lfs diff=lfs merge=lfs -text
9
  *.joblib filter=lfs diff=lfs merge=lfs -text
10
  *.lfs.* filter=lfs diff=lfs merge=lfs -text
11
+ <<<<<<< HEAD
12
+ =======
13
+ *.lz4 filter=lfs diff=lfs merge=lfs -text
14
+ *.mds filter=lfs diff=lfs merge=lfs -text
15
+ >>>>>>> 0d2d42071b65c9b49ea7e471b711fe3cdf9fb532
16
  *.mlmodel filter=lfs diff=lfs merge=lfs -text
17
  *.model filter=lfs diff=lfs merge=lfs -text
18
  *.msgpack filter=lfs diff=lfs merge=lfs -text
 
38
  *.zip filter=lfs diff=lfs merge=lfs -text
39
  *.zst filter=lfs diff=lfs merge=lfs -text
40
  *tfevents* filter=lfs diff=lfs merge=lfs -text
41
+ <<<<<<< HEAD
42
+ =======
43
+ # Audio files - uncompressed
44
+ *.pcm filter=lfs diff=lfs merge=lfs -text
45
+ *.sam filter=lfs diff=lfs merge=lfs -text
46
+ *.raw filter=lfs diff=lfs merge=lfs -text
47
+ # Audio files - compressed
48
+ *.aac filter=lfs diff=lfs merge=lfs -text
49
+ *.flac filter=lfs diff=lfs merge=lfs -text
50
+ *.mp3 filter=lfs diff=lfs merge=lfs -text
51
+ *.ogg filter=lfs diff=lfs merge=lfs -text
52
+ *.wav filter=lfs diff=lfs merge=lfs -text
53
+ # Image files - uncompressed
54
+ *.bmp filter=lfs diff=lfs merge=lfs -text
55
+ *.gif filter=lfs diff=lfs merge=lfs -text
56
+ *.png filter=lfs diff=lfs merge=lfs -text
57
+ *.tiff filter=lfs diff=lfs merge=lfs -text
58
+ # Image files - compressed
59
+ *.jpg filter=lfs diff=lfs merge=lfs -text
60
+ *.jpeg filter=lfs diff=lfs merge=lfs -text
61
+ *.webp filter=lfs diff=lfs merge=lfs -text
62
+ # Video files - compressed
63
+ *.mp4 filter=lfs diff=lfs merge=lfs -text
64
+ *.webm filter=lfs diff=lfs merge=lfs -text
65
+ >>>>>>> 0d2d42071b65c9b49ea7e471b711fe3cdf9fb532
README.md CHANGED
@@ -42,3 +42,43 @@ Status Law Assistant — это интеллектуальный чат-бот,
42
  - `src/`: директория с исходным кодом
43
  - `knowledge_base/`: модуль для работы с базой знаний
44
  - `models/`: модуль для работы с моделями
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
42
  - `src/`: директория с исходным кодом
43
  - `knowledge_base/`: модуль для работы с базой знаний
44
  - `models/`: модуль для работы с моделями
45
+ # Status Law Knowledge Base Dataset
46
+
47
+ This dataset serves as a storage for the Status Law Assistant chatbot, containing vector embeddings and chat history.
48
+
49
+ ## 📁 Structure
50
+
51
+ ```
52
+ status-law-knowledge-base/
53
+ ├── vector_store/
54
+ │ ├── index.faiss # FAISS vector store for document embeddings
55
+ │ └── index.pkl # Metadata and configuration for the vector store
56
+
57
+ └── chat_history/
58
+ └── logs.json # Chat history logs
59
+ ```
60
+
61
+ ## 🔍 Description
62
+
63
+ - `vector_store/`: Contains FAISS embeddings of legal documents from status.law website
64
+ - `index.faiss`: Vector embeddings for semantic search
65
+ - `index.pkl`: Metadata and configuration information
66
+
67
+ - `chat_history/`: Stores conversation logs
68
+ - `logs.json`: JSON file containing chat history and metadata
69
+
70
+ ## 🚀 Usage
71
+
72
+ This dataset is used by the Status Law Assistant chatbot to:
73
+ 1. Store and retrieve document embeddings for context-aware responses
74
+ 2. Maintain chat history for conversation continuity
75
+ 3. Track user interactions and improve response quality
76
+
77
+ ## 🔗 Related Links
78
+
79
+ - [Status Law Website](https://status.law)
80
+ - [Status Law Assistant Repository](https://huggingface.co/spaces/Rulga/status-law-assistant)
81
+
82
+ ## 📝 License
83
+
84
+ Private dataset for Status Law Assistant usage only.
chat_history/.gitkeep ADDED
File without changes
vector_store/.gitkeep ADDED
File without changes