Resolve merge conflicts with Hugging Face dataset
- .gitattributes +30 -0
- README.md +40 -0
- chat_history/.gitkeep +0 -0
- vector_store/.gitkeep +0 -0
.gitattributes
CHANGED
|
@@ -8,6 +8,11 @@
|
|
| 8 |
*.h5 filter=lfs diff=lfs merge=lfs -text
|
| 9 |
*.joblib filter=lfs diff=lfs merge=lfs -text
|
| 10 |
*.lfs.* filter=lfs diff=lfs merge=lfs -text
|
| 11 |
*.mlmodel filter=lfs diff=lfs merge=lfs -text
|
| 12 |
*.model filter=lfs diff=lfs merge=lfs -text
|
| 13 |
*.msgpack filter=lfs diff=lfs merge=lfs -text
|
|
@@ -33,3 +38,28 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
| 8 |
*.h5 filter=lfs diff=lfs merge=lfs -text
|
| 9 |
*.joblib filter=lfs diff=lfs merge=lfs -text
|
| 10 |
*.lfs.* filter=lfs diff=lfs merge=lfs -text
|
| 11 |
+
<<<<<<< HEAD
|
| 12 |
+
=======
|
| 13 |
+
*.lz4 filter=lfs diff=lfs merge=lfs -text
|
| 14 |
+
*.mds filter=lfs diff=lfs merge=lfs -text
|
| 15 |
+
>>>>>>> 0d2d42071b65c9b49ea7e471b711fe3cdf9fb532
|
| 16 |
*.mlmodel filter=lfs diff=lfs merge=lfs -text
|
| 17 |
*.model filter=lfs diff=lfs merge=lfs -text
|
| 18 |
*.msgpack filter=lfs diff=lfs merge=lfs -text
|
|
|
|
| 38 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 39 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 40 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
| 41 |
+
<<<<<<< HEAD
|
| 42 |
+
=======
|
| 43 |
+
# Audio files - uncompressed
|
| 44 |
+
*.pcm filter=lfs diff=lfs merge=lfs -text
|
| 45 |
+
*.sam filter=lfs diff=lfs merge=lfs -text
|
| 46 |
+
*.raw filter=lfs diff=lfs merge=lfs -text
|
| 47 |
+
# Audio files - compressed
|
| 48 |
+
*.aac filter=lfs diff=lfs merge=lfs -text
|
| 49 |
+
*.flac filter=lfs diff=lfs merge=lfs -text
|
| 50 |
+
*.mp3 filter=lfs diff=lfs merge=lfs -text
|
| 51 |
+
*.ogg filter=lfs diff=lfs merge=lfs -text
|
| 52 |
+
*.wav filter=lfs diff=lfs merge=lfs -text
|
| 53 |
+
# Image files - uncompressed
|
| 54 |
+
*.bmp filter=lfs diff=lfs merge=lfs -text
|
| 55 |
+
*.gif filter=lfs diff=lfs merge=lfs -text
|
| 56 |
+
*.png filter=lfs diff=lfs merge=lfs -text
|
| 57 |
+
*.tiff filter=lfs diff=lfs merge=lfs -text
|
| 58 |
+
# Image files - compressed
|
| 59 |
+
*.jpg filter=lfs diff=lfs merge=lfs -text
|
| 60 |
+
*.jpeg filter=lfs diff=lfs merge=lfs -text
|
| 61 |
+
*.webp filter=lfs diff=lfs merge=lfs -text
|
| 62 |
+
# Video files - compressed
|
| 63 |
+
*.mp4 filter=lfs diff=lfs merge=lfs -text
|
| 64 |
+
*.webm filter=lfs diff=lfs merge=lfs -text
|
| 65 |
+
>>>>>>> 0d2d42071b65c9b49ea7e471b711fe3cdf9fb532
|
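The lines added to `.gitattributes` in the hunks above include `<<<<<<< HEAD`, `=======`, and `>>>>>>> 0d2d42071b65c9b49ea7e471b711fe3cdf9fb532`, which have the shape of Git conflict markers. A minimal sketch of how such lines could be detected before committing is shown below; the function name `find_conflict_markers` and the `sample` text are illustrative assumptions, not part of this repository.

```python
import re

# Git conflict markers start a line with exactly seven '<', '=' or '>' characters,
# followed by whitespace or the end of the line.
CONFLICT_MARKER = re.compile(r"^(<{7}|={7}|>{7})(\s|$)")


def find_conflict_markers(text: str) -> list[tuple[int, str]]:
    """Return (line_number, line) pairs for lines that look like conflict markers."""
    hits = []
    for number, line in enumerate(text.splitlines(), start=1):
        if CONFLICT_MARKER.match(line):
            hits.append((number, line))
    return hits


# Hypothetical sample built from lines that appear in the first hunk of the diff.
sample = "\n".join([
    "*.lfs.* filter=lfs diff=lfs merge=lfs -text",
    "<<<<<<< HEAD",
    "=======",
    "*.lz4 filter=lfs diff=lfs merge=lfs -text",
    "*.mds filter=lfs diff=lfs merge=lfs -text",
    ">>>>>>> 0d2d42071b65c9b49ea7e471b711fe3cdf9fb532",
    "*.mlmodel filter=lfs diff=lfs merge=lfs -text",
])

for number, line in find_conflict_markers(sample):
    print(f"line {number}: {line}")
```

Running the same function over the full text of a file, for example one read with `pathlib.Path.read_text()`, would report every marker line in it.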
README.md
CHANGED
|
@@ -42,3 +42,43 @@ Status Law Assistant is an intelligent chatbot,
|
|
| 42 |
- `src/`: source code directory
|
| 43 |
- `knowledge_base/`: module for working with the knowledge base
|
| 44 |
- `models/`: module for working with models
|
| 42 |
- `src/`: source code directory
|
| 43 |
- `knowledge_base/`: module for working with the knowledge base
|
| 44 |
- `models/`: module for working with models
|
| 45 |
+
# Status Law Knowledge Base Dataset
|
| 46 |
+
|
| 47 |
+
This dataset serves as storage for the Status Law Assistant chatbot, containing vector embeddings and chat history.
|
| 48 |
+
|
| 49 |
+
## 📁 Structure
|
| 50 |
+
|
| 51 |
+
```
|
| 52 |
+
status-law-knowledge-base/
|
| 53 |
+
├── vector_store/
|
| 54 |
+
│ ├── index.faiss # FAISS vector store for document embeddings
|
| 55 |
+
│ └── index.pkl # Metadata and configuration for the vector store
|
| 56 |
+
│
|
| 57 |
+
└── chat_history/
|
| 58 |
+
└── logs.json # Chat history logs
|
| 59 |
+
```
|
| 60 |
+
|
| 61 |
+
## 🔍 Description
|
| 62 |
+
|
| 63 |
+
- `vector_store/`: Contains FAISS embeddings of legal documents from the status.law website
|
| 64 |
+
- `index.faiss`: Vector embeddings for semantic search
|
| 65 |
+
- `index.pkl`: Metadata and configuration information
|
| 66 |
+
|
| 67 |
+
- `chat_history/`: Stores conversation logs
|
| 68 |
+
- `logs.json`: JSON file containing chat history and metadata
|
| 69 |
+
|
| 70 |
+
## 🚀 Usage
|
| 71 |
+
|
| 72 |
+
This dataset is used by the Status Law Assistant chatbot to:
|
| 73 |
+
1. Store and retrieve document embeddings for context-aware responses
|
| 74 |
+
2. Maintain chat history for conversation continuity
|
| 75 |
+
3. Track user interactions and improve response quality
|
| 76 |
+
|
| 77 |
+
## 🔗 Related Links
|
| 78 |
+
|
| 79 |
+
- [Status Law Website](https://status.law)
|
| 80 |
+
- [Status Law Assistant Repository](https://huggingface.co/spaces/Rulga/status-law-assistant)
|
| 81 |
+
|
| 82 |
+
## 📝 License
|
| 83 |
+
|
| 84 |
+
Private dataset for Status Law Assistant usage only.
|
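The structure tree in the README above names three files under `vector_store/` and `chat_history/`. As a rough sketch, a local copy of the dataset could be checked against that layout as follows; the helper name `missing_files` and the local checkout path are assumptions for illustration, and only the relative paths come from the README.

```python
from pathlib import Path

# Relative paths taken from the structure tree in the README above.
EXPECTED_FILES = (
    "vector_store/index.faiss",
    "vector_store/index.pkl",
    "chat_history/logs.json",
)


def missing_files(root: Path) -> list[str]:
    """Return the expected relative paths that do not exist as files under root."""
    return [rel for rel in EXPECTED_FILES if not (root / rel).is_file()]


if __name__ == "__main__":
    # "status-law-knowledge-base" is the root folder name shown in the README tree;
    # where the dataset is checked out locally is an assumption.
    for rel in missing_files(Path("status-law-knowledge-base")):
        print(f"missing: {rel}")
```

This diff adds only `.gitkeep` placeholders to the two directories, so a checkout containing nothing beyond what is shown here would report all three files as missing.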
chat_history/.gitkeep
ADDED
|
File without changes
|
vector_store/.gitkeep
ADDED
|
File without changes
|