File size: 1,163 Bytes
638084e
3694da1
 
638084e
 
 
 
 
 
 
 
3694da1
638084e
 
 
3694da1
 
638084e
 
 
3694da1
 
 
638084e
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
"""EvoLLM knowledge layer — RAG + LoRA-on-upload.

Components:
- parser             : PDF / TXT / MD / DOCX → clean text
- chunker            : long text → overlapping ~400-token chunks
- embedder           : fastembed wrapper (no torch dependency)
- store              : SQLite + numpy vector store
- pipeline           : high-level ingest / query API
- dataset_builder    : doc chunks → training JSONL
- notebook_generator : produce a Colab notebook with the dataset baked in
- adapter_importer   : accept uploaded GGUF + manifest, register an adapter

Local persistence at data/knowledge.sqlite + data/adapters/. On HF Spaces
the path is the container's ephemeral disk, so uploads vanish on rebuild
— there's a visible notice in the UI explaining that.
"""

from .adapter_importer import import_adapter
from .dataset_builder import build_dataset, write_jsonl
from .notebook_generator import generate_training_notebook
from .pipeline import KnowledgePipeline
from .store import KnowledgeStore

__all__ = [
    "KnowledgePipeline",
    "KnowledgeStore",
    "build_dataset",
    "write_jsonl",
    "generate_training_notebook",
    "import_adapter",
]