Spaces:
Running
Running
File size: 1,163 Bytes
638084e 3694da1 638084e 3694da1 638084e 3694da1 638084e 3694da1 638084e | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 | """EvoLLM knowledge layer — RAG + LoRA-on-upload.
Components:
- parser : PDF / TXT / MD / DOCX → clean text
- chunker : long text → overlapping ~400-token chunks
- embedder : fastembed wrapper (no torch dependency)
- store : SQLite + numpy vector store
- pipeline : high-level ingest / query API
- dataset_builder : doc chunks → training JSONL
- notebook_generator : produce a Colab notebook with the dataset baked in
- adapter_importer : accept uploaded GGUF + manifest, register an adapter
Local persistence at data/knowledge.sqlite + data/adapters/. On HF Spaces
the path is the container's ephemeral disk, so uploads vanish on rebuild
— there's a visible notice in the UI explaining that.
"""
from .adapter_importer import import_adapter
from .dataset_builder import build_dataset, write_jsonl
from .notebook_generator import generate_training_notebook
from .pipeline import KnowledgePipeline
from .store import KnowledgeStore
__all__ = [
"KnowledgePipeline",
"KnowledgeStore",
"build_dataset",
"write_jsonl",
"generate_training_notebook",
"import_adapter",
]
|