Spaces:

SDSC
/

ai-agent

Paused

App Files Files Community

ai-agent / docs /development /structure.md

katospiegel

Deploy develop: FastAPI+React frontend, multi-stage Docker (ai_agent serve)

07c2476 verified 6 days ago

preview code

Raw

History Blame Contribute Delete

8.58 kB

	# Project Structure

	The AI Imaging Agent is organized into modular components with clear separation of concerns.

	## Directory Layout

	```
	ai-agent/
	├── .github/
	│ └── workflows/ # CI/CD workflows
	│ └── deploy_docs.yml # Documentation deployment
	├── artifacts/
	│ └── rag_index/ # FAISS index and embeddings
	├── dataset/
	│ └── catalog.jsonl # Software catalog
	├── docs/ # MkDocs documentation
	├── logs/ # Application logs
	├── src/
	│ └── ai_agent/ # Main package
	│ ├── agent/ # PydanticAI agent
	│ ├── api/ # Pipeline orchestration
	│ ├── catalog/ # Catalog management
	│ ├── generator/ # VLM selection (schemas)
	│ ├── retriever/ # Text retrieval
	│ ├── ui/ # Gradio interface
	│ └── utils/ # Shared utilities
	├── tests/ # Test suite
	├── config.yaml # Model configuration
	├── mkdocs.yml # Documentation config
	├── pyproject.toml # Package metadata
	└── README.md # Project readme
	```

	## Core Modules

	### src/ai_agent/

	Main package containing all application code.

	#### agent/

	PydanticAI conversational agent implementation.

	```
	agent/
	├── __init__.py
	├── agent.py # Agent definition, tool adapters
	├── models.py # Agent output/log models
	├── utils.py # Agent state and tool quota helpers
	└── tools/ # Tool implementations (search, repo_info, mcp)
	```

	Key components:

	- `agent.py`: Agent instance, system prompt, tool definitions
	- `models.py`: Agent output and tool usage schemas
	- `utils.py`: `AgentState` plus call caps/prepare hooks
	- `tools/`: Tool implementations (search, alternatives, repo info, mcp tools)

	Dependencies: `api/`, `utils/`

	#### api/

	Pipeline orchestration and core logic.

	```
	api/
	├── __init__.py
	└── pipeline.py # RAGImagingPipeline main class
	```

	Responsibilities:

	- File validation and metadata extraction
	- Retrieval + VLM selection orchestration
	- Error handling and logging
	- Index management

	Dependencies: `retriever/`, `generator/`, `utils/`

	#### catalog/

	Software catalog synchronization.

	```
	catalog/
	├── __init__.py
	└── sync.py # Catalog sync logic
	```

	Functions:

	- Load catalog from JSONL
	- Check for changes (SHA1)
	- Trigger index rebuild

	Dependencies: `retriever/`

	#### generator/

	VLM selection schemas and types.

	```
	generator/
	├── __init__.py
	└── schema.py # Pydantic models for responses
	```

	Models:

	- `ToolRecommendation`: Individual tool recommendation
	- `AgentResponse`: Complete response with status
	- `ConversationStatus`: Enum for conversation states
	- `ToolReason`: Enum for recommendation reasons

	Dependencies: None (pure schemas)

	#### retriever/

	Text-based retrieval pipeline.

	```
	retriever/
	├── __init__.py
	├── text_embedder.py # BGE-M3 embedding model
	├── vector_index.py # FAISS index management
	├── reranker.py # CrossEncoder reranking
	└── software_doc.py # Catalog schema and loading
	```

	Pipeline flow:

	1. `text_embedder.py`: Embed query
	2. `vector_index.py`: FAISS search
	3. `reranker.py`: CrossEncoder reranking
	4. Output: Top-K candidates

	Dependencies: None (pure retrieval)

	#### ui/

	Gradio web interface.

	```
	ui/
	├── __init__.py
	├── app.py # Gradio app definition
	├── components.py # Reusable UI components
	├── formatters.py # Response formatting
	├── handlers.py # Message handlers
	├── state.py # UI state management
	└── visualizations.py # Preview and trace rendering
	```

	Key files:

	- `app.py`: Main Gradio interface
	- `handlers.py`: `respond()` function - core interaction logic
	- `formatters.py`: Format recommendations as markdown/cards
	- `components.py`: Reusable Gradio components

	Dependencies: `agent/`, `api/`

	#### utils/

	Shared utilities.

	```
	utils/
	├── __init__.py
	├── config.py # Configuration loading
	├── file_validator.py # File validation
	├── image_meta.py # Metadata extraction (DICOM, NIfTI, TIFF)
	├── previews.py # Image preview generation
	└── tags.py # Control tag parsing
	```

	Common utilities:

	- `config.py`: Load `config.yaml` with Pydantic validation
	- `file_validator.py`: Size limits, format checks
	- `image_meta.py`: Extract DICOM/NIfTI/TIFF metadata
	- `previews.py`: Convert medical images to PNG
	- `tags.py`: Parse exclusion tags and strip control tags from queries

	Dependencies: None (pure utilities)

	#### cli.py

	Command-line interface entry point.

	```python
	def main():
	# Parse arguments
	# Route to chat or sync
	```

	Commands:

	- `ai_agent chat`: Launch UI
	- `ai_agent sync`: Sync catalog

	### tests/

	Test suite.

	```
	tests/
	├── data/
	│ └── test_data.json # Test cases
	├── test_retrieval_pipeline.py
	├── test_deepwiki_repo_info.py
	└── ...
	```

	Test categories:

	- Unit tests: Individual components
	- Integration tests: Full pipeline
	- End-to-end tests: Real API calls (optional)

	## Configuration Files

	### pyproject.toml

	Python package metadata and dependencies.

	```toml
	[project]
	name = "ai_agent"
	version = "1.0.0"
	dependencies = [...]

	[project.scripts]
	ai_agent = "ai_agent.cli:main"
	```

	### config.yaml

	Model configuration.

	```yaml
	agent_model:
	name: "gpt-4o-mini"
	base_url: null
	api_key_env: "OPENAI_API_KEY"

	available_models:
	- display_name: "gpt-4o-mini"
	name: "gpt-4o-mini"
	...
	```

	### mkdocs.yml

	Documentation configuration.

	```yaml
	site_name: AI Imaging Agent
	theme:
	name: material
	nav: [...]
	```

	### .env

	Environment variables (not committed).

	```dotenv
	OPENAI_API_KEY=sk-xxxx
	SOFTWARE_CATALOG=dataset/catalog.jsonl
	```

	## Data Files

	### dataset/catalog.jsonl

	Software catalog in JSON Lines format.

	Each line is a complete JSON object following schema.org SoftwareSourceCode.

	### artifacts/rag_index/

	Pre-built FAISS index and metadata.

	```
	artifacts/rag_index/
	├── index.faiss # FAISS binary index
	└── meta.json # Tool IDs, config, timestamps
	```

	## Module Boundaries

	Clear separation prevents circular dependencies:

	```
	ui/ → agent/ → api/ → retriever/
	→ generator/
	→ utils/
	```

	Rules:

	- `utils/`: No dependencies on other modules
	- `retriever/`: Pure retrieval, no generation
	- `generator/`: Pure schemas, no retrieval
	- `api/`: Orchestrates retriever + generator
	- `agent/`: Uses api for tool calls
	- `ui/`: Top-level, depends on agent + api

	## Import Patterns

	All imports use absolute paths from `ai_agent`:

	```python
	from ai_agent.retriever.vector_index import VectorIndex
	from ai_agent.utils.config import load_config
	from ai_agent.agent.utils import AgentState
	```

	Never use relative imports like `from ..utils import ...`

	## Extension Points

	### Adding New Tools

	Add tool adapters to `agent/agent.py` and implement logic in `agent/tools/`:

	```python
	@agent.tool
	async def new_tool(ctx: RunContext[AgentState], param: str) -> str:
	"""Tool description."""
	# Implementation
	return result
	```

	### Adding New Metadata Extractors

	Add to `utils/image_meta.py`:

	```python
	def extract_custom_format(file_path: str) -> dict:
	"""Extract metadata from custom format."""
	# Implementation
	return metadata
	```

	### Adding New Retrieval Models

	Replace in `retriever/text_embedder.py`:

	```python
	class TextEmbedder:
	def __init__(self, model_name="new-embedding-model"):
	self.model = SentenceTransformer(model_name)
	```

	## Next Steps

	- Learn about [Contributing](contributing.md)
	- Explore [Testing](testing.md)
	- Return to [Architecture Overview](../architecture/overview.md)