suhail
spoecs
9eafd9f
# Data Model: Task CRUD Operations
**Feature**: Task CRUD Operations
**Date**: 2026-01-08
**Status**: Complete
## Overview
This document defines the database schema, entity relationships, and data validation rules for the Task CRUD feature. The data model supports multi-user task management with user data isolation.
## Entity Relationship Diagram
```
┌─────────────────┐ ┌─────────────────┐
│ User │ │ Task │
├─────────────────┤ ├─────────────────┤
│ id (PK) │◄────────│ id (PK) │
│ email │ 1:N │ user_id (FK) │
│ name │ │ title │
│ created_at │ │ description │
│ updated_at │ │ completed │
└─────────────────┘ │ created_at │
│ updated_at │
└─────────────────┘
Relationship: One User has many Tasks
One Task belongs to one User
```
## Entities
### Task Entity
**Purpose**: Represents a to-do item belonging to a specific user.
**Table Name**: `tasks`
**Columns**:
| Column Name | Type | Constraints | Description |
|--------------|--------------|--------------------------------|------------------------------------------------|
| id | Integer | PRIMARY KEY, AUTO_INCREMENT | Unique task identifier |
| user_id | Integer | FOREIGN KEY (users.id), NOT NULL, INDEX | Owner of the task |
| title | String(200) | NOT NULL, LENGTH(1-200) | Task title (required) |
| description | String(1000) | NULLABLE, LENGTH(0-1000) | Optional task description |
| completed | Boolean | NOT NULL, DEFAULT FALSE, INDEX | Completion status |
| created_at | DateTime | NOT NULL, DEFAULT NOW() | Timestamp when task was created |
| updated_at | DateTime | NOT NULL, DEFAULT NOW(), ON UPDATE NOW() | Timestamp of last update |
**Indexes**:
- PRIMARY KEY on `id`
- INDEX on `user_id` (for filtering tasks by user)
- INDEX on `completed` (for filtering active/completed tasks)
- COMPOSITE INDEX on `(user_id, completed)` (for combined filtering)
- INDEX on `created_at` (for sorting by date)
**Constraints**:
- `user_id` FOREIGN KEY references `users(id)` ON DELETE CASCADE
- `title` must be between 1 and 200 characters
- `description` must be between 0 and 1000 characters (NULL allowed)
- `completed` must be boolean (true/false)
**SQLModel Definition**:
```python
from sqlmodel import SQLModel, Field, Relationship
from datetime import datetime
from typing import Optional
class Task(SQLModel, table=True):
"""Task entity representing a to-do item."""
__tablename__ = "tasks"
id: Optional[int] = Field(default=None, primary_key=True)
user_id: int = Field(foreign_key="users.id", nullable=False, index=True)
title: str = Field(max_length=200, nullable=False)
description: Optional[str] = Field(default=None, max_length=1000)
completed: bool = Field(default=False, nullable=False, index=True)
created_at: datetime = Field(default_factory=datetime.utcnow, nullable=False)
updated_at: datetime = Field(default_factory=datetime.utcnow, nullable=False)
# Relationship (will be fully implemented in Spec 2)
# user: Optional["User"] = Relationship(back_populates="tasks")
class Config:
json_schema_extra = {
"example": {
"id": 1,
"user_id": 42,
"title": "Buy groceries",
"description": "Milk, eggs, bread",
"completed": False,
"created_at": "2026-01-08T10:00:00Z",
"updated_at": "2026-01-08T10:00:00Z"
}
}
```
### User Entity (Stub)
**Purpose**: Represents an authenticated user (full implementation in Spec 2).
**Table Name**: `users`
**Columns** (minimal for Spec 1):
| Column Name | Type | Constraints | Description |
|--------------|--------------|--------------------------------|------------------------------------------------|
| id | Integer | PRIMARY KEY, AUTO_INCREMENT | Unique user identifier |
| email | String(255) | UNIQUE, NOT NULL | User email address |
| name | String(100) | NOT NULL | User display name |
| created_at | DateTime | NOT NULL, DEFAULT NOW() | Timestamp when user was created |
| updated_at | DateTime | NOT NULL, DEFAULT NOW() | Timestamp of last update |
**SQLModel Definition** (stub for Spec 1):
```python
from sqlmodel import SQLModel, Field
from datetime import datetime
from typing import Optional
class User(SQLModel, table=True):
"""User entity (stub for authentication spec)."""
__tablename__ = "users"
id: Optional[int] = Field(default=None, primary_key=True)
email: str = Field(max_length=255, unique=True, nullable=False)
name: str = Field(max_length=100, nullable=False)
created_at: datetime = Field(default_factory=datetime.utcnow, nullable=False)
updated_at: datetime = Field(default_factory=datetime.utcnow, nullable=False)
# Relationship (will be fully implemented in Spec 2)
# tasks: List["Task"] = Relationship(back_populates="user")
```
## Pydantic Schemas (Request/Response)
### TaskCreate (Request)
**Purpose**: Validate task creation requests.
```python
from pydantic import BaseModel, Field
from typing import Optional
class TaskCreate(BaseModel):
"""Schema for creating a new task."""
title: str = Field(
min_length=1,
max_length=200,
description="Task title (1-200 characters)"
)
description: Optional[str] = Field(
default=None,
max_length=1000,
description="Optional task description (0-1000 characters)"
)
class Config:
json_schema_extra = {
"example": {
"title": "Buy groceries",
"description": "Milk, eggs, bread"
}
}
```
### TaskUpdate (Request)
**Purpose**: Validate task update requests (full replacement).
```python
class TaskUpdate(BaseModel):
"""Schema for updating an existing task."""
title: str = Field(
min_length=1,
max_length=200,
description="Task title (1-200 characters)"
)
description: Optional[str] = Field(
default=None,
max_length=1000,
description="Optional task description (0-1000 characters)"
)
completed: bool = Field(
description="Task completion status"
)
class Config:
json_schema_extra = {
"example": {
"title": "Buy groceries and milk",
"description": "Updated description",
"completed": False
}
}
```
### TaskPatch (Request)
**Purpose**: Validate partial task updates (e.g., toggle completion).
```python
class TaskPatch(BaseModel):
"""Schema for partially updating a task."""
title: Optional[str] = Field(
default=None,
min_length=1,
max_length=200,
description="Task title (1-200 characters)"
)
description: Optional[str] = Field(
default=None,
max_length=1000,
description="Optional task description (0-1000 characters)"
)
completed: Optional[bool] = Field(
default=None,
description="Task completion status"
)
class Config:
json_schema_extra = {
"example": {
"completed": True
}
}
```
### TaskResponse (Response)
**Purpose**: Standardized task response format.
```python
from datetime import datetime
class TaskResponse(BaseModel):
"""Schema for task responses."""
id: int
user_id: int
title: str
description: Optional[str]
completed: bool
created_at: datetime
updated_at: datetime
class Config:
from_attributes = True # Enable ORM mode
json_schema_extra = {
"example": {
"id": 1,
"user_id": 42,
"title": "Buy groceries",
"description": "Milk, eggs, bread",
"completed": False,
"created_at": "2026-01-08T10:00:00Z",
"updated_at": "2026-01-08T10:00:00Z"
}
}
```
### TaskListResponse (Response)
**Purpose**: Response format for listing multiple tasks.
```python
from typing import List
class TaskListResponse(BaseModel):
"""Schema for task list responses."""
tasks: List[TaskResponse]
total: int
class Config:
json_schema_extra = {
"example": {
"tasks": [
{
"id": 1,
"user_id": 42,
"title": "Buy groceries",
"description": "Milk, eggs, bread",
"completed": False,
"created_at": "2026-01-08T10:00:00Z",
"updated_at": "2026-01-08T10:00:00Z"
}
],
"total": 1
}
}
```
## Data Validation Rules
### Title Validation
- **Required**: Yes
- **Min Length**: 1 character
- **Max Length**: 200 characters
- **Allowed Characters**: Any Unicode characters
- **Trimming**: Leading/trailing whitespace should be trimmed
- **Error Message**: "Title must be between 1 and 200 characters"
### Description Validation
- **Required**: No (optional)
- **Min Length**: 0 characters (empty string or NULL)
- **Max Length**: 1000 characters
- **Allowed Characters**: Any Unicode characters
- **Trimming**: Leading/trailing whitespace should be trimmed
- **Error Message**: "Description must be 1000 characters or less"
### Completed Validation
- **Required**: Yes (defaults to False on creation)
- **Type**: Boolean (true/false)
- **Error Message**: "Completed must be a boolean value"
### User ID Validation
- **Required**: Yes
- **Type**: Integer
- **Validation**: Must reference existing user in users table
- **Error Message**: "Invalid user ID"
## State Transitions
### Task Lifecycle
```
┌────────────┐
│ Created │ (completed = false)
│ (Initial) │
└──────┬──────┘
│ User marks complete
┌─────────────┐
│ Completed │ (completed = true)
└──────┬──────┘
│ User marks incomplete
┌─────────────┐
│ Active │ (completed = false)
└──────┬──────┘
│ User deletes
┌─────────────┐
│ Deleted │ (removed from database)
└─────────────┘
```
**Valid Transitions**:
- Created → Completed (mark as done)
- Completed → Active (mark as not done)
- Any state → Deleted (remove task)
- Active → Updated (edit title/description)
- Completed → Updated (edit title/description)
## Database Migration
### Initial Migration (Alembic)
```python
"""Create tasks table
Revision ID: 001_create_tasks
Revises:
Create Date: 2026-01-08
"""
from alembic import op
import sqlalchemy as sa
from sqlalchemy.dialects import postgresql
# revision identifiers
revision = '001_create_tasks'
down_revision = None
branch_labels = None
depends_on = None
def upgrade():
# Create users table (stub for Spec 2)
op.create_table(
'users',
sa.Column('id', sa.Integer(), nullable=False),
sa.Column('email', sa.String(length=255), nullable=False),
sa.Column('name', sa.String(length=100), nullable=False),
sa.Column('created_at', sa.DateTime(), nullable=False),
sa.Column('updated_at', sa.DateTime(), nullable=False),
sa.PrimaryKeyConstraint('id'),
sa.UniqueConstraint('email')
)
# Create tasks table
op.create_table(
'tasks',
sa.Column('id', sa.Integer(), nullable=False),
sa.Column('user_id', sa.Integer(), nullable=False),
sa.Column('title', sa.String(length=200), nullable=False),
sa.Column('description', sa.String(length=1000), nullable=True),
sa.Column('completed', sa.Boolean(), nullable=False, server_default='false'),
sa.Column('created_at', sa.DateTime(), nullable=False),
sa.Column('updated_at', sa.DateTime(), nullable=False),
sa.ForeignKeyConstraint(['user_id'], ['users.id'], ondelete='CASCADE'),
sa.PrimaryKeyConstraint('id')
)
# Create indexes
op.create_index('ix_tasks_user_id', 'tasks', ['user_id'])
op.create_index('ix_tasks_completed', 'tasks', ['completed'])
op.create_index('ix_tasks_user_id_completed', 'tasks', ['user_id', 'completed'])
op.create_index('ix_tasks_created_at', 'tasks', ['created_at'])
def downgrade():
op.drop_index('ix_tasks_created_at', table_name='tasks')
op.drop_index('ix_tasks_user_id_completed', table_name='tasks')
op.drop_index('ix_tasks_completed', table_name='tasks')
op.drop_index('ix_tasks_user_id', table_name='tasks')
op.drop_table('tasks')
op.drop_table('users')
```
## Data Integrity Rules
### Foreign Key Constraints
- `tasks.user_id` MUST reference valid `users.id`
- ON DELETE CASCADE: Deleting a user deletes all their tasks
- Prevents orphaned tasks in database
### Uniqueness Constraints
- No uniqueness constraint on task titles (users can have duplicate titles)
- `users.email` must be unique (enforced in users table)
### NOT NULL Constraints
- `tasks.id`: Always required (auto-generated)
- `tasks.user_id`: Always required (task must belong to user)
- `tasks.title`: Always required (empty tasks not allowed)
- `tasks.completed`: Always required (defaults to false)
- `tasks.created_at`: Always required (auto-generated)
- `tasks.updated_at`: Always required (auto-updated)
### Check Constraints (Optional)
```sql
-- Ensure title is not empty after trimming
ALTER TABLE tasks ADD CONSTRAINT check_title_not_empty
CHECK (LENGTH(TRIM(title)) > 0);
-- Ensure description length if provided
ALTER TABLE tasks ADD CONSTRAINT check_description_length
CHECK (description IS NULL OR LENGTH(description) <= 1000);
```
## Query Patterns
### Common Queries
**Get all tasks for a user**:
```sql
SELECT * FROM tasks
WHERE user_id = ?
ORDER BY created_at DESC;
```
**Get active tasks for a user**:
```sql
SELECT * FROM tasks
WHERE user_id = ? AND completed = false
ORDER BY created_at DESC;
```
**Get completed tasks for a user**:
```sql
SELECT * FROM tasks
WHERE user_id = ? AND completed = true
ORDER BY created_at DESC;
```
**Get specific task with ownership check**:
```sql
SELECT * FROM tasks
WHERE id = ? AND user_id = ?;
```
**Update task with timestamp**:
```sql
UPDATE tasks
SET title = ?, description = ?, completed = ?, updated_at = NOW()
WHERE id = ? AND user_id = ?;
```
**Delete task with ownership check**:
```sql
DELETE FROM tasks
WHERE id = ? AND user_id = ?;
```
## Performance Considerations
### Index Usage
- `user_id` index: Used in all queries (data isolation)
- `completed` index: Used for filtering active/completed
- Composite `(user_id, completed)` index: Optimizes filtered queries
- `created_at` index: Used for sorting by date
### Query Optimization
- Always include `user_id` in WHERE clause (uses index)
- Limit result sets for large task lists (pagination)
- Use `SELECT *` sparingly in production (specify columns)
- Avoid N+1 queries (use joins if fetching related data)
### Connection Pooling
- Use Neon's built-in connection pooling
- Configure pool size based on expected concurrent users
- Reuse database sessions across requests
## Data Seeding (Development)
### Sample Data for Testing
```python
# Sample users
users = [
{"id": 1, "email": "alice@example.com", "name": "Alice"},
{"id": 2, "email": "bob@example.com", "name": "Bob"}
]
# Sample tasks
tasks = [
{
"user_id": 1,
"title": "Buy groceries",
"description": "Milk, eggs, bread",
"completed": False
},
{
"user_id": 1,
"title": "Finish project report",
"description": None,
"completed": True
},
{
"user_id": 2,
"title": "Call dentist",
"description": "Schedule appointment",
"completed": False
}
]
```
## Summary
The data model defines two entities: Task (full implementation) and User (stub for Spec 2). Tasks have a many-to-one relationship with Users, enforced via foreign key constraint. Validation rules ensure data integrity at both API and database layers. Indexes optimize query performance for filtering and sorting. The schema supports all functional requirements from the specification while maintaining user data isolation.
**Ready for**: API contract generation (contracts/).