Clinical Report Generator API

FastAPI-based API for generating clinical report summaries using a fine-tuned T5 model optimized for clinical documentation.

Model Information

This API serves a T5 model fine-tuned for clinical report generation. Summaries are decoded with beam search using the generation parameters listed under Technical Details.

API Endpoints

POST /predict

Generate a clinical report summary from input text.

Request body:

{
    "text": "string"
}

Example:

{
    "text": "evaluation type: initial. primary diagnosis: F84.0. severity: mild. primary language: english"
}

Response:

{
    "success": true,
    "data": "string",
    "error": null,
    "metrics": {
        "process_time": 1.23
    }
}
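The request and response shapes above can be exercised with a small client. This is a minimal sketch using only the Python standard library. The base URL (http://localhost:8000) and the helper names are assumptions for illustration, as is the failure shape (success set to false with a message in error), since only the success case is documented here.

```python
import json
import urllib.request

BASE_URL = "http://localhost:8000"  # assumption: adjust to your deployment


def build_predict_request(text: str, base_url: str = BASE_URL) -> urllib.request.Request:
    """Build a POST /predict request carrying {"text": ...} as JSON."""
    body = json.dumps({"text": text}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/predict",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def parse_predict_response(raw: bytes) -> str:
    """Return the summary from the response envelope, or raise on failure."""
    payload = json.loads(raw)
    # Assumption: a failed call sets "success" to false and fills "error".
    if not payload.get("success"):
        raise RuntimeError(f"prediction failed: {payload.get('error')}")
    return payload["data"]


def predict(text: str, base_url: str = BASE_URL, timeout: float = 60.0) -> str:
    """Send text to POST /predict and return the generated summary."""
    request = build_predict_request(text, base_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return parse_predict_response(response.read())
```

Calling predict("evaluation type: initial. primary diagnosis: F84.0. severity: mild. primary language: english") against a running instance should return the string from the data field.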

GET /health

Health check endpoint that provides system and model status.

Response:

{
    "status": "healthy",
    "model_loaded": true,
    "last_load_time": 1234567890.123,
    "version": "1.0.0",
    "gpu_available": true,
    "gpu_name": "NVIDIA GeForce RTX 3080"
}
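A readiness check built on this response might look like the sketch below. It assumes the same hypothetical base URL (http://localhost:8000) and treats any status other than "healthy", or a model that is not loaded, as not ready. The other status values the service may report are not documented here.

```python
import json
import urllib.request


def is_ready(health: dict) -> bool:
    """True when the service reports healthy and the model is loaded."""
    return health.get("status") == "healthy" and health.get("model_loaded") is True


def fetch_health(base_url: str = "http://localhost:8000", timeout: float = 5.0) -> dict:
    """Call GET /health and return the decoded JSON body."""
    # Assumption: base_url points at a running instance of this API.
    with urllib.request.urlopen(f"{base_url}/health", timeout=timeout) as response:
        return json.loads(response.read())
```

A caller could poll is_ready(fetch_health()) before sending the first /predict request, since model loading may take some time after startup.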

Input Format Guidelines

For best results with the T5 model:

- Structure your input data clearly using key-value pairs
- Separate different fields with periods or commas
- Keep inputs concise and focused on relevant clinical information
- The API automatically prefixes inputs with "summarize:" for optimal model performance
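The guidelines above can be sketched as a small helper that turns a dictionary of fields into the key-value text the API expects. The function name build_input is hypothetical. It joins fields with periods, skips empty values, and does not add the "summarize:" prefix because the API adds it.

```python
from typing import Mapping


def build_input(fields: Mapping[str, object]) -> str:
    """Join non-empty fields as 'key: value' pairs separated by periods."""
    parts = []
    for key, value in fields.items():
        text = str(value).strip()
        if text:  # drop empty fields to keep the input concise
            parts.append(f"{key.strip()}: {text}")
    return ". ".join(parts)


example = build_input({
    "evaluation type": "initial",
    "primary diagnosis": "F84.0",
    "severity": "mild",
    "primary language": "english",
})
```

Here example reproduces the request text shown under POST /predict.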

Technical Details

Model: pdarleyjr/iplc-t5-clinical
Max Input Length: 512 tokens
Max Output Length: 512 tokens (increased for more comprehensive summaries)
Generation Parameters:
- Number of beams: 5 (increased for better coherence)
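- Temperature: 0.7 (intended for more natural generation; Hugging Face Transformers applies temperature only when sampling is enabled, not under plain beam search)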
- No repeat ngram size: 3
- Length penalty: 2.0
- Early stopping: Enabled
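
For reference, the settings above map onto Hugging Face Transformers generate arguments roughly as follows. This is a sketch under stated assumptions, not the API's actual code: the summarize function, the exact prefix string "summarize: " (with a trailing space), and the choice to leave sampling disabled are all assumptions.

```python
MODEL_NAME = "pdarleyjr/iplc-t5-clinical"
MAX_INPUT_TOKENS = 512

# Generation parameters as listed above, using the argument names of generate().
GENERATION_KWARGS = {
    "max_length": 512,
    "num_beams": 5,
    # Only takes effect if do_sample=True is also passed; whether the API
    # enables sampling is not stated, so this sketch leaves it off.
    "temperature": 0.7,
    "no_repeat_ngram_size": 3,
    "length_penalty": 2.0,
    "early_stopping": True,
}


def summarize(text: str, model_name: str = MODEL_NAME) -> str:
    """Load the model and generate a summary for one input string."""
    # Imported lazily so the constants above can be used without transformers.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_name)
    inputs = tokenizer(
        "summarize: " + text,  # assumption: prefix format used by the API
        return_tensors="pt",
        max_length=MAX_INPUT_TOKENS,
        truncation=True,  # inputs beyond 512 tokens are cut off
    )
    output_ids = model.generate(**inputs, **GENERATION_KWARGS)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Running summarize requires the transformers and torch packages and downloads the model weights on first use.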
