Spaces:

Kush26
/

toneforge

Sleeping

App Files Files Community

toneforge / docs /core_features.md

Kush-Singh-26

corrected models and added test prompts

ebefa8f 4 months ago

preview code

Raw

History Blame Contribute Delete

5 kB

Detailed Documentation: core_features.py (Developer 1 Domain)

This file manages the Core Communication Logic: Email Formalization, Translation, and Smart Replies.

1. Environment & API Key Guard (Best Practice)

In Python, any code at the top level (outside functions) is executed the moment the file is imported. Since we initialize LLM objects immediately, we MUST ensure the API key is present, or the app will crash with a confusing error.

import os
from dotenv import load_dotenv

# 0. Load the .env file immediately
load_dotenv()

# Guard: Check if the key exists BEFORE initializing models
if not os.getenv("GROQ_API_KEY"):
    raise ValueError("GROQ_API_KEY is missing. Check your .env file.")

Viva Point: Mention that this is a "Guard Clause" for Import-Time Initialization. It provides a clear, human-readable error message if the environment is misconfigured.

2. LLM Configuration & Temperatures

We use OpenAI GPT-OSS 120B for all tasks through Groq.

# Using openai/gpt-oss-120b for all tasks - high reasoning and quality
analyser_llm = ChatGroq(model="openai/gpt-oss-120b", temperature=0.0)
business_llm = ChatGroq(model="openai/gpt-oss-120b", temperature=0.4)
academic_llm = ChatGroq(model="openai/gpt-oss-120b", temperature=0.4)
corporate_llm = ChatGroq(model="openai/gpt-oss-120b", temperature=0.4)
translator_llm = ChatGroq(model="openai/gpt-oss-120b", temperature=0.2)
reply_llm = ChatGroq(model="openai/gpt-oss-120b", temperature=0.5)

OpenAI GPT-OSS 120B: Used for all tasks including analysis, writing, and translation. Provides high-quality output across all use cases.
Temperature 0.0: Used for the analyser to ensure deterministic classification.
Temperature 0.2-0.5: Used for writing and translation to balance professional consistency with human-like phrasing.

3. Pydantic Models (Type Safety)

Pydantic ensures that the LLM's output matches exactly what we expect.

class AnalysisOutput(BaseModel):
    already_formal: bool = Field(description="True if the email is already formal")
    detected_category: Literal["business", "academic", "corporate", "unknown"]
    main_points: str = Field(description="Extracted or original main content")

Field Descriptions: These strings are actually passed to the LLM by the PydanticOutputParser so the AI knows exactly what each field means.

4. LangGraph Workflow Routing

The logic behind how the email "moves" through the system.

def _route_after_analysis(state: EmailState) -> str:
    a = state["analysis"]
    if a and a.already_formal and a.detected_category == state["category"]:
        return "return_direct"
    return state["category"]

Logic: If the already_formal flag is True, we skip the writing nodes and jump straight to the output. This saves tokens and reduces latency for the user.

5. API Endpoint Implementation

The bridge to the frontend.

@router.post("/formalize_email")
async def formalize_email(request: EmailRequest):
    result = await main_graph.ainvoke({
        "raw_email": request.raw_email,
        "category": request.category,
        "language": request.language or "english",
        # ... state ...
    })
    return {
        "category": request.category,
        "email": result["final_email"].model_dump(),
    }

ainvoke: Stands for "Asynchronous Invoke." It allows the server to handle multiple users simultaneously without waiting (blocking) for the LLM to finish.
model_dump(): Converts the Pydantic object into a JSON-friendly Python dictionary.

6. Available Tasks & Functional Logic

A. Email Formalizer (Rewrite Task)

Purpose: Transforms informal, messy, or "brain-dump" notes into structured, professional emails.

Analysis Node: The system first identifies the "Main Points" (who is involved, what is requested, what is the deadline).
Style Injection: Depending on the category (Business/Academic/Corporate), the AI applies a rigid structural template.
Edge Case Handling: If the email is already formal, the AI recognizes this and preserves the original content to avoid over-processing.

B. Smart Reply Generator

Purpose: Context-aware reply generation based on the thread's history.

Context Awareness: Instead of a generic "Thanks," the AI reads the incoming message and crafts a specific response addressing the points raised.
Tone Matching: Ensures the reply matches the professional category chosen by the user.

C. Professional Translator

Purpose: High-fidelity translation that preserves professional nuance.

Tone Preservation: Unlike standard translators that might lose formality, this task specifically instructs the AI to maintain "Business Formal" or "Academic Respectful" tones in the target language.
Entity Protection: Proper nouns, company names, and technical terms are preserved as-is.