openenv-content-moderation / docs /TECH_STACK.md

ThejasRao

Initial OpenENV hackathon submission

c492c3f 2 months ago

📄 TECH_STACK.md — Technology Choices & Justification

🧠 Overview

This document defines the technology stack used to build the AI Community Moderation Environment.

The stack is chosen to:

ensure OpenEnv compliance
enable fast development
support deterministic execution
allow seamless Docker + Hugging Face deployment

🧱 Core Language

🐍 Python 3.10+

Why:

native ecosystem for OpenEnv
strong support for RL + APIs
rapid prototyping and debugging

🌐 Backend Framework

⚡ FastAPI

Why:

lightweight and high-performance
built-in request validation (via Pydantic)
async support
ideal for exposing /reset, /step, /grader, etc.
easily deployable on Hugging Face Spaces

🧾 Data Validation & Models

📦 Pydantic

Why:

strict typing for OpenEnv compliance
automatic validation of API inputs/outputs
seamless integration with FastAPI
ensures deterministic schema handling
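
To make the strict-typing point concrete, here is a minimal sketch using Pydantic v2. The `ModerationAction` model, its fields, and the three decision values are hypothetical; the point is only that malformed payloads are rejected the same way every time.

```python
from typing import Literal, Optional

from pydantic import BaseModel, ConfigDict, ValidationError


class ModerationAction(BaseModel):
    # extra="forbid" rejects unknown fields; strict=True disables type coercion.
    model_config = ConfigDict(extra="forbid", strict=True)

    post_id: int
    decision: Literal["approve", "remove", "escalate"]


def parse_action(payload: dict) -> Optional[ModerationAction]:
    """Return a validated action, or None if the payload is malformed."""
    try:
        return ModerationAction.model_validate(payload)
    except ValidationError:
        return None


valid = parse_action({"post_id": 7, "decision": "remove"})
coerced = parse_action({"post_id": "7", "decision": "remove"})  # str, not int
unknown = parse_action({"post_id": 7, "decision": "ban"})       # not an allowed value
extra = parse_action({"post_id": 7, "decision": "remove", "note": "x"})
```

In this sketch only `valid` is a model instance; the other three payloads fail validation and come back as `None`.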

🧠 Environment Logic

🧩 Custom Python Modules

env/ → environment core
policy_engine.py → rule evaluation
reward_engine.py → step rewards
data_generator.py → synthetic data

Why:

full control over logic
deterministic execution
no external dependencies
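
One way a deterministic policy and reward pair could look is sketched below, using only the standard library. The `Post` fields, the two rules, and the reward values are invented for illustration and are not taken from `policy_engine.py` or `reward_engine.py`.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class Post:
    reports: int
    has_link: bool
    account_age_days: int


# Ordered rule list: the first matching rule wins, so evaluation is deterministic.
RULES: list[tuple[str, Callable[[Post], bool], str]] = [
    ("mass_reported", lambda p: p.reports >= 5, "remove"),
    ("new_account_link", lambda p: p.has_link and p.account_age_days < 7, "escalate"),
]


def evaluate_policy(post: Post) -> tuple[str, str]:
    """Return (rule_name, expected_action); fall through to approval."""
    for name, predicate, action in RULES:
        if predicate(post):
            return name, action
    return "no_violation", "approve"


def step_reward(action: str, expected: str) -> float:
    """Full credit for the expected action, partial credit for a cautious escalation."""
    if action == expected:
        return 1.0
    if action == "escalate":
        return 0.25
    return -1.0


rule, expected = evaluate_policy(Post(reports=6, has_link=False, account_age_days=100))
reward = step_reward("remove", expected)
```

Keeping the rules in a plain ordered list means the outcome depends only on the input post, which is what makes the step reward reproducible.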

🤖 Baseline Agent

🪶 Rule-based Agent (`baseline/agent.py`)

Heuristic agent using keyword matching — no LLM, fully reproducible.
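
A keyword heuristic of this kind can be very small. The sketch below is an assumption about its general shape; the keyword list and the `baseline_decide` name are hypothetical and do not come from `baseline/agent.py`.

```python
# Hypothetical keyword list; the real agent's vocabulary is not shown in this document.
FLAGGED_KEYWORDS = ("free money", "click here", "hate")


def baseline_decide(text: str) -> str:
    """Return "remove" if any flagged keyword appears, otherwise "approve"."""
    lowered = text.lower()
    if any(keyword in lowered for keyword in FLAGGED_KEYWORDS):
        return "remove"
    return "approve"


decisions = [baseline_decide(t) for t in ("Click HERE for prizes", "Nice photo!")]
```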

🧠 LLM Agent — 🔹 Added

Gemini 2.5 Flash via `google-genai` SDK

Why:

state-of-the-art reasoning on structured tasks
multi-turn chat with system prompt support
temperature=0.0 minimises run-to-run variance, although output from a hosted model is still not guaranteed to be identical
google-genai is Google's current official Python SDK for the Gemini API

Key files:

agent/gemini_agent.py   — GeminiAgent class, multi-turn loop
agent/prompts.py        — system prompt + per-turn prompt builder

SDK:

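from google import genai

# GOOGLE_API_KEY is assumed to be loaded elsewhere (for example, from the environment).
client = genai.Client(api_key=GOOGLE_API_KEY)
# config is elided here; it carries the system prompt and temperature=0.0.
chat = client.chats.create(model="gemini-2.5-flash", config=...)
# turn_prompt is the per-turn prompt built in agent/prompts.py.
response = chat.send_message(turn_prompt)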

Design constraint: LLM is ONLY the decision layer. Policy, reward, and grading remain deterministic in env/.
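
The constraint above can be sketched as follows: the model's reply is reduced to one of a fixed set of actions, and the reward is computed outside the model. The action names, the `escalate` fallback, and `stub_model` (a stand-in for the real chat call, so the sketch runs without an API key) are all assumptions for illustration.

```python
from typing import Callable

ALLOWED_ACTIONS = {"approve", "remove", "escalate"}


def parse_decision(raw_reply: str, fallback: str = "escalate") -> str:
    """Map free-form model text onto a known action, or use the fallback."""
    token = raw_reply.strip().lower()
    return token if token in ALLOWED_ACTIONS else fallback


def run_turn(decide: Callable[[str], str], post: str, expected: str) -> tuple[str, float]:
    """The model only picks the action; the reward is computed here, not by the model."""
    action = parse_decision(decide(post))
    reward = 1.0 if action == expected else 0.0
    return action, reward


def stub_model(post: str) -> str:
    """Stand-in for an LLM call: returns untidy text like a real model might."""
    return " Remove\n" if "spam" in post else "I think this is fine."


clean_turn = run_turn(stub_model, "spam link", expected="remove")
messy_turn = run_turn(stub_model, "hello there", expected="approve")
```

Here the second reply is not a recognised action, so it falls back to `escalate` and earns no reward, regardless of what the model claims about the post.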

📦 API Communication

JSON over HTTP

Why:

simple and standard
compatible with OpenEnv expectations
easy debugging and logging

🐳 Containerization

🐳 Docker

Why:

mandatory for the evaluation pipeline
ensures a consistent runtime environment
required for Docker-based Hugging Face Spaces deployment

☁️ Deployment Platform

🤗 Hugging Face Spaces (Docker-based)

Why:

required by the competition
easy container deployment
public endpoint for validation
integrates well with OpenEnv ecosystem

🧪 Testing & Validation

✅ Pytest (optional but recommended)

Why:

quick validation of:
- policy engine
- reward logic
- graders
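
A test for a grader might look like the sketch below. `grade_episode` is a hypothetical grader (fraction of steps where the chosen action matched the expected one), written here only so the tests have something to exercise.

```python
def grade_episode(actions: list[str], expected: list[str]) -> float:
    """Hypothetical grader: fraction of steps where the action matched."""
    if len(actions) != len(expected):
        raise ValueError("actions and expected must have the same length")
    if not expected:
        return 0.0
    hits = sum(a == e for a, e in zip(actions, expected))
    return hits / len(expected)


# pytest collects functions named test_*; plain asserts are enough.
def test_grader_scores_partial_match():
    assert grade_episode(["approve", "remove"], ["approve", "approve"]) == 0.5


def test_grader_is_deterministic():
    args = (["remove"] * 3, ["remove", "approve", "remove"])
    assert grade_episode(*args) == grade_episode(*args)


def test_grader_rejects_length_mismatch():
    try:
        grade_episode(["approve"], [])
    except ValueError:
        return
    raise AssertionError("expected ValueError for mismatched lengths")
```

Saved as, say, `test_grader.py`, these would be picked up by running `pytest` in the same directory.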

🔍 OpenEnv Validator

Why:

ensures compliance with:
- API schema
- openenv.yaml
- required endpoints

📊 Logging (Optional)

Python Logging

Why:

debug environment flow
trace agent decisions
inspect reward calculations
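
A minimal sketch of per-step logging with the standard library is shown below. The logger name `moderation_env` and the `log_step` helper are assumptions, not names from the project.

```python
import logging

logger = logging.getLogger("moderation_env")  # hypothetical logger name


def log_step(step: int, action: str, reward: float) -> None:
    """Emit one key=value line per environment step."""
    logger.info("step=%d action=%s reward=%.2f", step, action, reward)


# Configure the root logger once at startup; INFO is enough to trace decisions.
logging.basicConfig(level=logging.INFO, format="%(levelname)s %(name)s %(message)s")
log_step(1, "remove", 1.0)
```

Using `key=value` pairs keeps the lines easy to grep when tracing why a particular reward was assigned.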

📦 Dependency Summary

fastapi
uvicorn[standard]
pydantic>=2.6
google-genai>=1.0
openai          # optional / legacy
pytest
httpx           # required by FastAPI's TestClient

⚙️ Runtime Setup

| Component | Tool |
|---|---|
| API Server | Uvicorn |
| Backend | FastAPI |
| Models | Pydantic |
| Container | Docker |
| Deployment | HF Spaces |

🧠 Design Principles

1. Minimalism

Only essential tools used

2. Determinism

No external APIs for core logic

3. Reproducibility

Same input → same output
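
For synthetic data, one common way to get this property is a privately seeded random generator. The sketch below assumes a hypothetical `generate_posts` function and template list; it is not the project's `data_generator.py`.

```python
import random

# Hypothetical templates for synthetic posts.
TEMPLATES = ["Great post!", "Buy now!!!", "You are wrong.", "Check my profile"]


def generate_posts(seed: int, n: int) -> list[str]:
    """Generate n synthetic posts from a private, seeded RNG."""
    rng = random.Random(seed)  # local generator: no shared global state
    return [rng.choice(TEMPLATES) for _ in range(n)]


first_run = generate_posts(seed=42, n=5)
second_run = generate_posts(seed=42, n=5)
```

Because each call builds its own `random.Random(seed)`, the two runs produce the same list, and other code that touches the global `random` module cannot disturb the sequence.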

4. Fast Iteration

Simple stack = quick debugging

⚠️ What We Avoid (Important)

❌ heavy ML frameworks (TensorFlow, PyTorch)
❌ databases (not needed for MVP)
❌ microservices complexity
❌ external APIs for environment logic

🧠 One-Line Summary

A lightweight Python-based stack using FastAPI and Pydantic to build a deterministic, OpenEnv-compliant environment with Docker-based deployment.
