Mituvinci committed
Commit 7428575 · 1 Parent(s): 6d671f9

Two-model setup: GPT-4o-mini examines, Claude answers

Files changed (2)
  1. README.md +20 -12
  2. src/graph/nodes.py +3 -2
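
The per-role model split this commit introduces can be summarized in a small sketch before reading the diffs. This is a hypothetical illustration: the `RoleConfig` dataclass and `ROLES` mapping are not part of the repo; only the model identifiers and temperatures come from this commit.

```python
# Hypothetical summary of the per-role model setup in this commit.
# RoleConfig and ROLES are illustrative names, not part of the repo.
from dataclasses import dataclass

@dataclass(frozen=True)
class RoleConfig:
    provider: str       # API vendor backing the role
    model: str          # model identifier passed to the chat client
    temperature: float  # sampling temperature used for the role

ROLES = {
    "examine":  RoleConfig("openai", "gpt-4o-mini", 0.7),                  # question generation
    "evaluate": RoleConfig("openai", "gpt-4o-mini", 0.0),                  # deterministic grading
    "answer":   RoleConfig("anthropic", "claude-sonnet-4-20250514", 0.3),  # RAG answering
}

for role, cfg in ROLES.items():
    print(f"{role}: {cfg.provider} / {cfg.model} @ T={cfg.temperature}")
```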
README.md CHANGED
@@ -12,11 +12,18 @@ private: true
 
 # Adaptive Study Agent
 
-A **LLM self-examination simulation** built with **LangGraph** and **Claude (Anthropic)**. The agent reads any document you provide, then runs a fully autonomous study loop — the LLM generates its own comprehension questions, retrieves context from ChromaDB to answer them, and evaluates its own answers. The user does not answer any questions. The purpose is to **probe where the LLM's understanding of the document breaks down** — which topics it answers confidently versus where it scores low and needs to re-read.
-
-The output is a structured session report revealing the LLM's weak areas within your document. This is useful for identifying conceptually dense or underrepresented sections in any text.
-
-This project can be applied to **any domain** — machine learning papers, medical literature, legal documents, textbooks — anything in PDF or TXT format.
+A **two-model LLM self-examination simulation** built with **LangGraph**, **Claude (Anthropic)**, and **GPT-4o-mini (OpenAI)**. The agent reads any document you provide and runs a fully autonomous study loop — no human answers anything.
+
+**How the two models collaborate:**
+- **GPT-4o-mini** generates comprehension questions from document chunks (temperature 0.7 — creative) and evaluates the answers against the source material (temperature 0.0 — deterministic)
+- **Claude Sonnet** answers the questions using RAG retrieval from ChromaDB (temperature 0.3 — balanced)
+- **OpenAI text-embedding-3-small** handles document chunking and embedding into ChromaDB only — not used for reasoning
+
+The purpose is to **probe where Claude's understanding of the document breaks down** — GPT acts as the examiner, Claude as the student. When Claude scores below the mastery threshold, the agent re-reads the weak chunk and tries again.
+
+The output is a structured session report revealing Claude's weak areas within your document — useful for identifying conceptually dense or underrepresented sections in any text.
+
+Applicable to **any domain** — ML papers, medical literature, legal documents, textbooks — anything in PDF or TXT format.
 
 ---
 
@@ -38,15 +45,16 @@ The agent operates as a LangGraph state machine with conditional branching. Afte
 
 ## Tech Stack
 
-| Component        | Technology                    | Purpose                                      |
-|------------------|-------------------------------|----------------------------------------------|
-| Agent framework  | LangGraph                     | Stateful loops with conditional branching    |
-| LLM              | Claude Sonnet 4 (Anthropic)   | Question generation, answering, evaluation   |
-| Embeddings       | OpenAI text-embedding-3-small | Text chunk embeddings                        |
-| Vector store     | ChromaDB (local, embedded)    | No Docker required                           |
-| Document parsing | PyMuPDF (fitz)                | PDF support                                  |
-| UI               | Gradio                        | Web interface and Hugging Face Spaces deploy |
-| Package manager  | uv                            | Dependency management                        |
+| Component        | Technology                    | Purpose                                            |
+|------------------|-------------------------------|----------------------------------------------------|
+| Agent framework  | LangGraph                     | Stateful loops with conditional branching          |
+| Examiner LLM     | GPT-4o-mini (OpenAI)          | Question generation (0.7) + evaluation (0.0)       |
+| Student LLM      | Claude Sonnet 4 (Anthropic)   | Answering questions via RAG (0.3)                  |
+| Embeddings       | OpenAI text-embedding-3-small | Document chunking and embedding into ChromaDB only |
+| Vector store     | ChromaDB (local, embedded)    | No Docker required                                 |
+| Document parsing | PyMuPDF (fitz)                | PDF support                                        |
+| UI               | Gradio                        | Web interface and Hugging Face Spaces deploy       |
+| Package manager  | uv                            | Dependency management                              |
 
 ---
 
src/graph/nodes.py CHANGED
@@ -2,6 +2,7 @@ import random
 import re
 
 from langchain_anthropic import ChatAnthropic
+from langchain_openai import ChatOpenAI
 from langchain_core.messages import HumanMessage
 
 from src.graph.state import StudyState
@@ -45,7 +46,7 @@ def generate_question_node(state: StudyState) -> dict:
     else:
         passage = random.choice(chunks)
 
-    llm = ChatAnthropic(model="claude-sonnet-4-20250514", temperature=0.7)
+    llm = ChatOpenAI(model="gpt-4o-mini", temperature=0.7)
     prompt = QUESTION_PROMPT.format(passage=passage)
     response = llm.invoke([HumanMessage(content=prompt)])
     question = response.content.strip()
@@ -81,7 +82,7 @@ def evaluate_node(state: StudyState) -> dict:
     source_chunks = retrieve_chunks(vectorstore, question, top_k=1)
     source = source_chunks[0] if source_chunks else ""
 
-    llm = ChatAnthropic(model="claude-sonnet-4-20250514", temperature=0.0)
+    llm = ChatOpenAI(model="gpt-4o-mini", temperature=0.0)
    prompt = EVALUATE_PROMPT.format(question=question, answer=answer, source=source)
     response = llm.invoke([HumanMessage(content=prompt)])
     result = response.content.strip()
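
The control flow these nodes feed into can be sketched end to end with stub callables standing in for the real LLM calls. The mastery threshold, retry count, and helper names below are assumptions for illustration, not values taken from the repo.

```python
# Sketch of the examiner/student loop: the examiner asks and grades,
# the student answers; a low score triggers a re-read of the same chunk.
# All names and values here are illustrative, not from the repo.

def study_chunk(chunk, ask, answer, grade, mastery=0.7, max_retries=2):
    """Run one chunk through the ask -> answer -> grade loop.

    Returns (final_score, attempts_used)."""
    question = ask(chunk)                      # examiner (creative, T=0.7)
    score = 0.0
    for attempt in range(1, max_retries + 2):
        reply = answer(question, chunk)        # student (balanced, T=0.3)
        score = grade(question, reply, chunk)  # examiner (deterministic, T=0.0)
        if score >= mastery:
            break                              # mastered; move to next chunk
    return score, attempt

# Demo with deterministic stubs: the student fails once, then recovers.
calls = []
def stub_answer(question, chunk):
    calls.append(question)
    return "correct" if len(calls) > 1 else "wrong"

score, attempts = study_chunk(
    chunk="LangGraph builds stateful agent loops.",
    ask=lambda chunk: "What does LangGraph build?",
    answer=stub_answer,
    grade=lambda q, a, chunk: 1.0 if a == "correct" else 0.2,
)
print(score, attempts)  # 1.0 2
```

In the real graph this re-read decision is a LangGraph conditional edge rather than a Python for loop; the sketch only shows the data flow between the two models.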