Spaces:

DecentSanage
/

constraint-env

Sleeping

App Files Files Community

DecentSanage commited on 15 days ago

Commit

319b4a3

verified ·

1 Parent(s): 6293ebc

Upload folder using huggingface_hub

Browse files

Files changed (6) hide show

README.md +13 -14
inference.py +2 -1
models.py +1 -1
server/app.py +16 -0
server/constraint_env_environment.py +14 -12
server/gradio_ui.py +7 -1

README.md CHANGED Viewed

@@ -1,12 +1,12 @@
 ---
 title: Constraint Environment
-emoji: 🧩
 colorFrom: purple
 colorTo: blue
 sdk: docker
 pinned: false
 license: mit
-short_description: RL training env — natural language to constraint AST
 base_path: /web
 ---
@@ -211,22 +211,21 @@ INFO:     10.16.33.124:5187 - "GET /web HTTP/1.1" 307 Temporary Redirect
 INFO:     10.16.24.44:32462 - "GET /web/ HTTP/1.1" 200 OK
 ```
-## Custom Web UI
-When `ENABLE_WEB_INTERFACE=true` the server mounts a **tabbed Gradio interface** at `/web`:
 | Tab | Description |
 |-----|-------------|
-| **Playground** | Default OpenEnv UI with Reset / Step / Get State controls |
-| **Constraint Compiler** | Our custom tab — task selector, full AST code editor, sample loaders, node-structure reference, and a real-time compiler chatbot |
-### Trying the Compiler tab
-1. Select a difficulty (`easy / medium / hard`) and click **Reset / Load Task**.
-2. The prompt appears and the editor is pre-filled with a correct sample AST.
-3. Edit the AST and click **▶ Submit to Compiler** — the chatbot shows the reward, error code, and exact compiler message.
-4. Use **📚 Load Sample ASTs** to reload any of the 3 canonical examples instantly.
-> The custom tab is implemented in `server/gradio_ui.py` via the `gradio_builder` extension point provided by OpenEnv core.
 ## Project Structure

 ---
 title: Constraint Environment
+emoji: 👨‍🔧
 colorFrom: purple
 colorTo: blue
 sdk: docker
 pinned: false
 license: mit
+short_description: RL training env for natural language to constraint AST
 base_path: /web
 ---
 INFO:     10.16.24.44:32462 - "GET /web/ HTTP/1.1" 200 OK
 ```
+## Web UI & Interactive Compiler
+When `ENABLE_WEB_INTERFACE=true` (or when deployed to Hugging Face Spaces), the server mounts a **tabbed Gradio interface** at `/web`, natively featuring our custom compiler interface as the primary landing page:
 | Tab | Description |
 |-----|-------------|
+| **Constraint Compiler** | **(Default First Page)** Our custom built interactive AST environment — featuring a task difficulty selector, full JSON syntax AST code editor, interactive single-click AST sample loaders, node-structure references, and a real-time syntax/logic compiler chatbot. |
+| **Base Playground** | The fallback default OpenEnv UI, displaying stateless Reset / Step / Get State controls for the raw API. |
+### Trying the Compiler Loop (First Page)
+1. Navigate to `/web` to access the **Constraint Compiler**.
+2. Start by expanding the **📚 Load Sample ASTs** accordion and clicking one of the sample buttons (`🟢 Easy Sample`, `🟡 Medium Sample`, `🔴 Hard Sample`). This instantly populates the compiler code editor with a canonical AST structure avoiding manual entry.
+3. Select a difficulty (`easy / medium / hard`) in the task dropdown and click **Reset / Load Task**.
+4. The prompt will update. Edit the loaded AST in the code editor to attempt to solve the logic.
+5. Click **▶ Submit to Compiler** — the chatbot will parse the payload and display the reward, error code, and precise compiler traceback feedback natively inline!
 ## Project Structure

inference.py CHANGED Viewed

@@ -249,7 +249,8 @@ def _run_task(task_id: str, env_url: str = "http://localhost:8000") -> None:
                 messages = getattr(obs_data, "messages", []) if not isinstance(obs_data, dict) else obs_data.get("messages", [])
                 if step_count > 0 and messages:
-                    prompt_text += "\n\n" + "\n".join(messages)
                 # Generate AST from LLM with retry loop
                 raw_output = "{}"

                 messages = getattr(obs_data, "messages", []) if not isinstance(obs_data, dict) else obs_data.get("messages", [])
                 if step_count > 0 and messages:
+                    parsed = [m.get("content", str(m)) if isinstance(m, dict) else str(m) for m in messages]
+                    prompt_text += "\n\n" + "\n".join(parsed)
                 # Generate AST from LLM with retry loop
                 raw_output = "{}"

models.py CHANGED Viewed

@@ -39,7 +39,7 @@ class ConstraintObservation(Observation):
     done: bool
     reward: float
     info: Dict[str, Any]
-    messages: list[str] = []
 class ConstraintState(State):

     done: bool
     reward: float
     info: Dict[str, Any]
+    messages: list[Dict[str, Any]] = []
 class ConstraintState(State):

server/app.py CHANGED Viewed

@@ -66,6 +66,22 @@ except ImportError:
         _gradio_builder = None
 # Create the app – pass the factory so create_app calls _make_env() per session.
 app = create_app(
     _make_env,
     ConstraintAction,

         _gradio_builder = None
 # Create the app – pass the factory so create_app calls _make_env() per session.
+import gradio as gr
+_orig_tabbed = gr.TabbedInterface
+def _swapped_tabbed(interface_list, tab_names, **kwargs):
+    if len(interface_list) == 2 and "Custom" in tab_names:
+        idx_custom = tab_names.index("Custom")
+        idx_play = tab_names.index("Playground")
+        return _orig_tabbed(
+            [interface_list[idx_custom], interface_list[idx_play]],
+            ["Constraint Compiler", "Base Playground"],
+            **kwargs
+        )
+    return _orig_tabbed(interface_list, tab_names, **kwargs)
+gr.TabbedInterface = _swapped_tabbed
 app = create_app(
     _make_env,
     ConstraintAction,

server/constraint_env_environment.py CHANGED Viewed

@@ -158,9 +158,9 @@ class ConstraintEnvironment(Environment):
         except (json.JSONDecodeError, TypeError) as exc:
             info["error"] = "invalid_json"
             messages.extend([
-                "Your last submitted AST:",
-                str(action.ast_output),
-                f"Compiler Error: Syntax Error. Invalid JSON — {exc}"
             ])
         # ── 2. Logic match (ignores "name") ──────────────────────────
@@ -171,29 +171,31 @@ class ConstraintEnvironment(Environment):
             else:
                 info["exact_match"] = False
-        # ── 3. Validate structure ─────────────────────────────────────
-        # ── 3. Validate structure ─────────────────────────────────────
         if is_exact_match:
             is_valid_structure = True
         elif is_valid_json:
             valid, msg = self._validate_structure(ast)
             if valid:
                 is_valid_structure = True
                 # NEW FIX: Provide feedback when structure is valid but logic fails!
                 info["error"] = "logic_mismatch"
                 messages.extend([
-                    "Your last submitted AST:",
-                    action.ast_output,
-                    "Compiler Error: Syntax is valid, but the logic does not match the prompt's target constraint. Please adjust your logical conditions and resubmit."
                 ])
             else:
                 info["error"] = "bad_structure"
                 messages.extend([
-                    "Your last submitted AST:",
-                    action.ast_output,
-                    f"Compiler Error: {msg}"
                 ])
         done = is_exact_match or self._state.step_count >= self._state.max_steps
         reward = calculate_step_reward(is_valid_json, is_valid_structure, is_exact_match, self._state.step_count)

         except (json.JSONDecodeError, TypeError) as exc:
             info["error"] = "invalid_json"
             messages.extend([
+                {"role": "assistant", "content": "Your last submitted AST:"},
+                {"role": "assistant", "content": str(action.ast_output)},
+                {"role": "assistant", "content": f"Compiler Error: Syntax Error. Invalid JSON — {exc}"}
             ])
         # ── 2. Logic match (ignores "name") ──────────────────────────
             else:
                 info["exact_match"] = False
+      # ── 3. Validate structure ─────────────────────────────────────
         if is_exact_match:
             is_valid_structure = True
         elif is_valid_json:
+            # THE FIX: Safely convert the AST to a string so Pydantic doesn't crash!
+            safe_ast_str = json.dumps(ast) if isinstance(ast, dict) else str(action.ast_output)
             valid, msg = self._validate_structure(ast)
             if valid:
                 is_valid_structure = True
                 # NEW FIX: Provide feedback when structure is valid but logic fails!
                 info["error"] = "logic_mismatch"
                 messages.extend([
+                    {"role": "assistant", "content": "Your last submitted AST:"},
+                    {"role": "assistant", "content": safe_ast_str},
+                    {"role": "assistant", "content": "Compiler Error: Syntax is valid, but the logic does not match the prompt's target constraint. Please adjust your logical conditions and resubmit."}
                 ])
             else:
                 info["error"] = "bad_structure"
                 messages.extend([
+                    {"role": "assistant", "content": "Your last submitted AST:"},
+                    {"role": "assistant", "content": safe_ast_str},
+                    {"role": "assistant", "content": f"Compiler Error: {msg}"}
                 ])
         done = is_exact_match or self._state.step_count >= self._state.max_steps
         reward = calculate_step_reward(is_valid_json, is_valid_structure, is_exact_match, self._state.step_count)

server/gradio_ui.py CHANGED Viewed

@@ -183,7 +183,13 @@ def build_constraint_gradio_ui(web_manager, action_fields, metadata, is_chat_env
         else:
             status = f"ℹ️ {error}"
-        compiler_msg = "\n".join(msgs) if msgs else ("✅ Correct! Episode complete." if exact else "No compiler feedback.")
         done_badge = "  🏁 Episode Done" if done else ""
         entry = (
             f"[YOU]\n{ast_text[:400]}\n\n"

         else:
             status = f"ℹ️ {error}"
+        parsed_msgs = []
+        for m in msgs:
+            if isinstance(m, dict):
+                parsed_msgs.append(m.get("content", str(m)))
+            else:
+                parsed_msgs.append(str(m))
+        compiler_msg = "\n".join(parsed_msgs) if parsed_msgs else ("✅ Correct! Episode complete." if exact else "No compiler feedback.")
         done_badge = "  🏁 Episode Done" if done else ""
         entry = (
             f"[YOU]\n{ast_text[:400]}\n\n"