added ai2thor enviroment with a method that proposes steps and uses C… by Vman11 · Pull Request #280 · ITM-Kitware/align-system

Vman11 · 2026-05-28T15:14:53Z

…R to choose steps

dmjoy

Overall this is in the right direction, but there are a few things I would like to see changed (see comments) or at least have a good handle on why the current modifications are needed for those cases.

dmjoy · 2026-06-04T20:13:15Z

+    Unlike the Outlines engine, output is not grammar-constrained — the
+    schema is appended to the prompt as an instruction and the response
+    is parsed as JSON with a repair fallback.
+    """


It does look like Ollama supports structured output with a JSON schema: https://ollama.com/blog/structured-outputs. Seems like we should use that if possible

dmjoy · 2026-06-04T20:16:00Z

        return outputs

-    def run_inference(self, prompts, schema):
+    def _parse_json(self, text: str) -> dict:


I'm not really excited about the idea of including this here. Have you been running into JSON validation etc. errors with the outlines inference engine here or? Just curious why this is even needed.

dmjoy · 2026-06-04T20:22:35Z

I think it's fine to have the history tracking as you have it in here for now, but I'm more inclined to merge Yoni's approach on this: https://github.com/ITM-Kitware/align-system/pull/277/changes#diff-ea512e45fac46d4935ce85a4837bdbcd27b5891a09c1ac5f6aad038076f4d497

As it maintains the full working_output history.

dmjoy · 2026-06-04T20:24:32Z

+        tool_lines = "\n".join(f"- {t.name}: {t.description}" for t in tools)
+        history_lines = (
+            "\n".join(f"- {a.tool_name}({a.args})" for a in self._history)
+            if self._history else "None"
+        )
+        predict_proposer_prompt = (
+            f"Task: {scenario_state.unstructured}\n\n"
+            f"Available tools:\n{tool_lines}\n\n"
+            f"Action history:\n{history_lines}\n\n"
+            f"Generate {self.num_candidates} diverse candidate plans."
+        )
+
+        score_schema = (
+            '{"candidates":[{"actions":[{"tool_name":"MoveAhead","args":{"moveMagnitude":0.25}}],'
+            '"rationale":"..."}]}'
+        )
+
+        prompt_system = ("You are an embodied planning model.\n"
+            "Return ONLY valid JSON. No extra text.\n"
+            f"Generate {self.num_candidates} semi-diverse candidate plans.\n")        
+        prompt = (
+            f"You are an embodied planning model.\n"
+            "Return ONLY valid JSON. No extra text.\n"
+            f"Generate {self.num_candidates} diverse candidate plans.\n"
+            f"- Each plan is 1 to {self.rollout_horizon} actions.\n"
+            f"- Use ONLY the tool names provided.\n"
+            f"- Args MUST satisfy each tool schema.\n"
+            f"- IMPORTANT objectId rule: For tools requiring objectId (TeleportNearObject, PickupObject, "
+            f"OpenObject, CloseObject, ToggleObjectOn/Off), you MUST copy the exact full objectId string "
+            "from the observation's visible lines (the value after 'id='). "
+            "Never use object type names like 'Apple' as objectId. Full objectIds contain '|' characters.\n"
+            "- Avoid repeating the same last action unless clearly helpful.\n"
+        )


My preference would for these to be done similar to how we do prompts in other ADMs (outlines templates or callables, and parameterize them in the init call so that we can swap them around at Hydra configuration time)

dmjoy · 2026-06-04T20:26:41Z

These experiment configs should be probably be in a subdirectory inside of experiment as that's typically how we've been doing it.

dmjoy · 2026-06-04T20:28:52Z

I feel like these should be in a file specific to AI2Thor (if they are indeed specific data types for that domain) just to help with namespacing. I.e. if I do from align_system.data_models.ai2thor import ToolSpec that seems more informative

dmjoy · 2026-06-04T20:30:24Z

+        Image.fromarray(frame.astype(np.uint8)).save(fpath)
+        print(f"[AI2ThorEnv] saved frame: {fpath}")
+
+    def reset(self, task: str) -> Observation:


All of the setup info etc. in here seems like it should be living in a data or config file somewhere rather than code?

dmjoy · 2026-06-04T20:31:38Z

+from align_system.data_models.types import Action as PlannerAction
+
+
+TASKS = {


Similar story here, this should probably be in a data or configuration file somewhere and we would probably want to be able to modify it for a given experiment.

added ai2thor enviroment with a method that proposes steps and uses C…

9ebb478

…R to choose steps

dmjoy requested changes Jun 4, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

added ai2thor enviroment with a method that proposes steps and uses C…#280

added ai2thor enviroment with a method that proposes steps and uses C…#280
Vman11 wants to merge 1 commit into
mainfrom
ai2thor

Vman11 commented May 28, 2026

Uh oh!

dmjoy left a comment

Uh oh!

dmjoy Jun 4, 2026

Uh oh!

dmjoy Jun 4, 2026

Uh oh!

dmjoy Jun 4, 2026

Uh oh!

dmjoy Jun 4, 2026

Uh oh!

dmjoy Jun 4, 2026

Uh oh!

dmjoy Jun 4, 2026

Uh oh!

dmjoy Jun 4, 2026

Uh oh!

dmjoy Jun 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		from align_system.data_models.types import Action as PlannerAction


		TASKS = {

Conversation

Vman11 commented May 28, 2026

Uh oh!

dmjoy left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants