scienceworld-room-scanner
$ npx mdskill add zjunlp/SkillNet/scienceworld-room-scanner

Scan the current environment to generate a detailed inventory of visible objects, containers, and features.
- Determines item locations and assesses the current physical state of a location.
- Integrates with the environment's core spatial awareness and object detection APIs.
- Executes upon entering a new area or when environmental status needs verification.
- Delivers a structured observation detailing room contents, including nested container inventories.
SKILL.md
.github/skills/scienceworld-room-scanner
---
name: scienceworld-room-scanner
description: This skill performs a 'look around' action to scan and describe the current room's contents, including visible objects, containers, and doors. Use when entering a new room or when the agent needs to locate specific items or assess the environment state. The skill outputs a detailed room description, which is essential for inventory discovery and situational awareness.
---

# Skill: Room Scanner

## Purpose

Execute a `look around` action to obtain a comprehensive description of the current room in the ScienceWorld environment. This description is the foundational step for any task requiring item location, environmental assessment, or navigation planning.

## Core Instruction

When this skill is invoked, the agent must perform the **`look around`** action.

## Trigger Conditions

Invoke this skill when:

1. You first enter a new room via `teleport` or other movement.
2. You need to locate a specific object or container mentioned in your task.
3. The state of the room may have changed (e.g., after an interaction).
4. You are formulating a plan and require an inventory of available resources.

## Output Processing

The observation from `look around` will contain:

* **Room Name:** The identifier of your current location.
* **Visible Objects & Agents:** A list of all entities in the room.
* **Container Contents:** For open containers, a nested list of items inside (e.g., `a bowl (containing a red apple, a banana)`).
* **Device States:** The status of interactive objects (e.g., `a stove, which is turned off`).
* **Connections:** All accessible doors and their destination rooms.

**You must parse this output carefully.** Use it to update your mental model of the environment before proceeding with other actions like `pick up`, `examine`, or `use`.

## Integration Notes

* This is a low-level, atomic skill. It should often be the first action in a sequence.
* The observation it generates is critical context for subsequent decision-making. Refer back to it.
* Do not overuse it. Once you have a recent description of a room, rely on that knowledge until you have reason to believe the state has changed.

## Example

**Task:** Survey the workshop after teleporting there.

1. `look around`
2. Observation: "This room is called the workshop. In it, you see: a table. On the table is: a battery, a blue light bulb, an orange wire, a yellow wire, a green wire. You also see: a blue box, an orange box. There is a door to the hallway."
3. Parse: available components include battery, light bulb, three wires; classification containers are blue box and orange box.
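
The parse step in the example above could be sketched in Python roughly as follows. This is a minimal illustration, not part of the skill or of the ScienceWorld API: the function names `parse_room_observation` and `split_items` are hypothetical, and the regular expressions assume the flat, single-paragraph sentence shapes of the example observation only. Parenthesised container contents and device-state clauses such as `a stove, which is turned off` are not handled, and real observations may be formatted differently.

```python
import re

# Observation text copied from the example above.
EXAMPLE_OBSERVATION = (
    "This room is called the workshop. In it, you see: a table. "
    "On the table is: a battery, a blue light bulb, an orange wire, "
    "a yellow wire, a green wire. You also see: a blue box, an orange box. "
    "There is a door to the hallway."
)


def split_items(text: str) -> list[str]:
    """Split 'a battery, an orange wire' into ['battery', 'orange wire']."""
    items = []
    for part in text.split(","):
        # Drop a leading indefinite article ("a" or "an") if present.
        name = re.sub(r"^(?:a|an)\s+", "", part.strip())
        if name:
            items.append(name)
    return items


def parse_room_observation(observation: str) -> dict:
    """Hypothetical parser; assumes the sentence shapes of the example only."""
    room = re.search(r"This room is called the ([^.]+)\.", observation)

    # "In it, you see: ..." and "You also see: ..." list top-level objects.
    objects = []
    for listed in re.findall(r"[Yy]ou (?:also )?see: ([^.]+)\.", observation):
        objects.extend(split_items(listed))

    # "On the <surface> is: ..." lists items resting on a surface.
    surfaces = {
        surface: split_items(listed)
        for surface, listed in re.findall(r"On the ([^:]+?) is: ([^.]+)\.", observation)
    }

    # "There is a door to the <room>." names a connected room.
    doors = re.findall(r"door to the ([^.,]+)", observation)

    return {
        "room": room.group(1) if room else None,
        "objects": objects,
        "surfaces": surfaces,
        "doors": doors,
    }


if __name__ == "__main__":
    print(parse_room_observation(EXAMPLE_OBSERVATION))
```

Keeping the result as a plain dictionary makes it easy to store per room and to refer back to later instead of issuing another `look around`.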