📐 schemas/ · 03_shell___system_integration.md

Chapter 3: Shell & System Integration

📄 schemas/03_shell___system_integration.md

Chapter 3: Shell & System Integration

In the previous chapter, Polymorphic Hook Definitions, we learned how to differentiate between different types of actions (Commands, HTTP requests, Prompts).

Now, we will focus on the most common and powerful action: The Command.

This chapter explores Shell & System Integration. We will look at how to define the "Rules of Engagement" for interacting with the operating system, ensuring commands run safely and in the correct environment.

The Problem: "It's not just about running a command"

Imagine you want your AI assistant to start a local web server. You tell it to run npm start.

If you simply execute this command blindly, several things could go wrong:

Freezing: The server runs forever. The AI freezes, waiting for the command to finish.
Wrong Language: You are on Windows, but the command uses Linux syntax (ls vs dir).
Runaway Process: The command hangs and eats up all your CPU.

We need a way to wrap the raw command in a safety layer. We need to tell the system how to run the command, not just what to run.

Analogy: The Soldier's Orders

Think of the BashCommandHook as a set of orders given to a soldier before a mission:

The Weapon (Shell): Are you using a Rifle (Bash) or a Pistol (PowerShell)?
The Time Limit (Timeout): "If you aren't back in 30 minutes, abort."
The Radio Protocol (Async): "Call us when you are done" vs. "Maintain radio silence."

The Solution: `BashCommandHook`

The BashCommandHook schema handles these details. Let's look at a concrete use case to understand the configuration options.

Use Case: Starting a Background Server

Goal: We want to start a server (npm start). We want it to run in the background so we can keep working, and we want to ensure it works on Windows.

Here is how we configure that using the schema:

{
  "type": "command",
  "command": "npm start",
  "shell": "powershell",
  "async": true,
  "timeout": 10
}

Let's break down the key concepts used here.

Concept 1: The Shell Interpreter (`shell`)

Operating systems speak different languages. Linux/macOS usually speak Bash (or Zsh). Windows usually speaks PowerShell.

If you write a script that uses grep (Linux search), it will fail on Windows standard command prompt.

Options: "bash" (default) or "powershell".
Why it matters: It ensures your command syntax is interpreted correctly by the underlying OS.

Concept 2: Timeouts (`timeout`)

This is your safety switch. It defines the maximum time (in seconds) a command is allowed to run.

Default: Usually runs until finished.
Why it matters: If a command asks for user input or enters an infinite loop, the timeout kills it automatically so your application doesn't hang forever.

Concept 3: Background Execution (`async`)

By default, the system blocks. It waits for the command to finish before moving to the next task.

async: true: The system fires the command and immediately moves on. It returns the "Process ID" (PID) instead of the command output.
Why it matters: Essential for long-running tasks like servers, watchers, or complex builds.

Concept 4: The Watchdog (`asyncRewake`)

This is a special, advanced feature.

asyncRewake: true: Like async, it runs in the background. However, if the command crashes (specifically with exit code 2), it "wakes up" the AI model.
Use Case: You start a server in the background. You go to sleep. The server crashes. The system wakes up the AI to investigate why it crashed.

Internal Implementation: Under the Hood

How does the system enforce these rules? Let's visualize the flow when a command is executed.

The Execution Sequence

sequenceDiagram participant Config as Configuration participant Val as Validation (Zod) participant Exec as Shell Executor participant OS as Operating System Config->>Val: Check Schema (Timeout? Shell?) Val-->>Exec: Validated Config Exec->>Exec: Select Interpreter (Bash/Pwsh) Exec->>OS: Spawn Process par Watchdog Exec->>Exec: Start Timer (Timeout) and Process OS-->>Exec: Running... end opt Timeout Exceeded Exec->>OS: Send Kill Signal (SIGTERM) end

Validation: Zod checks if your settings are valid numbers/strings.
Selection: The code checks shell to decide which executable to spawn (/bin/bash or pwsh.exe).
Watchdog: A timer starts alongside the process. If the timer hits zero, the process is killed.

Code Deep Dive

Let's look at the Zod definition in hooks.ts. We will break it into two parts for clarity.

Part 1: The Basics

This defines the mandatory fields for a command hook.

// hooks.ts (Snippet 1)
const BashCommandHookSchema = z.object({
  // The discriminator (must be "command")
  type: z.literal('command'), 
  
  // The actual script to run
  command: z.string(),
  
  // Which interpreter to use?
  shell: z.enum(['bash', 'powershell', 'zsh', 'sh']).optional(),
});

Explanation:

type: Must be strictly 'command'.
shell: Uses z.enum. This limits the user to only valid choices. You can't set shell to "minecraft".

Part 2: The Safety & Async Rules

This adds the control layers we discussed.

// hooks.ts (Snippet 2)
  // Safety limit in seconds
  timeout: z.number().positive().optional(),

  // Fire and forget?
  async: z.boolean().optional(),
  
  // Run in background, but report errors?
  asyncRewake: z.boolean().optional(),
});

Explanation:

timeout: Must be a .positive() number. You can't have a timeout of -5 seconds.
asyncRewake: This implies async. It's a boolean flag that alters the internal event listener logic to handle crash reports.

Summary

In this chapter, we learned:

Shell Integration is about more than just text; it's about managing the execution environment.
shell lets us choose between Bash and PowerShell to ensure compatibility.
timeout prevents infinite loops and hanging processes.
async allows us to run long tasks (like servers) without blocking the AI.

We now have a safe way to run commands. But sometimes, running a command isn't enough. Sometimes, we need the AI to verify that the command actually did what it was supposed to do.

How do we add a "Brain" to our hooks?

Next Chapter: AI Verification & Interaction

Generated by Code IQ

← Previous

Polymorphic Hook Definitions

Ai Verification & Interaction

Chapter 3: Shell & System Integration

Chapter 3: Shell & System Integration

The Problem: "It's not just about running a command"

Analogy: The Soldier's Orders

The Solution: BashCommandHook

Use Case: Starting a Background Server

Concept 1: The Shell Interpreter (shell)

Concept 2: Timeouts (timeout)

Concept 3: Background Execution (async)

Concept 4: The Watchdog (asyncRewake)

Internal Implementation: Under the Hood

The Execution Sequence

Code Deep Dive

Part 1: The Basics

Part 2: The Safety & Async Rules

Summary

The Solution: `BashCommandHook`

Concept 1: The Shell Interpreter (`shell`)

Concept 2: Timeouts (`timeout`)

Concept 3: Background Execution (`async`)

Concept 4: The Watchdog (`asyncRewake`)