Acts of Emergence

001: Agent/Request

A single, self-contained LLM invocation that takes a context and a schema and produces a solution.

The Request is the core computational primitive of the agent system. It provides a structured, reproducible, and extensible pipeline for interacting with an LLM, turning a rich context and a declarative schema into a precise, structured solution. Unlike a simple prompt, a Request is a complete, self-contained unit of work that serves as the engine for all higher-level agent capabilities.

The LLM processes the context to generate a solution that conforms to the schema.

[Diagram: User Input → Context; Context + Schema → Request; Request → LLM Output → Solution]

Context: A Pipeline of Messages

The foundation of a Request is its context: an array of Message objects. Each Message is a simple structure containing a role (like "system", "user", or "assistant") and content. This structure allows a rich, multi-turn conversation to be presented to the LLM.

Unlike a typical chat-based model where messages are continuously appended to a growing history, the context for each Request is a fully managed, standalone package. It is carefully constructed for a specific task and is not polluted by previous, unrelated interactions. The LLM's responses are not automatically added back; the context is rebuilt for each new computation. This ensures that the process is reproducible and that the LLM has the exact information it needs, without the risk of context truncation or "forgetting" crucial details.

This managed context is the primary mechanism for providing the LLM with prompts, data, and instructions. Everything required for the computation, except for the final output schema, is delivered through these messages.

A simple Message array might look like this:

[
  { "role": "system", "content": "You are a helpful assistant." },
  { "role": "user", "content": "What is the capital of France?" }
]
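
In code, the core structures of a Request might be sketched as follows (a minimal TypeScript sketch; the type names are illustrative, not a published API):

// A single entry in the context. The content is usually text, but
// can also be a structured custom content type (see the next section).
interface Message {
  role: "system" | "user" | "assistant";
  content: string | CustomContent;
}

// A custom content type: a tagged object in place of plain text.
interface CustomContent {
  type: string;
  [key: string]: unknown;
}

// The self-contained unit of work: a managed context plus a
// JSON Schema describing the shape of the desired solution.
interface Request {
  context: Message[];
  schema: object; // a JSON Schema document
}

// The solution is whatever JSON value conforms to the schema.
type Solution = unknown;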

Custom Message Types: Extending the Context

The system extends this basic Message structure by allowing the content field to hold not just text, but specialized objects called custom content types. For example, instead of a string, a message's content could be a structured object like { "type": "input", "input": { ... } }.

This capability makes the context the main point of extension within the system.

Each custom content type is registered with a handler, and these handlers form a processing pipeline. Before the Request is sent to the LLM, the pipeline processes each message in the context. A message's handler can dynamically modify the core components of the Request:

  • LLM Config: Adjusting parameters like the model, temperature, or other settings.
  • Schema: Modifying the JSON schema that the final output must conform to.
  • Context: Altering the final list of messages that will be sent to the LLM, for example by transforming the custom type into a text-based representation or adding new messages.

This powerful pipeline mechanism allows the agent to work with high-level, structured concepts, dynamically constructing the precise LLM invocation needed to perform a task.
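
To make the pipeline concrete, here is one possible wiring, building on the types sketched above (the handler registry, the RequestDraft shape, and the "input" handler are hypothetical examples, not the system's actual API):

// Everything a handler is allowed to modify before the LLM call.
interface RequestDraft {
  llmConfig: { model: string; temperature: number };
  schema: object;
  context: Message[];
}

// A handler receives the custom content and the current draft of the
// Request, and returns an updated draft.
type ContentHandler = (content: CustomContent, draft: RequestDraft) => RequestDraft;

const handlers = new Map<string, ContentHandler>();

// Example: an "input" content type that is rendered to text and also
// tightens the sampling temperature for deterministic processing.
handlers.set("input", (content, draft) => ({
  ...draft,
  llmConfig: { ...draft.llmConfig, temperature: 0 },
  context: draft.context.map((msg) =>
    msg.content === content
      ? { ...msg, content: "Input data:\n" + JSON.stringify(content.input ?? content, null, 2) }
      : msg
  ),
}));

// Run every message in the context through the pipeline.
function processContext(draft: RequestDraft): RequestDraft {
  return draft.context.reduce((acc, msg) => {
    if (typeof msg.content === "string") return acc;
    const handler = handlers.get(msg.content.type);
    return handler ? handler(msg.content, acc) : acc;
  }, draft);
}

Because each handler receives and returns the entire draft, a single content type can adjust the model configuration, the schema, and the message list in one pass.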

Schema: Guiding the Solution

The schema is a JSON Schema document that defines the exact structure of the desired solution. It can describe data of any shape, from simple strings to complex, nested objects. The LLM is constrained to generate a solution that strictly conforms to this schema, guaranteeing that the output is always well-structured and predictable.

As schemas grow in complexity, they can be designed to guide not only the final output but also the LLM's reasoning process. For example, a schema can include fields for the data itself, as well as separate fields that prompt the LLM to outline its reasoning, chain of thought, or confidence scores. This turns the schema into an active tool for shaping the generation process.

A core principle of this architecture is the composition of schemas. More complex capabilities are built by combining simpler, reusable schema components, allowing for a modular and scalable approach to defining the agent's knowledge and abilities.
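
As an illustration of both ideas, the sketch below composes a reusable reasoning component into a task schema (the field names thought_process, confidence, and answer are examples, not fixed conventions):

// A reusable schema component: meta-reasoning fields that can be
// composed into any task schema.
const reasoningFields = {
  thought_process: {
    type: "string",
    description: "Step-by-step reasoning, written before the answer.",
  },
  confidence: { type: "number", minimum: 0, maximum: 1 },
};

// A task schema composed from the reasoning component plus a
// task-specific field. Property order matters: thought_process comes
// first, so the LLM articulates its reasoning before committing to
// an answer.
const capitalQuestionSchema = {
  type: "object",
  properties: {
    ...reasoningFields,
    answer: { type: "string", description: "The capital city." },
  },
  required: ["thought_process", "answer"],
};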

Execution and the Solution

After the context is processed, the final array of messages and the schema are sent to the LLM in a single request. The LLM's response is the solution—a structured, JSON-based document that strictly conforms to the provided schema.
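
End to end, execution might look like the following sketch, building on the earlier ones (llmComplete is a hypothetical stand-in for whatever provider call the system actually makes):

// Hypothetical provider call: sends the messages and the JSON Schema,
// and returns the raw model output as a string.
declare function llmComplete(args: {
  model: string;
  messages: Message[];
  schema: object;
}): Promise<string>;

async function execute(draft: RequestDraft): Promise<Solution> {
  // 1. Run the content-type pipeline over the context.
  const processed = processContext(draft);

  // 2. Send the final messages and the schema in a single request.
  const raw = await llmComplete({
    model: processed.llmConfig.model,
    messages: processed.context,
    schema: processed.schema,
  });

  // 3. The response is the solution: JSON conforming to the schema.
  return JSON.parse(raw) as Solution;
}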

This process can be understood as the generation of a mini-narrative. Because an LLM operates as a next-token predictor, it generates the solution from top to bottom, following the structure of the schema. The order and design of the schema's fields have a direct impact on the narrative the LLM produces.

For example, if a schema requires a meta-reasoning field (e.g., "thought_process") before the field for the final data, the LLM must articulate its reasoning before producing the answer. That initial reasoning becomes part of the context that influences the generation of the subsequent data. By shaping the very path the LLM takes to reach a result, the schema's ordering gives us significant control over the final output.

Heads up

This entire Request pipeline—the context, the schema, and the resulting solution—forms a self-contained, reproducible unit. When saved, this unit is what the system refers to as a 101: Concept/Idea.

From Structured Output to Actionable Choices

A Request provides a robust mechanism for generating a single, schema-compliant solution. However, to build sophisticated agents, we need more than just structured output. We need a way to present the LLM with a menu of capabilities—distinct actions it can choose from to achieve a goal. This requires a system for defining these actions as discrete, selectable units.

The next document, 002: Agent/Tool, introduces the protocol for defining these capabilities.