UNPKG

@mastra/core

Version:

Mastra is a framework for building AI-powered applications and agents with a modern TypeScript stack.

1,031 lines (723 loc) 47.9 kB
# Processor interface The `Processor` interface defines the contract for all processors in Mastra. Processors can implement one or more methods to handle different stages of the agent execution pipeline. ## When processor methods run The eight processor methods run at different points in the agent execution lifecycle: ```text ┌────────────────────────────────────────────────────────────────────┐ │ Agent Execution Flow │ ├────────────────────────────────────────────────────────────────────┤ │ │ │ User Input │ │ │ │ │ ▼ │ │ ┌────────────────────────┐ │ │ │ processInput │ ← Runs ONCE at start │ │ └───────────┬────────────┘ │ │ │ │ │ ▼ │ │ ┌──────────────────────────────────────────────────────────────┐ │ │ │ Agentic Loop │ │ │ │ │ │ │ │ ┌────────────────────────┐ │ │ │ │ │ processInputStep │ ← Runs at EACH step │ │ │ │ └───────────┬────────────┘ │ │ │ │ │ │ │ │ │ ▼ │ │ │ │ ┌────────────────────────┐ │ │ │ │ │ processLLMRequest │ ← Before provider call │ │ │ │ └───────────┬────────────┘ │ │ │ │ │ │ │ │ │ ▼ │ │ │ │ LLM Execution ──── API Error? ───┐ │ │ │ │ │ │ │ │ │ │ │ ┌───────────┴──────────┐ │ │ │ │ │ │ processAPIError │ │ │ │ │ │ └──────────────────────┘ │ │ │ │ │ (retry loops back to LLM) │ │ │ │ ▼ │ │ │ │ ┌────────────────────────┐ │ │ │ │ │ processOutputStream │ ← Runs on EACH stream chunk │ │ │ │ └───────────┬────────────┘ │ │ │ │ │ │ │ │ │ ▼ │ │ │ │ ┌────────────────────────┐ │ │ │ │ │ processLLMResponse │ ← After stream completes │ │ │ │ └───────────┬────────────┘ │ │ │ │ │ │ │ │ │ ▼ │ │ │ │ ┌────────────────────────┐ │ │ │ │ │ processOutputStep │ ← Runs after EACH LLM step │ │ │ │ └───────────┬────────────┘ │ │ │ │ │ │ │ │ │ ▼ │ │ │ │ Tool Execution (if needed) │ │ │ │ │ │ │ │ │ └──────── Loop back if tools called ────────────│ │ │ │ │ │ │ └──────────────────────────────────────────────────────────────┘ │ │ │ │ │ ▼ │ │ ┌────────────────────────┐ │ │ │ processOutputResult │ ← Runs ONCE after completion │ │ └────────────────────────┘ │ │ │ │ │ ▼ │ │ Final Response │ │ │ └────────────────────────────────────────────────────────────────────┘ ``` | Method | When it runs | Use case | | --------------------- | ------------------------------------------------------------ | ---------------------------------------------------------------------------------------------- | | `processInput` | Once at the start, before the agentic loop | Validate/transform initial user input, add context | | `processInputStep` | At each step of the agentic loop, before each LLM call | Transform messages between steps, handle tool results | | `processLLMRequest` | After LLM request conversion, before the provider call | Rewrite the outbound `LanguageModelV2Prompt` for the current call without persisting changes | | `processAPIError` | When an LLM API call fails | Inspect API rejections, optionally mutate state/messages, and request a retry | | `processOutputStream` | On each streaming chunk during LLM response | Filter/modify streaming content, detect patterns in real-time | | `processLLMResponse` | After the LLM step completes and stream chunks are collected | Capture or cache the full response, run post-call side effects paired with `processLLMRequest` | | `processOutputStep` | After each LLM response, before tool execution | Validate output quality, implement guardrails with retry | | `processOutputResult` | Once after generation completes | Post-process final response, log results | ## Interface definition ```typescript interface Processor<TId extends string = string, TTripwireMetadata = unknown> { readonly id: TId readonly name?: string readonly description?: string /** Index of this processor in the workflow (set at runtime when combining processors). */ processorIndex?: number /** When true, processOutputStream also receives `data-*` chunks. Default: false. */ processDataParts?: boolean /** Callback invoked when this processor detects a violation, regardless of strategy. */ onViolation?: (violation: ProcessorViolation) => void | Promise<void> processInput?( args: ProcessInputArgs<TTripwireMetadata>, ): Promise<ProcessInputResult> | ProcessInputResult processInputStep?( args: ProcessInputStepArgs<TTripwireMetadata>, ): | Promise<ProcessInputStepResult | MessageList | MastraDBMessage[] | undefined | void> | ProcessInputStepResult | MessageList | MastraDBMessage[] | void | undefined processLLMRequest?( args: ProcessLLMRequestArgs<TTripwireMetadata>, ): Promise<ProcessLLMRequestResult> | ProcessLLMRequestResult processLLMResponse?( args: ProcessLLMResponseArgs<TTripwireMetadata>, ): Promise<ProcessLLMResponseResult> | ProcessLLMResponseResult processAPIError?( args: ProcessAPIErrorArgs<TTripwireMetadata>, ): Promise<ProcessAPIErrorResult | void> | ProcessAPIErrorResult | void processOutputStream?( args: ProcessOutputStreamArgs<TTripwireMetadata>, ): Promise<ChunkType | null | undefined> processOutputStep?(args: ProcessOutputStepArgs<TTripwireMetadata>): ProcessorMessageResult processOutputResult?(args: ProcessOutputResultArgs<TTripwireMetadata>): ProcessorMessageResult } ``` ## Properties **id** (`string`): Unique identifier for the processor. Used for tracing and debugging. **name** (`string`): Optional display name for the processor. Falls back to id if not provided. **description** (`string`): Optional human-readable description shown in tracing and Studio. **processorIndex** (`number`): Position of the processor in the combined processor list. Set at runtime by Mastra when processors are merged with memory, workspace, and per-call overrides. You do not set this yourself. **processDataParts** (`boolean`): When true, the processOutputStream method also receives \`data-\*\` chunks emitted by tools via writer.custom(). Defaults to false. **onViolation** (`(violation: ProcessorViolation) => void | Promise<void>`): Optional callback invoked when the processor detects a policy violation, regardless of strategy (block or warn). Use for side effects like alerting, logging to external systems, or emailing users. Errors thrown by this callback are silently caught to prevent interfering with processor logic. The violation object contains processorId, message, and a processor-specific detail field. ## Message arguments Most processor methods receive both `messages` and `messageList`. They point to the same underlying conversation but expose it differently. ### `messages` vs `messageList` - `messages`: A plain array of `MastraDBMessage` objects, scoped to the current stage. For `processInput` and `processInputStep` this excludes system messages. For `processOutputResult` and `processOutputStep` this includes the latest LLM response. The array is backed by `messageList`, so editing a message's `content.parts` in place is visible to downstream processors and to persistence. - `messageList`: The live `MessageList` instance backing the run. It exposes filtered views (input, response, remembered, all), multiple output formats (db, ui, core), and methods for mutating the conversation. Use `messages` when you only need to read, map over, or lightly edit fields on the current stage's messages. Use `messageList` when you need to: - Read messages from another stage, for example input messages while processing output. - Add, remove, or replace whole messages. - Convert to another format such as UI or core messages for a third-party API. `messages` is always derived from `messageList`, so mutating `messageList` is the canonical way to add, remove, or reorder messages. For in-place edits to a message's content (for example, rewriting `content.parts`), mutating `messages` directly is equivalent. If you return a new array from `messages`, Mastra reconciles it against `messageList` for the current stage. ### Persistence When memory is enabled, only what ends up in `messageList` after all processors finish is persisted to storage. The two return styles are equivalent for persistence: - Mutating `messageList` directly (or returning the same `MessageList` instance) — recorded mutations are applied in place, so the saved conversation reflects your changes. - Returning a `MastraDBMessage[]` or `{ messages, systemMessages }` — Mastra reconciles the returned array against `messageList` for the current stage, removing missing messages and replacing system messages. Returning a different `MessageList` instance is an error; always mutate the one passed to your processor. ### Reading text from a message `MastraDBMessage.content` is a structured object, not a string. The canonical way to read user or assistant text is `content.parts`: ```typescript import type { MastraDBMessage } from '@mastra/core/memory' function getText(message: MastraDBMessage): string { let text = '' if (message.content.parts) { for (const part of message.content.parts) { if (part.type === 'text' && typeof part.text === 'string') { text += part.text } } } // Fallback for legacy messages that only have the flattened `content` string if (!text && typeof message.content.content === 'string') { text = message.content.content } return text } ``` Key points: - `message.content.parts` is the primary source. A single message can contain multiple parts, including non-text parts such as tool calls, tool results, and file parts. Filter by `part.type === 'text'` before reading `part.text`. - `message.content.content` is a flattened string kept for backward compatibility. Use it only as a fallback when `parts` is empty or missing. - `message.content` itself is never a plain string on `MastraDBMessage`. Legacy `CoreMessage` shapes may be strings, but processors always receive `MastraDBMessage`. ## Methods ### `processInput` Processes input messages before they're sent to the LLM. Runs once at the start of agent execution. ```typescript processInput?(args: ProcessInputArgs): Promise<ProcessInputResult> | ProcessInputResult; ``` #### `ProcessInputArgs` **messages** (`MastraDBMessage[]`): User and assistant messages to process (excludes system messages). **systemMessages** (`CoreMessage[]`): All system messages (agent instructions, memory context, user-provided). Can be modified and returned. **messageList** (`MessageList`): Full MessageList instance for advanced message management. **abort** (`(reason?: string, options?: { retry?: boolean; metadata?: unknown }) => never`): Function to abort processing. Throws a TripWire error that stops execution. Pass \`retry: true\` to request the LLM retry the step with feedback. **retryCount** (`number`): Number of times processors have triggered retry for this generation. Use this to limit retry attempts. Always passed by Mastra; starts at 0. **tracingContext** (`TracingContext`): Tracing context for observability. **requestContext** (`RequestContext`): Request-scoped context with execution metadata like threadId and resourceId. #### `ProcessInputResult` The method can return one of three types: **MastraDBMessage\[]** (`array`): Transformed messages array. System messages remain unchanged. **MessageList** (`MessageList`): The same messageList instance passed in. Indicates you've mutated it directly. **{ messages, systemMessages }** (`object`): Object with both transformed messages and modified system messages. *** ### `processInputStep` Processes input messages at each step of the agentic loop, before they're sent to the LLM. Unlike `processInput` which runs once at the start, this runs at every step including tool call continuations. ```typescript processInputStep?<TTripwireMetadata = unknown>( args: ProcessInputStepArgs<TTripwireMetadata>, ): | Promise<ProcessInputStepResult | MessageList | MastraDBMessage[] | void | undefined> | ProcessInputStepResult | MessageList | MastraDBMessage[] | void | undefined; ``` #### Execution order in the agentic loop 1. `processInput` (once at start) 2. `processInputStep` from inputProcessors (at each step, before LLM call) 3. `prepareStep` callback (runs as part of the processInputStep pipeline, after inputProcessors) 4. `processLLMRequest` from inputProcessors (after prompt conversion, before the provider call) 5. LLM execution 6. `processOutputStream` from outputProcessors (on each streaming chunk) 7. `processLLMResponse` from inputProcessors (after stream completes, pairs with `processLLMRequest`) 8. `processOutputStep` from outputProcessors (after LLM response, before tool execution) 9. Tool execution (if needed) 10. Repeat from step 2 if tools were called #### `ProcessInputStepArgs` **messages** (`MastraDBMessage[]`): All messages including tool calls and results from previous steps (read-only snapshot). **messageList** (`MessageList`): MessageList instance for managing messages. Can mutate directly or return in result. **stepNumber** (`number`): Current step number (0-indexed). Step 0 is the initial LLM call. **steps** (`StepResult[]`): Results from previous steps, including text, toolCalls, and toolResults. **systemMessages** (`CoreMessage[]`): All system messages (read-only snapshot). Return in result to replace. **model** (`MastraLanguageModelV2`): Current model being used. Return a different model in result to switch. **toolChoice** (`ToolChoice`): Current tool choice setting ('auto', 'none', 'required', or specific tool). **activeTools** (`string[]`): Currently active tool names. Return filtered array to limit tools. **tools** (`ToolSet`): Current tools available for this step. Return in result to add/replace tools. **providerOptions** (`SharedV2ProviderOptions`): Provider-specific options (e.g., Anthropic cacheControl, OpenAI reasoningEffort). **modelSettings** (`CallSettings`): Model settings like temperature, maxTokens, topP. **structuredOutput** (`StructuredOutputOptions`): Structured output configuration (schema, output mode). Return in result to modify. **abort** (`(reason?: string, options?: { retry?: boolean; metadata?: unknown }) => never`): Function to abort processing. Throws a TripWire error that stops execution. Pass \`retry: true\` to request the LLM retry the step with feedback. **retryCount** (`number`): Current retry attempt count from \`ProcessorContext\`. Starts at \`0\`; use to cap processor-triggered retries. **tracingContext** (`TracingContext`): Tracing context for observability. **requestContext** (`RequestContext`): Request-scoped context with execution metadata. #### `ProcessInputStepResult` `processInputStep` can return several shapes: - **`ProcessInputStepResult` object** — override any combination of the properties below for this step (described next). - **`MessageList`** — return the same `messageList` instance to signal you mutated messages in place. - **`MastraDBMessage[]`** — return a transformed messages array; replaces the step's messages. - **`void` or `undefined`** — return nothing to leave the step unchanged. The object form can return any combination of these properties: **model** (`LanguageModelV2 | string`): Change the model for this step. Can be a model instance or router ID like 'openai/gpt-5.4'. **toolChoice** (`ToolChoice`): Change tool selection behavior for this step. **activeTools** (`string[]`): Filter which tools are available for this step. **tools** (`ToolSet`): Replace or modify tools for this step. Use spread to merge: { tools: { ...tools, newTool } }. **messages** (`MastraDBMessage[]`): Replace all messages. Cannot be used with messageList. **messageList** (`MessageList`): Return the same messageList instance (indicates you mutated it). Cannot be used with messages. **systemMessages** (`CoreMessage[]`): Replace all system messages for this step only. **providerOptions** (`SharedV2ProviderOptions`): Change provider-specific options for this step. **modelSettings** (`CallSettings`): Change model settings for this step. **structuredOutput** (`StructuredOutputOptions`): Change structured output configuration for this step. #### Processor chaining When multiple processors implement `processInputStep`, they run in order and changes chain through: ```text Processor 1: receives { model: 'gpt-5.4' } → returns { model: 'gpt-5.4-mini' } Processor 2: receives { model: 'gpt-5.4-mini' } → returns { toolChoice: 'none' } Final: model = 'gpt-5.4-mini', toolChoice = 'none' ``` #### System message isolation System messages are **reset to their original values** at the start of each step. Modifications made in `processInputStep` only affect the current step, not subsequent steps. #### Use cases - Dynamic model switching based on step number or context - Disabling tools after a certain number of steps - Dynamically adding or replacing tools based on conversation context - Transforming message part types between providers (e.g., `reasoning` → `thinking` for Anthropic) - Modifying messages based on step number or accumulated context - Adding step-specific system instructions - Adjusting provider options per step (e.g., cache control) - Modifying structured output schema based on step context *** ### `processLLMRequest` Processes the final LLM request after Mastra converts the `MessageList` into `LanguageModelV2Prompt` and before the provider call. Use this method for transient, model-aware rewrites that should affect only the current outbound request. Returned prompt changes are forwarded to the model for the current call only. They are not persisted back to `MessageList`, memory, UI history, or later provider calls. ```typescript processLLMRequest?( args: ProcessLLMRequestArgs, ): Promise<ProcessLLMRequestResult> | ProcessLLMRequestResult; ``` #### `ProcessLLMRequestArgs` **prompt** (`LanguageModelV2Prompt`): The LLM request prompt that will be sent to the provider for this call. **model** (`MastraLanguageModel`): The resolved model that will receive the prompt. Use this to scope provider-specific rewrites. **stepNumber** (`number`): Current step number (0-indexed). Step 0 is the initial LLM call. **steps** (`StepResult[]`): Results from previous steps, including text, toolCalls, and toolResults. **state** (`Record<string, unknown>`): Per-processor state that persists across all method calls within this request. **abort** (`(reason?: string, options?: { retry?: boolean; metadata?: unknown }) => never`): Function to abort processing. Throws a TripWire error that stops execution and emits a \`tripwire\` chunk. **retryCount** (`number`): Current retry attempt count from \`ProcessorContext\`. Starts at \`0\`; use to cap processor-triggered retries. **requestContext** (`RequestContext`): Request-scoped context with execution metadata. **tracingContext** (`TracingContext`): Tracing context for observability. **writer** (`ProcessorStreamWriter`): Stream writer for emitting custom data chunks during streaming. Use \`writer.custom()\` to send transient UI signals. **abortSignal** (`AbortSignal`): Signal for cancelling the operation. #### Return value `processLLMRequest` returns `ProcessLLMRequestResult`, which is `{ prompt?: LanguageModelV2Prompt } | undefined | void`. - Return `{ prompt }` to replace the outbound prompt for the current provider call. - Return `undefined` or `void` to forward the original prompt unchanged. #### Use cases - Removing or reshaping provider-specific prompt parts before a model call - Normalizing roles or content to match a provider's input requirements - Adapting tool result formats when switching providers mid-loop *** ### `processLLMResponse` Processes the LLM response after the step completes (or after a cached response is replayed) and after output processors have collected the response chunks. This hook pairs with `processLLMRequest`: use `processLLMRequest` to stash state (such as a cache key) before the provider call, and `processLLMResponse` to act on the completed response (such as writing it to a cache). The `state` object is the same instance passed to `processLLMRequest` for the same step, so processors can correlate pre- and post-call work. ```typescript processLLMResponse?( args: ProcessLLMResponseArgs, ): Promise<ProcessLLMResponseResult> | ProcessLLMResponseResult; ``` #### `ProcessLLMResponseArgs` **chunks** (`CachedLLMStepChunk[]`): Chunks produced by the LLM call (or replayed from cache) for this step, in stripped form (\`{ type, payload }\`). **model** (`MastraLanguageModel`): The model that produced (or would have produced) the response. **stepNumber** (`number`): Current step number (0-indexed). **steps** (`StepResult[]`): All completed steps so far, including this step. **state** (`Record<string, unknown>`): Per-processor state shared with \`processLLMRequest\` for the same step. Use this to pass data between the two hooks (e.g. a cache key). **fromCache** (`boolean`): When \`true\`, the response was replayed from a cache via \`processLLMRequest\` returning \`{ response }\`. Processors that write to a cache should skip writes when this is \`true\`. **warnings** (`LanguageModelV2CallWarning[]`): Warnings reported by the language model call (e.g. unsupported settings). **request** (`unknown`): Provider request body, when available. Useful for tracing. **rawResponse** (`unknown`): Raw provider response, when available. Useful for tracing. **abort** (`(reason?: string, options?: { retry?: boolean; metadata?: unknown }) => never`): Function to abort processing. Throws a TripWire error that stops execution. **retryCount** (`number`): Current retry attempt count. Starts at \`0\`; use to cap processor-triggered retries. **requestContext** (`RequestContext`): Request-scoped context with execution metadata. **tracingContext** (`TracingContext`): Tracing context for observability. **writer** (`ProcessorStreamWriter`): Stream writer for emitting custom data chunks. **abortSignal** (`AbortSignal`): Signal for cancelling the operation. #### Return value `processLLMResponse` returns `ProcessLLMResponseResult`, which is `undefined | void`. The return value is reserved for future extensibility. #### Use cases - Writing LLM responses to a cache after a live call (paired with cache-key derivation in `processLLMRequest`) - Logging or recording the full response for analytics - Triggering side effects based on the completed response *** ### `processAPIError` Handles LLM API rejection errors before they surface as final errors. This runs when the API call fails with a non-retryable error (such as a 400 or 422 status code). Unlike `processOutputStep` which runs after successful responses, this runs when the API rejects the request. Add processors that implement `processAPIError` to an agent's `errorProcessors` array. Processors can inspect the error, modify the request (for example, by appending messages to the `messageList`), and return `{ retry: true }` to signal a retry with the modified state. ```typescript processAPIError?(args: ProcessAPIErrorArgs): Promise<ProcessAPIErrorResult | void> | ProcessAPIErrorResult | void; ``` #### `ProcessAPIErrorArgs` **error** (`unknown`): The error that occurred during the LLM API call. **messages** (`MastraDBMessage[]`): All messages at the time of the error. **messageList** (`MessageList`): MessageList instance for managing messages. Modify this to change the request before retry. **stepNumber** (`number`): Current step number (0-indexed). **steps** (`StepResult[]`): All completed steps so far. **state** (`Record<string, unknown>`): Per-processor state that persists across all method calls within this request. **retryCount** (`number`): The current retry count for error handlers. Use this to limit retry attempts. **abort** (`(reason?: string, options?: { retry?: boolean; metadata?: unknown }) => never`): Function to abort processing. **writer** (`ProcessorStreamWriter`): Stream writer for emitting custom data chunks during streaming. Use \`writer.custom()\` to send transient UI signals. **requestContext** (`RequestContext`): Request context passed through from the agent call. **abortSignal** (`AbortSignal`): Signal for cancelling the operation. #### `ProcessAPIErrorResult` **retry** (`boolean`): Whether to retry the LLM call after applying modifications. #### Use cases - Handling API-specific rejections by modifying the request and retrying - Converting non-retryable errors into retryable ones with request modifications - Implementing model-specific error recovery strategies #### Example: Custom error recovery ```typescript import { APICallError } from '@ai-sdk/provider' import type { Processor, ProcessAPIErrorArgs, ProcessAPIErrorResult } from '@mastra/core/processors' export class ErrorRecoveryProcessor implements Processor { id = 'error-recovery' processAPIError({ error, messageList, retryCount, }: ProcessAPIErrorArgs): ProcessAPIErrorResult | void { // Only retry once if (retryCount > 0) return // Check for a specific API error if (APICallError.isInstance(error) && error.message.includes('context length exceeded')) { // Trim older messages to fit within context const messages = messageList.get.all.db() if (messages.length > 4) { messageList.removeByIds([messages[1]!.id, messages[2]!.id]) return { retry: true } } } } } ``` *** ### `processOutputStream` Processes streaming output chunks with built-in state management. Allows processors to accumulate chunks and make decisions based on larger context. ```typescript processOutputStream?(args: ProcessOutputStreamArgs): Promise<ChunkType | null | undefined>; ``` #### `ProcessOutputStreamArgs` **part** (`ChunkType`): The current stream chunk being processed. **streamParts** (`ChunkType[]`): All chunks seen so far in the stream. **state** (`Record<string, unknown>`): Mutable per-processor state that persists across every chunk and every method call within a single request. A fresh state object is created for each new generate or stream call. **abort** (`(reason?: string, options?: { retry?: boolean; metadata?: unknown }) => never`): Function to abort the stream. Throws a TripWire error that ends the stream and emits a \`tripwire\` chunk. Pass \`retry: true\` to request another LLM attempt instead of ending. **retryCount** (`number`): Current retry attempt count from \`ProcessorContext\`. Starts at \`0\`; use to cap processor-triggered retries. **messageList** (`MessageList`): MessageList instance for accessing conversation history. **tracingContext** (`TracingContext`): Tracing context for observability. **requestContext** (`RequestContext`): Request-scoped context with execution metadata. **writer** (`ProcessorStreamWriter`): Stream writer for emitting custom data chunks back to the client. Call writer.custom() to emit data-\* typed chunks. Available during streaming. #### Return value `processOutputStream` returns `Promise<ChunkType | null | undefined>`. - Return the `ChunkType` to emit the chunk. Return the original `part` to emit it unchanged, or a new `ChunkType` to emit a modified chunk. - Return `null` to drop the chunk. Nothing is sent to the next processor or the client. - Return `undefined` (including implicit `undefined` from a `return;` statement or a method that falls off the end) to drop the chunk. `null` and `undefined` behave the same way. Dropping a chunk only affects that single chunk. The stream continues and the next chunk is still processed. To stop the stream entirely, call `abort()`. *** ### `processOutputResult` Processes the complete output result after streaming or generation is finished. ```typescript processOutputResult?(args: ProcessOutputResultArgs): ProcessorMessageResult; ``` #### `ProcessOutputResultArgs` **messages** (`MastraDBMessage[]`): The generated response messages. **messageList** (`MessageList`): MessageList instance for managing messages. **state** (`Record<string, unknown>`): Per-processor state that persists across all method calls within this request. Shared with processOutputStream and other methods. **result** (`OutputResult`): Resolved generation result containing \`text\` (accumulated text), \`usage\` (token usage with inputTokens, outputTokens, totalTokens), \`finishReason\` (why generation ended), and \`steps\` (all LLM step results, each with toolCalls, toolResults, reasoning, sources, files, etc.). **abort** (`(reason?: string, options?: { retry?: boolean; metadata?: unknown }) => never`): Function to abort processing. Throws a TripWire error that stops execution and emits a \`tripwire\` chunk. **retryCount** (`number`): Current retry attempt count from \`ProcessorContext\`. Starts at \`0\`; use to cap processor-triggered retries. **tracingContext** (`TracingContext`): Tracing context for observability. **requestContext** (`RequestContext`): Request-scoped context with execution metadata. **writer** (`ProcessorStreamWriter`): Stream writer for emitting custom data chunks back to the client. Call writer.custom() to emit data-\* typed chunks. Available during streaming. *** ### `processOutputStep` Processes output after each LLM response in the agentic loop, before tool execution. Unlike `processOutputResult` which runs once at the end, this runs at every step. This is the ideal method for implementing guardrails that can trigger retries. ```typescript processOutputStep?(args: ProcessOutputStepArgs): ProcessorMessageResult; ``` #### `ProcessOutputStepArgs` **messages** (`MastraDBMessage[]`): All messages including the latest LLM response. **messageList** (`MessageList`): MessageList instance for managing messages. **stepNumber** (`number`): Current step number (0-indexed). **finishReason** (`string`): The finish reason from the LLM (stop, tool-use, length, etc.). **toolCalls** (`ToolCallInfo[]`): Tool calls made in this step (if any). **text** (`string`): Generated text from this step. **usage** (`LanguageModelUsage`): Token usage for the current step (\`inputTokens\`, \`outputTokens\`, \`totalTokens\`). **systemMessages** (`CoreMessage[]`): All system messages for read/modify access. **steps** (`StepResult[]`): All completed steps so far, including the current step. **state** (`Record<string, unknown>`): Per-processor state that persists across all method calls within this request. Shared with processOutputStream and processOutputResult. **abort** (`(reason?: string, options?: { retry?: boolean; metadata?: unknown }) => never`): Function to abort processing. Pass \`retry: true\` to request the LLM retry the step. **retryCount** (`number`): Number of times processors have triggered retry. Use this to limit retry attempts. Always passed by Mastra; starts at 0. **tracingContext** (`TracingContext`): Tracing context for observability. **requestContext** (`RequestContext`): Request-scoped context with execution metadata. #### Use cases - Implementing quality guardrails that can request retries - Validating LLM output before tool execution - Adding per-step logging or metrics - Implementing output moderation with retry capability #### Example: Quality guardrail with retry ```typescript import type { Processor } from '@mastra/core/processors' export class QualityGuardrail implements Processor { id = 'quality-guardrail' async processOutputStep({ text, abort, retryCount }) { const score = await evaluateResponseQuality(text) if (score < 0.7) { if (retryCount < 3) { // Request retry with feedback for the LLM abort('Response quality too low. Please provide more detail.', { retry: true, metadata: { qualityScore: score }, }) } else { // Max retries reached, block the response abort('Response quality too low after multiple attempts.') } } return [] } } ``` ## Processor types Mastra provides type aliases to ensure processors implement the required methods: ```typescript // Must implement processInput, processInputStep, processLLMRequest, or processLLMResponse (or any combination) type InputProcessor = Processor & ( | { processInput: required } | { processInputStep: required } | { processLLMRequest: required } | { processLLMResponse: required } ) // Must implement processOutputStream, processOutputStep, OR processOutputResult (or any combination) type OutputProcessor = Processor & ( | { processOutputStream: required } | { processOutputStep: required } | { processOutputResult: required } ) // Must implement processAPIError type ErrorProcessor = Processor & { processAPIError: required } ``` Configure processors that implement `processAPIError` in `errorProcessors`: ```typescript const agent = new Agent({ // ... errorProcessors: [new PrefillErrorHandler()], }) ``` ## Usage examples ### Basic input processor ```typescript import type { Processor } from '@mastra/core/processors' import type { MastraDBMessage } from '@mastra/core/memory' export class LowercaseProcessor implements Processor { id = 'lowercase' async processInput({ messages }): Promise<MastraDBMessage[]> { return messages.map(msg => ({ ...msg, content: { ...msg.content, parts: msg.content.parts?.map(part => part.type === 'text' ? { ...part, text: part.text.toLowerCase() } : part, ), }, })) } } ``` ### Per-step processor with `processInputStep` ```typescript import type { Processor, ProcessInputStepArgs, ProcessInputStepResult, } from '@mastra/core/processors' export class DynamicModelProcessor implements Processor { id = 'dynamic-model' async processInputStep({ stepNumber, steps, toolChoice, }: ProcessInputStepArgs): Promise<ProcessInputStepResult> { // Use a fast model for initial response if (stepNumber === 0) { return { model: 'openai/gpt-5-mini' } } // Switch to powerful model after tool calls if (steps.length > 0 && steps[steps.length - 1].toolCalls?.length) { return { model: 'openai/gpt-5.4' } } // Disable tools after 5 steps to force completion if (stepNumber > 5) { return { toolChoice: 'none' } } return {} } } ``` ### Message transformer with `processInputStep` ```typescript import type { Processor } from '@mastra/core/processors' import type { MastraDBMessage } from '@mastra/core/memory' export class ReasoningTransformer implements Processor { id = 'reasoning-transformer' async processInputStep({ messages, messageList }) { // Transform reasoning parts to thinking parts at each step // This is useful when switching between model providers for (const msg of messages) { if (msg.role === 'assistant' && msg.content.parts) { for (const part of msg.content.parts) { if (part.type === 'reasoning') { ;(part as any).type = 'thinking' } } } } return messageList } } ``` ### Hybrid processor (input and output) ```typescript import type { Processor } from '@mastra/core/processors' import type { MastraDBMessage } from '@mastra/core/memory' import type { ChunkType } from '@mastra/core/stream' export class ContentFilter implements Processor { id = 'content-filter' private blockedWords: string[] constructor(blockedWords: string[]) { this.blockedWords = blockedWords } async processInput({ messages, abort }): Promise<MastraDBMessage[]> { for (const msg of messages) { const text = msg.content.parts ?.filter(p => p.type === 'text') .map(p => p.text) .join(' ') if (this.blockedWords.some(word => text?.includes(word))) { abort('Blocked content detected in input') } } return messages } async processOutputStream({ part, abort }): Promise<ChunkType | null> { if (part.type === 'text-delta') { if (this.blockedWords.some(word => part.payload.text.includes(word))) { abort('Blocked content detected in output') } } return part } } ``` ### Stream accumulator with state ```typescript import type { Processor } from '@mastra/core/processors' import type { ChunkType } from '@mastra/core/stream' export class WordCounter implements Processor { id = 'word-counter' async processOutputStream({ part, state }): Promise<ChunkType> { // Initialize state on first chunk if (!state.wordCount) { state.wordCount = 0 } // Count words in text chunks if (part.type === 'text-delta') { const words = part.payload.text.split(/\s+/).filter(Boolean) state.wordCount += words.length } // Log word count on finish if (part.type === 'finish') { console.log(`Total words: ${state.wordCount}`) } return part } } ``` ## State lifecycle Every processor receives a `state` object in `processLLMRequest`, `processLLMResponse`, `processOutputStream`, `processOutputStep`, `processOutputResult`, and `processAPIError`. State has three important properties: - **Per-processor**: Each processor gets its own `state` object, keyed by the processor's `id`. Two processors with different ids cannot read or overwrite each other's state. - **Per-request**: A fresh state object is created at the start of every `agent.generate()` or `agent.stream()` call. State does not leak between requests or between users. - **Shared across methods**: Within one request, the same `state` object is passed to `processLLMRequest` (before the provider call), `processLLMResponse` (after the step completes), `processOutputStream` (for every chunk), `processOutputStep` (after every LLM step), `processOutputResult` (once at the end), and `processAPIError` (when an LLM call fails). For example, `processLLMRequest` can stash a cache key and `processLLMResponse` can read it back to write the response. Initialize fields defensively on first access, because `state` starts as an empty object: ```typescript import type { Processor } from '@mastra/core/processors' export class WordCounter implements Processor { id = 'word-counter' async processOutputStream({ part, state }) { state.wordCount ??= 0 if (part.type === 'text-delta') { state.wordCount += part.payload.text.split(/\s+/).filter(Boolean).length } return part } } ``` ## Aborting and tripwire chunks The `abort` function on each method throws a `TripWire` error that stops processing and emits a `tripwire` chunk on the output stream. Clients can detect the chunk to distinguish a blocked response from a normal finish. ```typescript abort('Blocked content detected', { retry: false, metadata: { category: 'pii' } }) ``` - `reason`: A human-readable explanation. Appears as `tripwire.payload.reason`. - `retry`: When `true`, the agent retries the same step with `reason` fed back as feedback. Retries only run when `maxProcessorRetries` is set on the agent or call; otherwise the request aborts. When `errorProcessors` are configured, `maxProcessorRetries` defaults to `10` for that call. - `metadata`: Optional structured data attached to the `tripwire` chunk for downstream consumers. The emitted `tripwire` chunk has the shape: ```typescript type TripwireChunk = { type: 'tripwire' runId: string from: 'AGENT' payload: { reason: string retry?: boolean metadata?: unknown processorId: string } } ``` In non-streaming calls (`agent.generate()`), the result exposes the same information as `result.tripwire` and `result.finishReason === 'other'`. ## Emitting custom data chunks Processors with access to `writer` can stream custom `data-*` chunks to the client by calling `writer.custom(chunk)`. Tools can do the same through their own writer. This is the only way for a processor to emit content outside of normal text and tool chunks. ```typescript await writer.custom({ type: 'data-moderation', runId, from: 'AGENT', data: { level: 'warn', reason: 'Possibly unsafe' }, }) ``` By default, processors do **not** see `data-*` chunks in `processOutputStream` so they don't accidentally process tool telemetry or their own output. Opt in by setting `processDataParts: true` on the processor: ```typescript class ModerationCollector implements Processor { id = 'moderation-collector' processDataParts = true async processOutputStream({ part, state }) { if (part.type === 'data-moderation') { state.warnings ??= [] state.warnings.push(part.data) } return part } } ``` The chunk `type` must start with `data-` to be treated as a custom data chunk. Returning `null` or `undefined` from `processOutputStream` still drops the chunk, so a processor can inspect, modify, or filter custom data the same way it filters text chunks. ## Configuring processors on an agent Processors are attached to an agent through three arrays: ```typescript import { Agent } from '@mastra/core/agent' import { PrefillErrorHandler } from '@mastra/core/processors' const agent = new Agent({ name: 'support-agent', model: 'openai/gpt-5', instructions: '...', inputProcessors: [new ContentFilter(['secret'])], outputProcessors: [new WordCounter()], errorProcessors: [new PrefillErrorHandler()], maxProcessorRetries: 3, }) ``` - `inputProcessors`: Run before the LLM. Receives input messages. - `outputProcessors`: Run during or after the LLM response. Receives output chunks or messages. - `errorProcessors`: Run when the LLM API call throws. Receives the raw error. Each array also accepts a function so processors can be built per-request from `RequestContext`: ```typescript new Agent({ // ... inputProcessors: ({ requestContext }) => { const blockedWords = requestContext.get('blockedWords') ?? [] return [new ContentFilter(blockedWords)] }, }) ``` ### Per-call overrides `agent.generate()` and `agent.stream()` accept `inputProcessors`, `outputProcessors`, `errorProcessors`, and `maxProcessorRetries`. When any processor array is set on the call, it **replaces** the matching array configured on the agent for that request. Memory, workspace, skill, channel, and browser processors that Mastra adds automatically are always preserved and run around your array. ```typescript await agent.stream('Summarize this', { outputProcessors: [new StreamFilter()], maxProcessorRetries: 5, }) ``` `maxProcessorRetries` passed on the call overrides the agent default. If neither is set, processor-requested retries are treated as aborts. ## Related - [Processors overview](https://mastra.ai/docs/agents/processors): Conceptual guide to processors - [Guardrails](https://mastra.ai/docs/agents/guardrails): Security and validation processors - [Memory Processors](https://mastra.ai/docs/memory/memory-processors): Memory-specific processors