Introduction

Agent Builder Platform is a comprehensive solution for building and deploying AI agents that can search documents, process information, and interact with tools. It provides a unified API for creating and managing intelligent agents that integrate with your applications.

Some of the key capabilities of Agent Builder Platform include:

RAG (Retrieval-Augmented Generation): Search through documents intelligently with context-aware responses
Graph RAG: Combine knowledge graph traversal with semantic search for context-aware responses grounded in structured entity relationships
Tool Agents: Perform actions using function calling and extensible tool integrations
Task Agents: Process structured data with template-based prompts and schema validation
Code Interpreter: Execute Python code in a secure sandbox for data analysis, visualization, and programmatic processing
Streaming Support: Receive real-time incremental responses for all agent types
Guardrails: Built-in content safety and prompt attack protection

The Agent Builder Platform provides the following agent types:

Additional Tool Agent capabilities:

Quickstart Guide

This guide will help you get started with creating and using different types of agents in our platform.

Prerequisites

Valid authentication token
Access to the API endpoints
Appropriate permissions (READ, CREATE, EDIT, INVOKE, DELETE)

Creating Your First Agent

All agents within an environment must have a unique name. Duplicate names will result in an error when creating or updating an agent.

name vs displayName

name is a unique identifier used for lookups and must be unique within an environment. A future release will restrict name to lowercase letters, numbers, hyphens, underscores, and dots only (e.g. "insurance-claim-evaluator").
displayName is an optional human-readable label shown in UIs (e.g. "Insurance Claim Evaluator"). If omitted, it defaults to the value of name.

Tool Agent
RAG Agent
Task Agent

Available Tool Types

The following tool types are supported:

function: Call predefined functions (e.g., multiply)
structured_output: Force JSON output matching a schema
task_agent: Reference a Task Agent as a tool for multi-agent orchestration
mcp: Connect to an MCP server for external tools
rag: Attach Content Lake retrieval as a tool
analytics: Execute Python code in a secure sandbox

Tool agents can perform specific tasks using predefined tools. Here are common examples:

Basic Calculator Agent

{
    "name": "calculator",
    "displayName": "Calculator",
    "description": "Math assistant",
    "agentType": "tool",
    "config": {
        "tools": [
            {
                "toolType": "function",
                "name": "multiply",
                "description": "Multiplies two numbers",
                "funcName": "multiply"
            }
        ],
        "llmModelId": "anthropic.claude-3-haiku-20240307-v1:0",
        "systemPrompt": "You are a helpful assistant with access to various tools.",
        "inferenceConfig": {
            "maxTokens": 4000
        },
        "guardrails": ["HAIP-Prompt_attack-Medium"]
    }
}

Structured Output Agent

Experimental Feature

Structured Output on Tool Agent is currently an experimental feature. The API and functionality may change in future releases.

{
    "name": "person-extractor",
    "displayName": "Person Extractor",
    "description": "Extracts structured information about people",
    "agentType": "tool",
    "config": {
        "tools": [
            {
                "toolType": "structured_output",
                "name": "structured_output", 
                "description": "Extracts structured information about a person from text",
                "outputSchema": {
                    "type": "object",
                    "properties": {
                        "name": {
                            "type": "string",
                            "description": "Full name of the person"
                        },
                        "age": {
                            "type": "integer",
                            "description": "Age of the person"
                        },
                        "occupation": {
                            "type": "string",
                            "description": "Person's job or profession"
                        }
                    },
                    "required": ["name"]
                }
            }
        ],
        "llmModelId": "anthropic.claude-3-sonnet-20240229-v1:0",
        "systemPrompt": "You are a helpful assistant specialized in extracting structured information about people from text.",
        "inferenceConfig": {
            "maxTokens": 4000,
            "temperature": 0.1
        },
        "guardrails": ["HAIP-Profanity"]
    }
}

Example usage:

POST /v1/agents/{agent_id}/versions/{version_id-or-latest}/invoke
{
    "messages": [
        {
            "role": "user",
            "content": "John Doe is a 35-year-old software engineer" 
        }
    ]
}

Expected response:

{
    "name": "John Doe",
    "age": 35,
    "occupation": "software engineer"
}

RAG (Retrieval-Augmented Generation) agents are perfect for document search and Q&A:

{
    "name": "document-helper",
    "displayName": "Document Helper",
    "description": "Document assistant",
    "agentType": "rag",
    "notes": "Agent for document queries",
    "config": {
        "hxqlQuery": "SELECT * FROM SysContent",
        "llmModelId": "amazon.nova-micro-v1:0"
    }
}

To query the RAG agent:

POST /v1/agents/{agent_id}/versions/{version_id-or-latest}/invoke
{
    "messages": [
        {
            "role": "user",
            "content": "What's in our HR policy about vacation days?"
        }
    ],
    "hxqlQuery": "SELECT * FROM SysContent",
    "guardrails": ["HAIP-Hate-High"]
}

Task Agents (structured input) are a helper wrapper built on top of Tool Agents to process structured inputs with schema validation and template rendering:

{
    "name": "document-classifier",
    "displayName": "Document Classifier",
    "description": "Classifies documents based on content",
    "agentType": "task",
    "config": {
        "llmModelId": "anthropic.claude-3-haiku-20240307-v1:0",
        "systemPrompt": "Classify the document titled '{{title}}' with content: {{content}}. Provide a category and confidence score.",
        "inputSchema": {
            "type": "object",
            "properties": {
                "title": {
                    "type": "string",
                    "description": "Document title"
                },
                "content": {
                    "type": "string", 
                    "description": "Document content to classify"
                }
            },
            "required": ["title", "content"]
        },
        "tools": [
            {
                "toolType": "structured_output",
                "name": "structured_output",
                "description": "Extracts classification results from the document",
                "outputSchema": {
                    "type": "object",
                    "properties": {
                        "category": { "type": "string", "description": "Predicted category of the document" },
                        "confidence": { "type": "number", "description": "Confidence score (0-1)" }
                    },
                    "required": ["category", "confidence"]
                }
            }
        ],
        "inferenceConfig": {
            "maxTokens": 1000,
            "temperature": 0.1
        },
        "guardrails": ["HAIP-Profanity"]
    }
}

To invoke the Task agent:

POST /v1/agents/{agent_id}/versions/{version_id}/invoke-task
{
    "inputs": {
        "title": "Financial Report Q3",
        "content": "Revenue increased by 15% compared to last quarter..."
    },
    "guardrails": ["HAIP-Insults-High"]
}

Task Agent vs Tool Agent

The main difference between Task and Tool Agents is the prompt which is now a template that gets populated with the inputs.

"systemPrompt": "Classify the document titled '{{title}}' with content: {{content}}. Provide a category and confidence score.",

Streaming Responses

Our platform supports streaming responses for Tool, RAG, and Task Agents, allowing you to receive data incrementally as it's generated. This is particularly useful for long-running operations or when you want to display results to users in real-time.

Graph RAG

Graph RAG agents do not currently support streaming — invoking a graphRag agent on /invoke-stream will fail inside the runtime. Use the non-streaming /invoke endpoint for Graph RAG.

Streaming Endpoints

Tool and RAG Agents: POST /v1/agents/{agent_id}/versions/{version_id-or-latest}/invoke-stream
Task Agents: POST /v1/agents/{agent_id}/versions/{version_id-or-latest}/invoke-task-stream

Understanding the Stream

When you invoke the streaming endpoints, the server will send back a sequence of data chunks with the appropriate format based on the agent type.

For Tool Agent and RAG Agent streams (/invoke-stream), the system uses text/event-stream content type.
For Task Agent streams (/invoke-task-stream), the system uses text/plain content type.

In all cases, the payload is newline-delimited JSON (NDJSON): each line in the response body is a self-contained JSON object, followed by a newline character. The stream is terminated by a special chunk with type response.completed.

Not SSE framing

Although /invoke-stream is served with Content-Type: text/event-stream, the payload is not Server-Sent Events framing — there are no data: prefixes, no event: fields, and no blank-line event terminators. Do not consume this endpoint with a browser EventSource; use a raw HTTP/fetch client that reads the body line-by-line and parses each line as JSON.

Processing Streamed Data

To process the stream, your client should:

Open a connection to the streaming endpoint.
Read the response line by line.
Parse the line as a JSON object
If a chunk has type response.completed, close the connection.
Each JSON object (chunk) is either a Created, Completed, TextDelta, or ToolCall chunk, determined by the value of type field.
Extracting CreatedChunk: The CreatedChunk object has type response.created and signals the start of the streaming response. It has an id, and all subsequent chunks will have the same id.
```
{"type": "response.created", "response": {"id": "resp_id", "model": "LLM-model-id", "object": "response", "createdAt": 1760631101}}
```
Extracting Content: The TextDelta object has type response.output_text.delta and contains a delta field with a segment of the text response. You should append these segments together to form the complete message.
```
{"type": "response.output_text.delta", "role": "assistant", "delta": "The result of 3x5 is ", "id": "resp_id"}
```

Extracting Tool Calls: The ToolCallChunk object has type response.function_call_arguments.done.

{"type": "response.function_call_arguments.done", "name": "multiply", "itemId": "tool_123", "arguments": "{\"a\": 3, \"b\": 5}", "id": "resp_id"}

Extracting CompletedChunk: The CompletedChunk object has type response.completed and signals the end of the streaming response. It may have customOutputs with sourceNodes from the request.

{"type": "response.completed", "response": {"id": "resp_id", "model": "LLM-model-id", "object": "response", "createdAt": 1760631101, "customOutputs": {"sourceNodes": [{"docId": "doc-id", "chunkId": "chunk-id", "score": 0.05, "text": "source node text"}], "ragMode": "normal"}}}

Example: Streaming with a Tool Agent (using cURL)

Let's say you have a Tool Agent (like the Calculator example) with ID your-tool-agent-id and using the most recent version of config.

Request:

curl -N -X POST "http://your-api-base-url/v1/agents/your-tool-agent-id/versions/latest/invoke-stream" \\
-H "Authorization: Bearer your-jwt-token" \\
-H "Content-Type: application/json" \\
-d '{
    "messages": [
        {
            "role": "user",
            "content": "What is 3x5?"
        }
    ]
}'

Expected Response Stream (raw text/event-stream lines):

{"type": "response.created", "response": {"id": "resp-id", "model": "LLM model ID", "object": "response", "createdAt": 1760632921}}
{"type": "response.output_text.delta", "role": "assistant", "delta": "Okay, I can help", "id": "resp-id"}
{"type": "response.output_text.delta", "role": "assistant", "delta": " with that. ", "id": "resp-id"}
{"type": "response.function_call_arguments.done", "id": "resp-id", "arguments": "{\"a\": 3, \"b\": 5}", "itemId": "tooluse_abc", "name": "multiply"}
{"type": "response.output_text.delta", "role": "assistant", "delta": "The result of 3 x 5", "id": "resp-id"}
{"type": "response.output_text.delta", "role": "assistant", "delta": "is 15.", "id": "resp-id"}
{"type": "response.completed", "response": {"id": "resp-id", "model": "LLM model ID", "object": "response", "createdAt": 1760632921}}

(Note: The exact chunking and content can vary. Some chunks might have empty content.)

Processing the above stream would yield:

Aggregated Content: "Okay, I can help with that. The result of 3 x 5 is 15."

Tool Call:

{
    "itemId": "tooluse_abc",
    "name": "multiply",
    "arguments": "{\"a\": 3, \"b\": 5}"
}

Example: Streaming with a RAG Agent (using cURL)

For a RAG Agent with ID your-rag-agent-id and the most recent version of the config.

Request:

curl -N -X POST "http://your-api-base-url/v1/agents/your-rag-agent-id/versions/latest/invoke-stream" \\
-H "Authorization: Bearer your-jwt-token" \\
-H "Content-Type: application/json" \\
-d '{
    "messages": [
        {
            "role": "user",
            "content": "What is our vacation policy?"
        }
    ],
    "hxqlQuery": "SELECT * FROM SysContent"
}'

Expected Response Stream (raw text/event-stream lines):

{"type":"response.created","response":{"id":"44de5f46-1ad8-4d26-ab5a-46a928cdaa3f","model":"amazon.nova-micro-v1:0","object":"response","createdAt":1760631101}}
{"type":"response.output_text.delta","role":"assistant","delta":"Our vacation policy states that","id":"44de5f46-1ad8-4d26-ab5a-46a928cdaa3f"}
{"type":"response.output_text.delta","role":"assistant","delta":" employees are entitled to X days","id":"44de5f46-1ad8-4d26-ab5a-46a928cdaa3f"}
{"type":"response.completed","response":{"id":"44de5f46-1ad8-4d26-ab5a-46a928cdaa3f","model":"amazon.nova-micro-v1:0","object":"response","createdAt":1760631104,"customOutputs":{"sourceNodes":[{"docId":"doc-id","chunkId":"chunk-id","score":0.05,"text":"source node text"}],"ragMode":"normal"}}}

Processing the above stream would yield:

Aggregated Content: "Our vacation policy states that employees are entitled to X days"

tip

When using streaming, ensure your client correctly handles line endings and JSON parsing for each chunk. Remember that tools like Swagger UI may not display streaming responses correctly; cURL, Postman (with appropriate settings), or custom client code are better choices.

Response Format

All non-streaming agent invocations return an AgentResponse object with the following structure:

{
    "object": "response",
    "createdAt": 1741705500,
    "model": "anthropic.claude-3-haiku-20240307-v1:0",
    "output": [
        {
            "type": "message",
            "status": "completed",
            "role": "assistant",
            "content": [
                {
                    "type": "output_text",
                    "text": "The response text from the agent."
                }
            ]
        }
    ],
    "customOutputs": {
        "sourceNodes": [...],
        "ragMode": "normal"
    }
}

Response Fields

Field	Type	Description
`object`	string	Always `"response"`
`createdAt`	integer	Unix timestamp of when the response was created
`model`	string	The LLM model ID used for this response
`output`	array	List of output items (messages and/or tool calls)
`customOutputs`	object \| null	Additional output data (present for RAG agents)

Output Types

Each item in the output array has a type field:

TextOutput (`"message"`)

A text response from the agent:

Field	Type	Description
`type`	string	`"message"`
`status`	string	`"completed"`
`role`	string	`"assistant"`
`content`	array	List of content blocks
`content[].type`	string	`"output_text"` for text content
`content[].text`	string	The text content of the response

ToolCallOutput (`"function_call"`)

A function call made by the agent, as emitted in the non-streaming AgentResponse.output array. (In streaming responses the equivalent signal is a response.function_call_arguments.done chunk carrying itemId — see Processing Streamed Data.)

Field	Type	Description
`type`	string	`"function_call"`
`status`	string	`"completed"`
`callId`	string	Unique identifier for this tool call
`name`	string	Name of the tool/function called
`arguments`	string	JSON-encoded arguments passed to the tool

Custom Outputs

The customOutputs field is present for RAG agents and contains retrieval-specific data:

Field	Type	Description
`sourceNodes`	array	Documents retrieved from Content Lake used as context
`sourceNodes[].docId`	string	Document identifier
`sourceNodes[].chunkId`	string	Chunk identifier within the document
`sourceNodes[].score`	number	Relevance score (0–1)
`sourceNodes[].text`	string	Text content of the retrieved chunk
`ragMode`	string	RAG mode used: `"normal"` or `"deepResearch"`

Guardrails

Guardrails are policy-based content filters that help ensure safe and appropriate AI interactions. They can detect and filter content such as profanity, insults, hate speech, and prompt injection attacks.

Discovering Available Guardrails

The list of available guardrails is dynamic and managed by the platform. Use the API to discover what's available in your environment:

GET /v1/guardrails

Example Response:

[
    {
        "name": "HAIP-Profanity",
        "description": "Filters profane language from inputs and outputs"
    },
    {
        "name": "HAIP-Insults-High",
        "description": "Filters insulting content with high sensitivity"
    },
    {
        "name": "HAIP-Insults-Low",
        "description": "Filters insulting content with low sensitivity"
    },
    {
        "name": "HAIP-Hate-High",
        "description": "Filters hate speech with high sensitivity"
    },
    {
        "name": "HAIP-Prompt_attack-Medium",
        "description": "Detects and blocks prompt injection attacks"
    }
]

info

Guardrail names and descriptions are managed at the platform level and may change over time. Always use GET /v1/guardrails to discover the current list rather than hardcoding guardrail names.

Applying Guardrails

Guardrails can be applied in two ways:

At Agent Creation

Include guardrails in the agent's config to apply them to all invocations:

{
    "name": "safe-assistant",
    "displayName": "Safe Assistant",
    "description": "An assistant with content safety guardrails",
    "agentType": "tool",
    "config": {
        "llmModelId": "anthropic.claude-3-haiku-20240307-v1:0",
        "systemPrompt": "You are a helpful assistant.",
        "tools": [...],
        "guardrails": ["HAIP-Profanity", "HAIP-Insults-High", "HAIP-Prompt_attack-Medium"]
    }
}

Per Invocation

Pass additional guardrails in the request body to apply them to a specific invocation only. These are applied in addition to any guardrails defined in the agent config:

{
    "messages": [
        {
            "role": "user",
            "content": "Tell me about our company policies."
        }
    ],
    "guardrails": ["HAIP-Hate-High"]
}

Quickstart Guide​

Prerequisites​

Creating Your First Agent​

Basic Calculator Agent​

Structured Output Agent​

Streaming Responses​

Streaming Endpoints​

Understanding the Stream​

Processing Streamed Data​

Example: Streaming with a Tool Agent (using cURL)​

Example: Streaming with a RAG Agent (using cURL)​

Response Format​

Response Fields​

Output Types​

TextOutput ("message")​

ToolCallOutput ("function_call")​

Custom Outputs​

Guardrails​

Discovering Available Guardrails​

Applying Guardrails​

At Agent Creation​

Per Invocation​