Tool Call Support¶

Major power of an AI agent comes from it's ability to invoke a tool or a set of tools to perform a task. Sentinel AI supports tool calling as a standard feature.

Tools can be defined as part of the agent class itself and are auto-discovered during instantiation
Tool methods can have complex object types as input and output. they can also optionally have an instance of AgentRunContext as the first parameter.
Tool methods need to be described using the @Tool annotation
Tool methods from other c lasses can be registered at runtime or during agent creation.
SentinelAI provides an abstraction called ToolBox to allow developers to build libraries of tools that do related work. ToolBoxes can be registered with the agent dynamically.

The `AgentRunContext` class¶

The AgentRunContext class is used to pass information between the agent and the tool. It can be passed as the first parameter to the tool method. Along with other stuff, the context can be used to access the current user request being processed, the request metadata and so on.

Member	Type	Description
`runId`	`String`	An ID for this particular run, used to track the run in logs and events.
`request`	`R`	The user request being processed.
`requestMetadata`	`AgentRequestMetadata`	Metadata for the request, such as session and user information.
`agentSetup`	`AgentSetup`	Required setup for the agent, including model and tool configurations.
`oldMessages`	`List<AgentMessage>`	List of old messages exchanged during the agent's interaction.
`modelUsageStats`	`ModelUsageStats`	Tracks usage statistics for the model during this run.

Tools as part of the agent class¶

SentinelAI will auto register all methods defined as part of the agent class. The methods need to be annotated with @Tool for this functionality to work.

TestAgent.java

class TestAgent extends Agent<String, String, TestAgent> {

    @JsonClassDescription("Information about the user")
    private record UserInfo(//(1)!
            @JsonProperty(required = true)
            @JsonPropertyDescription("Name of the user")
            String name) {
    }

    public TestAgent(@NonNull AgentSetup setup) {
        super(String.class,
              "Greet the user",
              setup,
              List.of(),
              Map.of());
    }

    @Tool("Call this tool to get appropriate greeting for the user")//(2)!
    public String greet(final UserInfo userInfo) {
        return "Hello " + userInfo.name();
    }

    @Override
    public String name() {
        return "test-agent";
    }
}

Complex object type for the input parameter. Document everything properly using appropriate annotations.
Tool Description. Make it verbose and clear. This will be used to generate the tool description for the LLM.

Tip

The @JsonProperty and JsonPropertyDescription annotations are used to mark the parameter as required and to provide a description for the parameter. It is highly recommended to provide documentation for the tool itself and it's parameters to help the LLM understand the usage for the tools better.

Externally defined tools¶

External tools can be defined and registered with the agent at runtime or at startup. This is useful when you want to create a library of tools that can be used by multiple agents. The tools can be defined as part of a ToolBox or registered individually.

Registering methods from other classes as tools¶

You can register methods from other classes as tools when instantiating the agent. SentinelAI provides the ToolUtils utility class to easily read and register methods from other classes. The methods need to be annotated with @Tool for this functionality to work.

Let's say the tool is defined in an external class called ExternalClass:

ExternalClass.java

public class ExternalClass {
    @Tool("Get appropriate greeting for the user")
    public String greet(@JsonProperty(required = true)
                        @JsonPropertyDescription("Name of the user")
                        String name) {
        return "Hello " + name;
    }
}

These tools can be used when creating the agent like so:

TestAgent.java

public class TestAgent extends Agent<String, String, TestAgent> {

    public TestAgent(@NonNull AgentSetup setup) {
        super(String.class,
              "Greet the user",
              setup,
              List.of(),
              ToolUtils.fromObject(new ExternalClass()));//(1)!
    }

    @Override
    public String name() {
        return "test-agent";
    }
}

The ToolUtils.fromObject (or ToolUtils.readTools) method will read all the methods annotated with @Tool and register them with the agent. The methods can have complex object types as input and output. They can also optionally have an instance of AgentRunContext as the first parameter.

As seen in the example, tools can be read and registered directly during agent creation, or by calling the registerTools method.

Tool Retries & Timeouts¶

SentinelAI supports configuring retries and timeouts at both the model level and the individual tool level.

Model Call Retries & Timeouts¶

The ModelSettings object allows you to configure a global timeout for LLM calls using java.time.Duration. Additionally, you can provide a RetrySetup in AgentSetup to handle transient failures.

final var agentSetup = AgentSetup.builder()
        .model(model)
        .modelSettings(ModelSettings.builder()
                .timeout(Duration.ofSeconds(30)) //(1)!
                .build())
        .retrySetup(RetrySetup.builder()
                .totalAttempts(3) //(2)!
                .delayAfterFailedAttempt(Duration.ofSeconds(1))
                .retriableErrorTypes(Set.of(ErrorType.JSON_ERROR, ErrorType.MODEL_CALL_COMMUNICATION_ERROR)) //(3)!
                .build())
        .build();

Global timeout for the model call.
Number of retry attempts.
Specific error types that should trigger a retry.

Individual Tool Timeouts¶

You can also specify timeouts for individual tools using the @Tool annotation. This is useful for tools that might perform long-running operations.

@Tool(value = "Long running operation", timeoutSeconds = 60)
public String longRunningTool() {
    // implementation
}

Using ToolBox¶

ToolBox is a very simple interface to define a set of tools that are related to each other. The tools can be registered with the agent all together by registering the toolbox using the registerToolbox methods.

TestToolBox.java

public class TestToolBox implements ToolBox {
    @Override
    public String name() {
        return "test-toolbox";
    }

    @Tool("Get appropriate greeting for the user")
    public String greet(@JsonProperty(required = true)
                        @JsonPropertyDescription("Name of the user")
                        String name) {
        return "Hello " + name;
    }
}

Create the agent as before:

TestAgent.java

public class TestAgent extends Agent<String, String, TestAgent> {

    public TestAgent(@NonNull AgentSetup setup) {
        super(String.class,
              "Greet the user",
              setup,
              List.of(),
              Map.of());
    }

    @Override
    public String name() {
        return "test-agent";
    }
}

Register the toolbox(es) at runtime:

final var agent = new TestAgent(agentSetup)
                        .registerToolbox(new TestToolBox());
//Use agent

Tip

We recommend combining related functionality into toolboxes and making libraries out of them to be used across agent.

Large Response Blocking¶

When a tool returns a very large response, it can consume a significant portion of the model's context window, leaving little room for the conversation history and the model's own reasoning. To guard against this, Sentinel AI automatically intercepts every tool response and checks its estimated token count before adding it to the message history.

How It Works¶

After a tool executes, the SafeToolRunner wraps the result and performs the following steps:

Estimate tokens: Calls Model.estimateTokenCount() on the tool response message.
Compute the ceiling: Derives the maximum allowed tokens as contextWindowSize × maxToolResponsePercentage / 100.
Allow or block:
- If the response is within the ceiling it is passed through unchanged.
- If the response exceeds the ceiling it is replaced with a TOOL_CALL_PERMANENT_FAILURE error that instructs the LLM to retry the call with a narrower query: "Tool response too large. Max allowed: N. Actual: M tokens. Modify the request to reduce output size."
Unsupported models: If the active model does not support token estimation (Model.TOKEN_COUNT_UNKNOWN), the guard is skipped and a warning is logged.

Configuring the Limit¶

The limit is controlled by the maxToolResponsePercentage field in AgentSetup:

final var agentSetup = AgentSetup.builder()
        .model(model)
        .modelSettings(modelSettings)
        .maxToolResponsePercentage(15) // (1)!
        .build();

Allow tool responses up to 15 % of the model's context window. The default is 10 %.

The effective token ceiling is computed as:

maxAllowedTokens = contextWindowSize × maxToolResponsePercentage / 100

The context window size is taken from ModelSettings.modelAttributes.contextWindowSize. If not explicitly set, it defaults to 128 000 tokens.

Value	Behaviour
`1` – `100`	Used as-is to derive the token ceiling.
`<= 0`	Falls back to the default value of 10 %.
`> 100`	Falls back to the default value of 10 %.

Permanent failure — no automatic retry

When a tool response is blocked, the error type is TOOL_CALL_PERMANENT_FAILURE. This means Sentinel AI will not automatically retry the tool call. The LLM is expected to adapt its next tool invocation to fetch a smaller result set.

Tuning the limit

Start with the default of 10 % and increase cautiously. A higher percentage allows richer tool responses but reduces the space available for conversation history and model reasoning. For tools that are expected to return large payloads (e.g. file contents, search results), consider adding pagination or filtering parameters so the LLM can request smaller chunks.

Tool Call Support¶

The AgentRunContext class¶

Tools as part of the agent class¶

Externally defined tools¶

Registering methods from other classes as tools¶

Tool Retries & Timeouts¶

Model Call Retries & Timeouts¶

Individual Tool Timeouts¶

Using ToolBox¶

Large Response Blocking¶

How It Works¶

Configuring the Limit¶

The `AgentRunContext` class¶