Agents Hosting
Deploy your agents as serverless auto-scalable endpoints.
Blaxel Agents Hosting lets you bring your agent code and deploys it as a serverless auto-scalable endpoint — no matter your development framework.
An AI agent is any application that leverages generative AI models to take autonomous actions in the real world—whether by interacting with humans or using APIs to read and write data.
Essentials
Agents Hosting is a serverless computing service that allows you to host any application without having to manage infrastructure. It gives you full observability and tracing out of the box.
It doesn’t force you into any kind of workflow or shaped box — you can host any app on Blaxel as long as it exposes an HTTP API. This makes Blaxel completely agnostic of the framework used to develop your workflow or agent.
Blaxel optimizes the experience specifically for agentic AI use cases, delivering a fully serverless experience even for the longer-running tasks typical of AI agents. For example, telemetry focuses on crucial agent metrics like end-to-end latency and time-to-first-token.
Main features
Some features of running workloads on Agents Hosting:
- a default invocation endpoint for synchronous requests
- an asynchronous invocation endpoint, for agent workloads lasting from dozens of seconds to 10 minutes
- full logging, telemetry and tracing — out-of-the-box
- revisions manage your agents’ lifecycle across iterations. You can ship as a new revision and rollback instantly
- an SDK to connect to other Blaxel resources (like models and tools) with adapters to most popular agent frameworks
Requirements & limitations
Agents Hosting have few requirements or limitations:
-
Agents Hosting only supports applications developed in Python and in TypeScript.
-
The only requirement to deploy an app on Agents Hosting is that it exposes an HTTP API server. You can use express, fastify, FastAPI, etc. for this.
-
Deployed agents have a runtime limit after which executions time out. This timeout duration is determined by your chosen infrastructure generation. For Mk 2 generation, the maximum timeout is 10 minutes.
-
The synchronous endpoint has a timeout of 100 seconds for keeping the connection open when no data flows through the API. If your agent streams back responses, the 100-second timeout resets with each chunk streamed. For example, if your agent processes a request for 5 minutes while streaming data, the connection stays open. However, if it goes 100 seconds without sending any data — even while calling external APIs — the connection will timeout.
Deploy your agent to Blaxel
The Blaxel SDK allows you to connect to and orchestrate Blaxel resources when developing your agents, such as: model APIs, tool servers, sandboxes, batch jobs, or other agents. It also ensures telemetry, secure connections to third-party systems or private networks and smart global placement of workflows.
This packaging makes Blaxel fully agnostic of the framework used to develop your agent and doesn’t prevent you from deploying your software on another platform.
Develop an agent using Blaxel resources
Read our guide for developing AI agents leveraging Blaxel computing services.
Deploy an agent on Blaxel
Learn how to deploy and manage your agent on Blaxel.
Use your agents in your apps
Once your agent is deployed on Blaxel, you can start using it in your applications.
Whether you need to process individual inference requests or integrate the agent into a larger application workflow, Blaxel provides flexible options for interaction. Learn how to authenticate requests, handle responses, and optimize your agent’s performance in production environments.