Cost optimized inference for agents
Save on token costs for long-running agent workloads with Aquaduck inference. Aquaduck works well with agents that support custom providers. Here are some popular agents you can start with.
Aquaduck is ideal for long-running, high-throughput use cases where latency is flexible.