Locust's AI load tests can not only call your LLM, but also bypass it and call tools directly. This reduces the cost and focuses on the things that may not be as easy to scale up when there is a surge in traffic. Here's a basic example using Anthropic's Model Context Protocol (MCP).