What are the daily, monthly, concurrent, or other rate limits for the ALFA APIs?

Last updated: February 23, 2026

ALFA enforces several rate limits across its API surfaces to maintain system stability and ensure fair usage across clients.

  • Agent creation is limited to 50 creations per day, ensuring that workplan or agent instantiation does not overwhelm backend planning resources.

  • For conversational workloads, chatting within a thread supports up to 1,000 requests per minute, allowing for high-frequency interactive use cases.

  • All other thread-related endpoints support significantly higher throughput at 10,000 requests per minute, providing ample capacity for history retrieval or artifact polling.

  • The remaining non-thread API endpoints—including workplans, reports, documents, and datasets—generally operate under a limit of 20 requests per minute, which is sufficient for typical analytical and automation workloads.

These limits define the expected operational boundaries but may be adjusted depending on account tier or platform configuration.