
Introducing more enterprise-grade features for API customers


To help organizations scale their AI usage without over-extending their budgets, we’ve added two new ways to reduce costs on consistent and asynchronous workloads:

  • Discounted usage on committed throughput: Customers with a sustained level of tokens per minute (TPM) usage on GPT-4 or GPT-4 Turbo can request access to provisioned throughput to get discounts ranging from 10–50% based on the size of the commitment.
  • Reduced costs on asynchronous workloads: Customers can use our new Batch API to run non-urgent workloads asynchronously. Batch API requests are priced at 50% off shared prices, offer much higher rate limits, and return results within 24 hours. This is ideal for use cases like model evaluation, offline classification, summarization, and synthetic data generation.
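
To make the two discounts above concrete, here is a minimal Python sketch of the arithmetic. Only the 50% Batch API discount and the 10–50% committed-throughput range come from the announcement; the token volumes, the per-million-token prices, and the names `Workload` and `discounted_cost` are hypothetical values chosen for illustration, not actual pricing or an official tool.

```python
from dataclasses import dataclass

# Figures taken from the announcement.
BATCH_DISCOUNT = 0.50                 # Batch API: 50% off shared prices
COMMIT_DISCOUNT_RANGE = (0.10, 0.50)  # committed throughput: 10-50% off


@dataclass
class Workload:
    """A monthly workload. All prices here are hypothetical placeholders."""
    input_tokens: int
    output_tokens: int
    input_price_per_million: float   # assumed list price in USD
    output_price_per_million: float  # assumed list price in USD

    def list_cost(self) -> float:
        """Cost in USD at undiscounted prices."""
        total = (self.input_tokens * self.input_price_per_million
                 + self.output_tokens * self.output_price_per_million)
        return total / 1_000_000


def discounted_cost(workload: Workload, discount: float) -> float:
    """Apply a fractional discount (0.0 to 1.0) to the list cost."""
    if not 0.0 <= discount <= 1.0:
        raise ValueError("discount must be between 0.0 and 1.0")
    return workload.list_cost() * (1.0 - discount)


if __name__ == "__main__":
    # Hypothetical workload: 200M input and 50M output tokens per month.
    job = Workload(
        input_tokens=200_000_000,
        output_tokens=50_000_000,
        input_price_per_million=10.0,
        output_price_per_million=30.0,
    )
    low, high = COMMIT_DISCOUNT_RANGE
    print(f"List cost:          ${job.list_cost():,.2f}")
    print(f"Batch API cost:     ${discounted_cost(job, BATCH_DISCOUNT):,.2f}")
    print(f"Committed (10% off): ${discounted_cost(job, low):,.2f}")
    print(f"Committed (50% off): ${discounted_cost(job, high):,.2f}")
```

The sketch treats the two options separately because they suit different workloads: the Batch API discount applies to non-urgent jobs that can wait up to 24 hours, while the committed-throughput discount depends on the size of the commitment. The announcement does not say whether the two can be combined, so the sketch does not assume they can.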


We plan to keep adding new features focused on enterprise-grade security, administrative controls, and cost management. For more information on these launches, visit our API documentation or get in touch with our team to discuss custom solutions for your enterprise.
