Recharge & Billing

The platform uses a prepaid wallet: recharge USDT into your organization wallet first, and each call is then billed against that balance according to actual usage.

Recharge (USDT)

  1. In the console, open Recharge and create an order.
  2. The system returns a unique amount and a payment entry (Infini-hosted on-chain USDT).
  3. Complete the on-chain payment as prompted.
  4. After on-chain confirmation, the balance is credited automatically; check order status under "Recharge Records".

Pricing basis

Currently 1 USDT = 1 USD, with no FX conversion. You choose the chain on the payment page; the receiving chain recorded on the order is the one reported by the payment callback.

An order typically moves through these statuses: created → processing → completed. Overpayments are handled according to platform policy (see the recharge page and the order detail).

Billing model

Each call is charged in three steps:

  1. Hold: before the call, an estimated amount is frozen from available balance to prevent concurrent overspend.
  2. Settle: after the call completes, you are charged for the tokens or images actually consumed.
  3. Release: the frozen excess over actual cost is released back to available balance.

As a result, "available balance" may briefly fall below "total balance" while a request is in flight, and it recovers once the request settles. If settlement fails abnormally (for example a network drop or client disconnect), a background safety net releases the hold automatically after a delay.
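The hold, settle, and release steps can be sketched with a toy in-memory wallet. This is only an illustration of the accounting described above, not the platform's ledger: the `Wallet` class, its method names, and the amounts are all hypothetical, and the source does not say what happens when the actual cost exceeds the hold, so the sketch simply charges the actual cost.

```python
from dataclasses import dataclass, field
from decimal import Decimal


@dataclass
class Wallet:
    """Toy model of a prepaid wallet (hypothetical, for illustration only)."""
    total: Decimal
    holds: dict = field(default_factory=dict)  # request_id -> frozen amount

    @property
    def available(self) -> Decimal:
        # Available balance is the total minus everything currently frozen.
        return self.total - sum(self.holds.values(), Decimal("0"))

    def hold(self, request_id: str, estimate: Decimal) -> None:
        # Step 1: freeze an estimate so concurrent calls cannot overspend.
        if estimate > self.available:
            raise ValueError("insufficient available balance")
        self.holds[request_id] = estimate

    def settle(self, request_id: str, actual: Decimal) -> None:
        # Steps 2 and 3: charge the actual cost; dropping the hold
        # returns any frozen excess to the available balance.
        self.holds.pop(request_id)
        self.total -= actual

    def release(self, request_id: str) -> None:
        # Safety net: drop a hold without charging (e.g. failed settlement).
        self.holds.pop(request_id, None)


wallet = Wallet(total=Decimal("10.00"))
wallet.hold("req-1", Decimal("0.50"))
mid_request_available = wallet.available   # lower than total while in flight
wallet.settle("req-1", Decimal("0.12"))
final_available = wallet.available         # equals total again after settlement
```

In this sketch the available balance is lower than the total only while a hold exists, which matches the brief dip described above.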

Pricing units

| Unit | Applies to | How it's billed |
| --- | --- | --- |
| Per token | Chat / text / embeddings | By input + output tokens × price per 1M tokens |
| Per image | Image generation | By the actual number of images returned (not the request n) |
| Task billing | Video / Midjourney / async image | Settled per task; failed / cancelled / expired tasks are not charged, and the hold is released |

Per-model input / output / cache prices and multipliers are listed on the console "Model Prices" page or under Models & plans.
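As a rough worked example of the per-token and per-image units, the arithmetic might look like the sketch below. The function names and the prices are invented for illustration, and model multipliers and cache pricing are deliberately left out; real prices come from the "Model Prices" page.

```python
from decimal import Decimal

PER_MILLION = Decimal(1_000_000)


def token_cost(input_tokens: int, output_tokens: int,
               input_price: Decimal, output_price: Decimal) -> Decimal:
    """Cost of a text call, with prices quoted per 1M tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / PER_MILLION


def image_cost(images_returned: int, price_per_image: Decimal) -> Decimal:
    """Cost of an image call: billed on images returned, not on the requested n."""
    return images_returned * price_per_image


# Hypothetical prices: 2.00 per 1M input tokens, 6.00 per 1M output tokens.
chat_cost = token_cost(1200, 300, Decimal("2.00"), Decimal("6.00"))

# Requested n=4 but only 3 images came back; only 3 are billed.
img_cost = image_cost(3, Decimal("0.04"))
```

With these made-up prices, the chat call costs 0.0042 and the image call costs 0.12, both in USD-equivalent USDT.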

Cache billing

On a prompt cache or platform response-cache hit, the cache-read portion is billed at cache_read_price, usually much lower than the input price. Response headers carry cache-hit info:

| Header | Meaning |
| --- | --- |
| X-TT-Cache-Status | Cache hit status (HIT / MISS, etc.) |
| X-TT-Cache-Savings | Amount saved this call due to cache |
| X-TT-Cache-Read-Tokens | Tokens billed at the cache price |
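A client could read these headers along the following lines. This is a minimal sketch: the header names come from the table above, but the value formats (a decimal string for savings, an integer string for tokens) are assumptions, as are the `CacheInfo` and `parse_cache_headers` names and the sample values.

```python
from dataclasses import dataclass
from decimal import Decimal, InvalidOperation
from typing import Mapping, Optional


@dataclass
class CacheInfo:
    status: Optional[str]
    savings: Optional[Decimal]
    read_tokens: Optional[int]


def parse_cache_headers(headers: Mapping[str, str]) -> CacheInfo:
    """Extract cache-hit info from response headers (case-insensitive)."""
    lowered = {name.lower(): value for name, value in headers.items()}

    def to_decimal(raw: Optional[str]) -> Optional[Decimal]:
        # Assumed format: a plain decimal string; anything else becomes None.
        try:
            return Decimal(raw) if raw is not None else None
        except InvalidOperation:
            return None

    def to_int(raw: Optional[str]) -> Optional[int]:
        # Assumed format: an integer string; anything else becomes None.
        try:
            return int(raw) if raw is not None else None
        except ValueError:
            return None

    return CacheInfo(
        status=lowered.get("x-tt-cache-status"),
        savings=to_decimal(lowered.get("x-tt-cache-savings")),
        read_tokens=to_int(lowered.get("x-tt-cache-read-tokens")),
    )


# Sample values are invented for illustration.
info = parse_cache_headers({
    "X-TT-Cache-Status": "HIT",
    "X-TT-Cache-Savings": "0.0012",
    "X-TT-Cache-Read-Tokens": "800",
})
```

Missing or unparseable headers come back as `None` rather than raising, so the same helper can be used on responses that carry no cache headers.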

Balance and usage

| Console page | What to look at |
| --- | --- |
| Recharge | Current balance, recharge orders, ledger |
| Usage | Token usage and charges by time / model, CSV export |
| Requests | One call's billing detail, error detail (by request id) |

When balance runs out

Calls return 402 billing_shortfall. Recommendations:

  • Set monthly limits on keys for critical apps so one app can't drain the balance.
  • Watch the balance and recharge ahead of time.
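On the client side, a 402 can be surfaced as a distinct error so that callers stop and recharge. The sketch below is an assumption-heavy illustration: `InsufficientBalanceError` and `raise_for_billing` are hypothetical names, and since the response body shape is not documented here, the code only checks whether the text `billing_shortfall` appears in it.

```python
class InsufficientBalanceError(RuntimeError):
    """Raised when the gateway reports that the balance cannot cover a call."""


def raise_for_billing(status_code: int, body_text: str = "") -> None:
    """Raise InsufficientBalanceError on HTTP 402; do nothing otherwise."""
    if status_code != 402:
        return
    # The body format is an assumption; only look for the error name in the text.
    detail = "billing_shortfall" if "billing_shortfall" in body_text else "payment required"
    raise InsufficientBalanceError(
        f"HTTP 402 ({detail}): recharge the wallet before retrying"
    )


try:
    # Sample body is invented for illustration.
    raise_for_billing(402, '{"error": "billing_shortfall"}')
except InsufficientBalanceError as exc:
    message = str(exc)
```

Treating this case separately from transient errors is a design choice: an immediate retry is unlikely to succeed until the balance has been topped up.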

See Errors & rate limits.

Gateway: gateway.mindproxy.ai