Databricks SDK for JavaScript
    Preparing search index...

    Interface PutInferenceEndpointAiGatewayResponse

    interface PutInferenceEndpointAiGatewayResponse {
        fallbackConfig?: FallbackConfig;
        guardrails?: AiGuardrails;
        inferenceTableConfig?: InferenceTableConfig;
        rateLimits?: AiGatewayRateLimit[];
        usageTrackingConfig?: UsageTrackingConfig;
    }
    Index

    Properties

    fallbackConfig?: FallbackConfig

    Configuration for traffic fallback which auto fallbacks to other served entities if the request to a served entity fails with certain error codes, to increase availability.

    guardrails?: AiGuardrails

    Configuration for AI Guardrails to prevent unwanted data and unsafe data in requests and responses.

    inferenceTableConfig?: InferenceTableConfig

    Configuration for payload logging using inference tables. Use these tables to monitor and audit data being sent to and received from model APIs and to improve model quality.

    rateLimits?: AiGatewayRateLimit[]

    Configuration for rate limits which can be set to limit endpoint traffic.

    usageTrackingConfig?: UsageTrackingConfig

    Configuration to enable usage tracking using system tables. These tables allow you to monitor operational usage on endpoints and their associated costs.