Databricks SDK for JavaScript
    Preparing search index...

    Interface CreateInferenceEndpointRequest

    interface CreateInferenceEndpointRequest {
        aiGateway?: AiGatewayConfig;
        budgetPolicyId?: string;
        config?: EndpointCoreConfig;
        description?: string;
        emailNotifications?: EmailNotifications;
        name?: string;
        rateLimits?: RateLimit[];
        routeOptimized?: boolean;
        tags?: EndpointTag[];
    }
    Index

    Properties

    aiGateway?: AiGatewayConfig

    The AI Gateway configuration for the serving endpoint. NOTE: External model, provisioned throughput, and pay-per-token endpoints are fully supported; agent endpoints currently only support inference tables.

    budgetPolicyId?: string

    The budget policy to be applied to the serving endpoint.

    The core config of the serving endpoint.

    description?: string
    emailNotifications?: EmailNotifications

    Email notification settings.

    name?: string

    The name of the serving endpoint. This field is required and must be unique across a . An endpoint name can consist of alphanumeric characters, dashes, and underscores.

    rateLimits?: RateLimit[]

    Rate limits to be applied to the serving endpoint. NOTE: this field is deprecated, please use AI Gateway to manage rate limits.

    routeOptimized?: boolean

    Enable route optimization for the serving endpoint.

    tags?: EndpointTag[]

    Tags to be attached to the serving endpoint and automatically propagated to billing logs.