Interface IngestionConfig

Configuration for the -managed ingestion pipeline. Groups the ingestion destination (required) and optional backfill source.

interface IngestionConfig {
    backfillJobId?: bigint;
    backfillSource?: BackfillSource;
    deduplicationColumns?: string[];
    ingestionDestination?: IngestionDestination;
    ingestionJobId?: bigint;
    ingestionPipelineId?: string;
}

Index

Properties

backfillJobId? backfillSource? deduplicationColumns? ingestionDestination? ingestionJobId? ingestionPipelineId?

Properties

`Optional`backfillJobId

backfillJobId?: bigint

The ID of the Databricks Job that performs the historical backfill of the ingestion Delta table.

`Optional`backfillSource

backfillSource?: BackfillSource

A user-provided source for backfilling data. Historical data is used when creating a training set from streaming features linked to this Stream. The backfill data stored in this location will be copied into the ingestion table for offline querying and training. The schema for this source must match exactly that of the key and payload schemas specified for this Stream.

`Optional`deduplicationColumns

deduplicationColumns?: string[]

Column paths used to identify duplicate rows during ingestion; only one row per distinct combination of these values is kept. Use dot notation for nested fields (e.g. value.user_id). Empty list means every column is compared.

`Optional`ingestionDestination

ingestionDestination?: IngestionDestination

Destination for the -managed Delta table that holds an offline copy of the streaming data for querying and training. This table contains both 1) forward-filled data from the Stream and 2) backfilled data from the BackfillSource (if provided). This table is created and managed by and is deleted when the Stream is deleted.

`Optional`ingestionJobId

ingestionJobId?: bigint

The ID of the Databricks Job that performs the forward-fill ingestion.

`Optional`ingestionPipelineId

ingestionPipelineId?: string

The ID of the SDP pipeline that continuously copies new events from the streaming source into the ingestion Delta table.

Interface IngestionConfig

Index

Properties

Properties

`Optional`backfillJobId

`Optional`backfillSource

`Optional`deduplicationColumns

`Optional`ingestionDestination

`Optional`ingestionJobId

`Optional`ingestionPipelineId

Settings

On This Page

Interface IngestionConfig

Index

Properties

Properties

OptionalbackfillJobId

OptionalbackfillSource

OptionaldeduplicationColumns

OptionalingestionDestination

OptionalingestionJobId

OptionalingestionPipelineId

Settings

On This Page

`Optional`backfillJobId

`Optional`backfillSource

`Optional`deduplicationColumns

`Optional`ingestionDestination

`Optional`ingestionJobId

`Optional`ingestionPipelineId