Databricks SDK for JavaScript
    Preparing search index...

    Dataset. Represents a reference to data used for training, testing, or evaluation during the model development process.

    interface Dataset {
        digest?: string;
        name?: string;
        profile?: string;
        schema?: string;
        source?: string;
        sourceType?: string;
    }
    Index

    Properties

    digest?: string

    Dataset digest, e.g. an md5 hash of the dataset that uniquely identifies it within datasets of the same name.

    name?: string

    The name of the dataset. E.g. “my.uc.table@2” “nyc-taxi-dataset”, “fantastic-elk-3”

    profile?: string

    The profile of the dataset. Summary statistics for the dataset, such as the number of rows in a table, the mean / std / mode of each column in a table, or the number of elements in an array.

    schema?: string

    The schema of the dataset. E.g., MLflow ColSpec JSON for a dataframe, MLflow TensorSpec JSON for an ndarray, or another schema format.

    source?: string

    Source information for the dataset. Note that the source may not exactly reproduce the dataset if it was transformed / modified before use with MLflow.

    sourceType?: string

    The type of the dataset source, e.g. ‘databricks-uc-table’, ‘DBFS’, ‘S3’, ...