Datasets

View as MarkdownOpen in Claude

A dataset may consist of data in many files stored under one dataset id. It is used to store the data that is uploaded by the Data Collector.

Important Attributes

workspace_id

  • The ID of the Prolific workspace the Dataset will be created in.

name

  • The name or identifier of the AI Task Builder dataset.

status

  • The status of a dataset can transition to one of the following:
    • UNINITIALISED - This means that the dataset has been created, but no data has been uploaded to it yet.
    • PROCESSING - This means that the dataset is being processed into data points to use in the task configuration process.
    • READY - This means that the dataset is completely processed into data points and ready to be used within a batch.
    • ERROR - This means that something has gone wrong during processing and the data may not be usable.

total_datapoint_count

  • The number of data points synthesised from processing a dataset.