Import data with S3

This guide walks you through importing a dataset stored in Amazon S3 into your Centaur Labs project for labeling.

Collect your assets in an Amazon S3 bucket. Don't have an Amazon S3 account? Follow Amazon's guide to set up your first S3 bucket.
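If your assets are still on a local drive, a short script can collect them into the bucket. The following is a minimal sketch, not an official Centaur tool: it assumes the boto3 library is installed and AWS credentials are configured, and the function names, bucket name, and folder names are hypothetical placeholders.

```python
from pathlib import Path


def plan_uploads(local_dir, prefix=""):
    """Pair every file under local_dir with the S3 key it would be uploaded to."""
    root = Path(local_dir)
    prefix = prefix.strip("/")
    plan = []
    for path in sorted(root.rglob("*")):
        if path.is_file():
            # Keep the folder layout, using forward slashes as S3 keys expect.
            relative = path.relative_to(root).as_posix()
            key = f"{prefix}/{relative}" if prefix else relative
            plan.append((path, key))
    return plan


def upload_assets(local_dir, bucket, prefix=""):
    """Upload the planned files with boto3 (needs AWS credentials)."""
    import boto3  # imported here so plan_uploads works without boto3 installed

    s3 = boto3.client("s3")
    for path, key in plan_uploads(local_dir, prefix):
        s3.upload_file(str(path), bucket, key)


# Example call with placeholder names:
# upload_assets("my_images", "topbucket", "subfolder")
```

Running plan_uploads on its own first lets you review the key names before anything is sent to S3.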

Input the bucket name in the Centaur data imports flow.

  • If your files are in a subfolder within the bucket, include the full path. For example, topbucket/subfolder/
  • To import only specific files, use the source_files parameter in the API call. For more details, refer to the API documentation.
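Before entering a path, it can help to confirm that it points at the files you expect. The sketch below splits a path such as topbucket/subfolder/ into a bucket name and a key prefix, then lists a few matching objects. It assumes boto3 and your own AWS credentials; the helper names are hypothetical, and the check only covers your own access, not Centaur's.

```python
def split_bucket_path(path):
    """Split 'topbucket/subfolder/' into ('topbucket', 'subfolder/')."""
    bucket, _, prefix = path.strip().lstrip("/").partition("/")
    if not bucket:
        raise ValueError("bucket name is missing")
    return bucket, prefix


def list_keys(path, limit=20):
    """Return up to `limit` object keys found under the given bucket path."""
    import boto3  # imported here so split_bucket_path works without boto3

    bucket, prefix = split_bucket_path(path)
    paginator = boto3.client("s3").get_paginator("list_objects_v2")
    keys = []
    for page in paginator.paginate(Bucket=bucket, Prefix=prefix):
        # "Contents" is absent when a page has no matching objects.
        for obj in page.get("Contents", []):
            keys.append(obj["Key"])
            if len(keys) >= limit:
                return keys
    return keys


# Example call with the placeholder path from this guide:
# print(list_keys("topbucket/subfolder/"))
```

An empty result usually means the subfolder name is misspelled or the files sit at a different level of the bucket.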

Copy the permissions policy from the data imports flow.

In your S3 bucket, go to the Permissions tab. Click Edit next to Bucket policy, and paste in the policy from your clipboard. Save changes to apply the policy.
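If you manage buckets from scripts rather than the AWS console, the same step can be sketched with boto3. This assumes the policy copied from the data imports flow is a standard JSON bucket policy document (its contents are not reproduced here) that you have saved to a file; the function names and file name are hypothetical.

```python
import json


def parse_policy(policy_text):
    """Check that the pasted text is JSON with a Statement entry; return it as a dict."""
    policy = json.loads(policy_text)  # raises ValueError on malformed JSON
    if not isinstance(policy, dict) or "Statement" not in policy:
        raise ValueError("policy JSON has no 'Statement' entry")
    return policy


def apply_policy(bucket, policy_text):
    """Attach the policy to the bucket (needs AWS credentials)."""
    import boto3  # imported here so parse_policy works without boto3

    policy = parse_policy(policy_text)
    boto3.client("s3").put_bucket_policy(Bucket=bucket, Policy=json.dumps(policy))


# Example call with placeholder names:
# with open("centaur_policy.json") as f:
#     apply_policy("topbucket", f.read())
```

Note that put_bucket_policy replaces any policy already attached to the bucket, so if the bucket has existing statements, merge them with the copied policy before applying it.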

Return to the Centaur workflow, and click Verify S3 Bucket Access. You'll get a green check mark if we've successfully connected to your bucket. Initiate your import, and we'll notify you via email once it's complete.

S3 is covered by the AWS BAA and is configured in a HIPAA-compliant manner.