Files Upgrade and Migration Guide
There are no breaking changes to the data upload or registration process. The current method of uploading and registering data continues to be supported.
This guide provides targeted instructions for customers transitioning to the new Files interface with their existing Datasets. We recommend upgrading to Files promptly to benefit from enhanced centralized data handling, flexible management, and improved metadata and permissions control. However, you can continue using your current Datasets without disruption and upgrade at your own pace. Select your preferred data management option below and follow the corresponding steps.
What should I do?
First, what happened to my existing data?
- Datasets are now in the Annotate section of the application.
After Files is enabled:
- All data on Encord is now managed through Files. Your existing Datasets have been converted into Mirrored Datasets, which automatically track their corresponding folders without displaying them in the Files interface. These Mirrored Datasets are visible in the Datasets section of the Encord platform and can be used just as you have always used Datasets. While this provides a simpler experience, it comes with some reduced flexibility. Mirrored Datasets are identifiable by the M on the Dataset icon.
Choose your data management style
Choose the guide that best fits your needs to get started with Files:
Option 1: I want to keep using Encord as I always have
Your Datasets are now referred to as Mirrored Datasets. The process for adding data to Mirrored Datasets remains unchanged in both the platform and the SDK. There are only minor adjustments to how Mirrored Datasets are created within the Encord platform. SDK users can continue creating Datasets and adding data as they have been accustomed to.
1. Create Mirrored Datasets using the Encord app:
- Click the New dataset button in the Datasets section in Annotate.
- Give your Dataset a meaningful title and description. A clear title and description keeps your data organized.
- Hover over Looking to create a mirrored dataset? and click Yes, proceed to create a Mirrored Dataset.
- Click Create dataset to create the Dataset.
Option 2: I want to add new data to Files but leave my existing data unchanged
Your existing Datasets are now Mirrored Datasets, and you can continue adding new files directly to Mirrored Datasets. Mirrored Datasets can be upgraded to standard Datasets and folders at any time.
New files are added to and stored in the Files section of the Encord platform.
Managing files and folders in the Encord app
You can add files to Encord in the Datasets screen and in the Files screen.
1. Creating folders
- Navigate to Files under the Index heading in the Encord platform.
- Click the + New folder button to create a new folder. A dialog to create a new folder appears.
- Give the folder a meaningful name and description.
- Click Create to create the folder. The folder is listed in Files.
2. Adding files to folders
- Navigate to Files under the Index heading in the Encord platform.
- Click the +Upload files button.
- Select a folder to upload your files to. You can click +New folder to create a new folder.
- Select the type of data you want to add.
- Click Import to upload the files to Encord.
3. Create a Dataset
To take advantage of all Files functionality, you must create a standard Dataset (as opposed to a Mirrored Dataset).
- Click the New dataset button in the Datasets section in Annotate.
- Give your Dataset a meaningful title and description. A clear title and description keeps your data organized.
- Click Create dataset to create the Dataset.
4. Attach files to a Dataset
- Navigate to the Datasets section under the Annotate heading.
- Click the Dataset you want to add data to.
- Click +Attach existing files.
- Select the folders containing the files you want to attach to the Dataset. To select individual files, double-click a folder to see its contents, and select the files you want to add to the Dataset.
- Click Attach data to attach the selected files to the Dataset.
Managing files and folders using the Encord SDK
1. Creating folders
The following script creates a new folder in the root directory of Files. Ensure that you:
- Replace <private_key_path> with the path to your private key.
- Replace <folder_name> with the name you want to give your folder. We recommend using unique folder names.
- Replace “A folder to store my files” with a meaningful description for your folder.
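Since folder creation comes down to a single SDK call, the step can be sketched as follows. This is a minimal sketch, not the guide's original script: it assumes user_client is an authenticated EncordUserClient and that its create_storage_folder method accepts a name and a description. The wrapper create_files_folder is a hypothetical helper introduced here for illustration. Confirm the call against the SDK reference for your installed version.

```python
from typing import Any, Optional


def create_files_folder(user_client: Any, name: str, description: Optional[str] = None) -> Any:
    """Create a folder in the root of Files and return the SDK's folder object.

    `user_client` is assumed to be an authenticated EncordUserClient.
    """
    if not name.strip():
        raise ValueError("Folder name must not be empty")
    # Assumption: create_storage_folder(name, description) creates the folder
    # in the root directory when no parent folder is given.
    return user_client.create_storage_folder(name, description)
```

In real use, user_client would typically be created with EncordUserClient.create_with_ssh_private_key using the key stored at <private_key_path>.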
2. Adding files to folders
The following scripts initiate uploads from your cloud storage to a specified folder in Encord. They work for all file types.
If “Upload is still in progress, try again later!” is returned, use the script to check the upload status to see whether the upload has finished. Ensure that you:
- Replace <private_key_path> with the path to your private key.
- Replace <integration_title> with the title of the integration you want to use.
- Replace <folder_name> with the folder name. The scripts assume that the specified folder name is unique.
- Replace path/to/json/file.json with the path to a JSON file specifying which cloud storage files should be uploaded.
- If creating a new folder, replace “A folder to store my files” with a meaningful description for your folder.
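To illustrate the JSON file mentioned in the list above, the following sketch writes a specification of cloud storage objects to upload. The layout used here (top-level keys such as videos and images, each holding entries with an objectUrl field) is an assumption about the expected format, and the helper name write_upload_spec and the example URLs are hypothetical. Check the format against the Encord documentation before relying on it.

```python
import json
from pathlib import Path
from typing import Dict, Iterable, List


def write_upload_spec(
    path: str,
    videos: Iterable[str] = (),
    images: Iterable[str] = (),
) -> Dict[str, List[dict]]:
    """Write a JSON file listing cloud storage objects and return its content."""
    spec: Dict[str, List[dict]] = {}
    video_entries = [{"objectUrl": url} for url in videos]
    image_entries = [{"objectUrl": url} for url in images]
    # Only include keys that actually have entries (assumed layout, see lead-in).
    if video_entries:
        spec["videos"] = video_entries
    if image_entries:
        spec["images"] = image_entries
    if not spec:
        raise ValueError("Provide at least one object URL")
    Path(path).write_text(json.dumps(spec, indent=2), encoding="utf-8")
    return spec


if __name__ == "__main__":
    # Hypothetical bucket URLs, for illustration only.
    write_upload_spec(
        "file.json",
        videos=["https://my-bucket.s3.eu-west-2.amazonaws.com/videos/a.mp4"],
        images=["https://my-bucket.s3.eu-west-2.amazonaws.com/images/b.jpg"],
    )
```

The resulting file path is what replaces path/to/json/file.json in the upload script.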
3. Create a Dataset
The following example creates a Dataset called “Houses” that expects data hosted on AWS S3.
- Replace <private_key_path> with the file path for your private key.
- Replace “Houses” with the name you want your Dataset to have.
4. Attach files to a Dataset
Now that you have uploaded your data and created a Dataset, it's time to add your files to the Dataset. The following scripts add all files in a specified folder to a Dataset.
- Replace <private_key_path> with the path to your private key.
- Replace <folder_name> with the name of your Storage folder.
- Replace <dataset_hash> with the hash of the Dataset you want to add the data units to.
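Because the folder is looked up by name, an ambiguous name could attach the wrong files. The following is a minimal sketch of a guard against that, assuming each folder object returned by a search exposes a name attribute. The helper pick_unique_by_name is hypothetical and not part of the Encord SDK.

```python
from typing import Iterable, TypeVar

T = TypeVar("T")


def pick_unique_by_name(candidates: Iterable[T], name: str) -> T:
    """Return the single candidate whose .name equals `name`, else raise."""
    matches = [c for c in candidates if getattr(c, "name", None) == name]
    if len(matches) != 1:
        raise LookupError(
            f"Expected exactly one folder named {name!r}, found {len(matches)}"
        )
    return matches[0]
```

Failing loudly on zero or multiple matches is a deliberate choice here: it is safer to stop than to attach files from a folder that merely shares a name.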
Option 3: I want to fully transition to Files
First, upgrade your Mirrored Datasets:
Upgrading Mirrored Datasets is a one-way action that changes how files are added to Datasets. We recommend uploading files through the Files interface and then attaching them to Datasets, as outlined below. However, you can still upload files directly to Datasets by specifying the folder in the upload call, as shown in the SDK reference.
- Navigate to Datasets in the Annotate section of the Encord platform.
- Select the Mirrored Dataset you want to upgrade.
- Click the Upgrade to standard dataset and folder button.
New files are added to and stored in the Files section of the Encord platform.
Managing files and folders in the Encord app
You can upload files to Encord in the Datasets screen and in the Files screen.
1. Creating folders
- Navigate to Files under the Index heading in the Encord platform.
- Click the + New folder button to create a new folder. A dialog to create a new folder appears.
- Give the folder a meaningful name and description.
- Click Create to create the folder. The folder is listed in Files.
2. Adding files to folders
- Navigate to Files under the Index heading in the Encord platform.
- Click the +Upload files button.
- Select a folder to upload your files to. You can click +New folder to create a new folder.
- Select the type of data you want to add.
- Click Import to upload the files to Encord.
3. Create a Dataset
To take advantage of all Files functionality, you must create a standard Dataset (as opposed to a Mirrored Dataset).
- Click the New dataset button in the Datasets section in Annotate.
- Give your Dataset a meaningful title and description. A clear title and description keeps your data organized.
- Click Create dataset to create the Dataset.
4. Attach files to a Dataset
- Navigate to the Datasets section under the Annotate heading.
- Click the Dataset you want to add data to.
- Click +Attach existing files.
- Select the folders containing the files you want to attach to the Dataset. To select individual files, double-click a folder to see its contents, and select the files you want to add to the Dataset.
- Click Attach data to attach the selected files to the Dataset.
Managing files and folders using the Encord SDK
1. Creating folders
The following script creates a new folder in the root directory of Files. Ensure that you:
- Replace <private_key_path> with the path to your private key.
- Replace <folder_name> with the name you want to give your folder. We recommend using unique folder names.
- Replace “A folder to store my files” with a meaningful description for your folder.
2. Adding files to folders
The following scripts initiate uploads from your cloud storage to a specified folder in Encord. They work for all file types.
If “Upload is still in progress, try again later!” is returned, use the script to check the upload status to see whether the upload has finished. Ensure that you:
- Replace <private_key_path> with the path to your private key.
- Replace <integration_title> with the title of the integration you want to use.
- Replace <folder_name> with the folder name. The scripts assume that the specified folder name is unique.
- Replace path/to/json/file.json with the path to a JSON file specifying which cloud storage files should be uploaded.
- If creating a new folder, replace “A folder to store my files” with a meaningful description for your folder.
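Rather than re-running the status check by hand whenever the in-progress message appears, the check can be wrapped in a polling loop. The sketch below is generic and makes no Encord SDK calls itself: check is a hypothetical callable, supplied by you, that wraps the status check and returns None while the upload is still in progress. The name wait_until_done and the default interval and attempt limits are assumptions chosen for illustration.

```python
import time
from typing import Callable, Optional, TypeVar

T = TypeVar("T")


def wait_until_done(
    check: Callable[[], Optional[T]],
    interval_seconds: float = 5.0,
    max_attempts: int = 60,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call `check` until it returns a non-None result or attempts run out."""
    for attempt in range(max_attempts):
        result = check()
        if result is not None:
            return result
        # Wait before the next attempt, but not after the final one.
        if attempt < max_attempts - 1:
            sleep(interval_seconds)
    raise TimeoutError(f"Upload not finished after {max_attempts} checks")
```

Passing sleep as a parameter keeps the helper easy to test without real delays.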
3. Create a Dataset
The following example creates a Dataset called “Houses” that expects data hosted on AWS S3.
- Replace <private_key_path> with the file path for your private key.
- Replace “Houses” with the name you want your Dataset to have.
4. Attach files to a Dataset
Now that you have uploaded your data and created a Dataset, it's time to add your files to the Dataset. The following scripts add all files in a specified folder to a Dataset.
- Replace <private_key_path> with the path to your private key.
- Replace <folder_name> with the name of your Storage folder.
- Replace <dataset_hash> with the hash of the Dataset you want to add the data units to.