Direct access integrations enable you to register data to Encord without requiring Encord to sign the URLs. Consequently, customers are tasked with providing direct, pre-signed URLs when utilizing a direct access integration. Direct access integrations support all data modalities.
Public Dataset Aggregation: Researchers or analysts may need to aggregate public datasets from various sources for analysis or modeling purposes. Direct access integrations allow them to seamlessly gather data from publicly accessible sources without the need for authentication, simplifying the data acquisition process.
Third-Party Data Integration: Businesses often rely on third-party data providers for enriching their datasets or enhancing analytical insights. With direct access integrations, organizations can easily incorporate data from external sources by providing pre-signed URLs, streamlining the integration process and enabling timely access to valuable data.
Private Data Access: Organizations may have proprietary datasets stored in private cloud environments with restricted access controls, such as IP whitelists. Direct access integrations with Strict client-only access enable authorized users to directly upload and access this data in Encord without compromising security or violating access policies.
Data protection: Direct access paired with Strict client-only access ensures compliance with stringent security requirements and regulatory standards, at the expense of some of Encord’s more advanced features.
Select Direct Access from the list of possible integration types.
Click Create.
After a Direct Access integration is created, it can be used to register data.
Create a Direct Access Integration
Create a JSON File for Data Registration
Create a JSON file to specify the files you want to register with Encord. The objectURL
s in the JSON file must be pre-signed.
Import your Direct Access Data
Register data in the UI
Register data using the SDK
To use your data in Encord, it must be uploaded to the Encord Files storage. Once uploaded, your data can be reused across multiple Projects and contain no labels or annotations themselves. Files stores your data, while Projects store your labels. The following script creates a folder in Files and uses your AWS integration to register data in that folder.
The following script creates a new folder in Files and initiates uploads from AWS. It works for all file types.
Upload is still in progress, try again later!
is returned, use the
script to check the upload status to see whether the upload has finished.Ensure that you:
<private_key_path>
with the path to your private key.<integration_title>
with the title of the integration you want to use.<folder_name>
with the folder name. The scripts assume that the specified folder name is unique.path/to/json/file.json
with the path to a JSON file specifying which cloud storage files should be uploaded.A folder to store my files
with a meaningful description for your folder."my": "folder_metadata"
with any metadata you want to add to the folder.The script has several possible outputs: