Default Embeddings
Embeddings are calculated using our purpose-built models, which perform well across a broad range of tasks. For highly specialized domains, custom embeddings may be more appropriate. Our built-in embeddings power the following features:
- Natural language search (not supported for custom embeddings)
- Image similarity search
- Embeddings view
Custom Embeddings Support
We currently support embeddings of:
- 1 to 4,096 dimensions for data curation.
- 1 to 2,000 dimensions for label validation.
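As a rough illustration of the limits above, the following sketch checks a vector's length before upload. The function name, the use-case labels, and the `MAX_DIMS` table are hypothetical helpers written for this example; they are not part of the Encord SDK.

```python
# Dimension limits as stated in this document.
# The dictionary keys are hypothetical labels chosen for this sketch.
MAX_DIMS = {"data_curation": 4096, "label_validation": 2000}


def check_embedding(vector, use_case="data_curation"):
    """Return the vector as a list of floats if its length is within the limit."""
    if use_case not in MAX_DIMS:
        raise ValueError(f"unknown use case: {use_case!r}")
    values = [float(v) for v in vector]
    limit = MAX_DIMS[use_case]
    if not 1 <= len(values) <= limit:
        raise ValueError(
            f"{use_case} embeddings must have 1 to {limit} dimensions, "
            f"got {len(values)}"
        )
    return values
```

Running a check like this locally before import can catch vectors that are empty or too long for the intended use.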
Use Custom Embeddings
To bring your custom embeddings into Encord, you first need to create a key in your metadata schema. After the key is in your schema, you can import your custom embeddings. Custom embeddings can be used for data curation and label validation. To use custom embeddings:
- Create a new embedding type in your Schema.
- Upload your embeddings.
- Select your custom embeddings from the Embeddings view.
Before you can use embedding plots with your custom embeddings, you need to configure your root Folder in Files.
1. Create New Embedding Type
A key is required in your custom metadata schema for your embeddings. You can use any string as the key for your embeddings. We strongly recommend that you use a string that is meaningful. If you do not include a key in your metadata schema, your imported embeddings are treated as strings. Use add_embedding to add an embedding to your metadata schema.
2. Upload Embeddings
With the key in the custom metadata schema ready, we can now import our embeddings. Custom embedding sizes are flexible and can be set anywhere between 1 and 4,096. You can import embeddings after you have added your data or during your data registration. Your key frames (frames specified with or without embeddings) always appear in Encord, regardless of what sampling rate you specify.
If config is not specified, the sampling_rate is 1 frame per second and the keyframe_mode is frame.
Specifying a sampling_rate of 0 imports only the first frame and all keyframes of your video.
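The sampling behavior described above can be sketched as follows. This is only an illustration of the stated rules, not Encord's actual implementation: the function name, the `fps` and `total_frames` parameters, and the rounding rule (flooring to the nearest earlier frame) are assumptions, and keyframes are assumed to be given as frame indices (keyframe_mode of frame).

```python
def frames_to_import(total_frames, fps, keyframes=(), sampling_rate=1.0):
    """Return the sorted frame indices that would be imported (hypothetical helper).

    sampling_rate is in frames per second; 0 means "first frame and keyframes only".
    """
    if total_frames <= 0 or fps <= 0 or sampling_rate < 0:
        raise ValueError("total_frames and fps must be positive, sampling_rate >= 0")

    selected = {0}  # the first frame is always included
    if sampling_rate > 0:
        step = fps / sampling_rate  # video frames between consecutive samples
        i = 0
        while int(i * step) < total_frames:
            selected.add(int(i * step))  # assumption: floor to an existing frame
            i += 1

    # Keyframes always appear, regardless of the sampling rate.
    selected.update(k for k in keyframes if 0 <= k < total_frames)
    return sorted(selected)
```

For a 90-frame video at 30 fps with keyframes 7 and 45, the default rate yields `[0, 7, 30, 45, 60]`, while a sampling_rate of 0 yields `[0, 7, 45]`.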
Import while registering videos
This JSON file imports embeddings while registering your data with Encord from a cloud integration. config is optional when importing your custom embeddings.
Update specific videos
Import while registering data units
This JSON file imports embeddings while registering your data with Encord from a cloud integration.
Update specific data units
The custom embeddings format for images, text files, PDFs, and audio files follows the same format as importing custom metadata.
3. Select your Custom Embeddings
After you import your custom embeddings, they are available for:
- Filtering using custom embeddings
- Similarity searches using your custom embeddings
- Embedding view and 2D plots with selection-based workflows
Before you can use embedding plots with your custom embeddings, you need to configure your root Folder in Files.
Compute Metrics and Embeddings
Before you can perform filtering, use similarity searches, or use embedding plots with your custom embeddings, you need to compute Metrics and Embeddings.
- Navigate to Projects > Explore.
- Click Metrics & Embeddings.
- Click Compute for either option.
- Specify the following:
- Similarity & Natural language search and quality metrics: Enable to compute embeddings and quality metrics. Access quality metrics for filtering and sorting.
- Select embeddings: Default embeddings are computed by Encord. Alternatively, import and select your own custom embeddings.
- Embeddings plot, Diversity and Uniqueness metrics: Enable to compute UMAP reduction to generate 2D embeddings plots to visualize your data. Also access diversity and uniqueness metrics for curation.
- Click Start computation.
Filtering with Custom Embeddings
You must calculate metrics and embeddings before you can filter with your custom embeddings.
- Click the Filter dropdown or press F.
- Click Custom Embeddings from the menu.
- Select your custom embedding to filter your data.
- Select True to display images, frames, or videos with the custom embeddings.
Similarity Searches with Custom Embeddings
You must calculate metrics and embeddings before using the similarity search with your custom embeddings.
- Click the Embeddings icon in the Explorer. The Embeddings screen appears.
- Select the embedding you want to use from the Select custom embeddings menu.
- Click the Grid icon.
- Hover over an image or frame with the custom embedding.
- Click the Similarity Search icon. Images and frames are sorted by similarity.
Adjust Similarity Search Distance
- Click the Embeddings icon in the Explorer. The Embeddings screen appears.
- Select the embedding you want to use from the Select custom embeddings menu.
- Click the Grid icon.
- Hover over an image or frame with the custom embedding.
- Click the Similarity Search icon. Images and frames are sorted by similarity, and a Distance filter appears.
- Adjust the Distance filter slider to change the similarity search results.
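Conceptually, a similarity search with a distance filter ranks items by their distance to the selected item and then drops anything beyond a threshold. The sketch below shows that idea using cosine distance. The choice of cosine distance, the function names, and the `max_distance` parameter are assumptions made for illustration; this document does not state which distance measure Encord uses.

```python
import math


def cosine_distance(a, b):
    """Cosine distance in [0, 2]; 0 means the vectors point the same way."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine distance is undefined for zero vectors")
    return 1.0 - dot / (norm_a * norm_b)


def similarity_search(query_id, embeddings, max_distance=None):
    """Rank all other items by distance to the query item (hypothetical helper).

    embeddings maps an item id to its embedding vector.
    max_distance plays the role of the Distance filter slider.
    """
    query = embeddings[query_id]
    results = []
    for item_id, vector in embeddings.items():
        if item_id == query_id:
            continue
        if len(vector) != len(query):
            raise ValueError(f"dimension mismatch for {item_id!r}")
        distance = cosine_distance(query, vector)
        if max_distance is None or distance <= max_distance:
            results.append((item_id, distance))
    # Closest items first, mirroring a grid sorted by similarity.
    return sorted(results, key=lambda pair: pair[1])
```

Lowering `max_distance` keeps only the nearest neighbors, which is the effect the slider is described as having on the results.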
Embedding View
You must calculate metrics and embeddings before you can view embedding plots.
Notice how images are clustered around certain regions. By defining a rectangular area on the plot, users can quickly isolate and analyze data points within that defined region. This approach facilitates the exploration of commonalities among these samples.
Hover over clusters or individual data points on the plot to visually check frames.
Upon selecting a region, the content within the Explorer page adjusts accordingly. Various actions can be executed with the chosen group:
- Use Collections to tag and group images.
- Establish subsets similar to these and then conduct comparisons.
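The rectangular selection described above amounts to keeping the points whose 2D coordinates fall inside the chosen bounds. The following is a minimal sketch of that idea, assuming each item already has 2D plot coordinates; the function name and the data layout are hypothetical.

```python
def select_in_rectangle(points, x_range, y_range):
    """Return ids of points inside the rectangle, edges included (hypothetical helper).

    points maps an item id to its (x, y) position on the 2D embeddings plot.
    """
    x_min, x_max = sorted(x_range)  # accept the corners in either order
    y_min, y_max = sorted(y_range)
    return [
        item_id
        for item_id, (x, y) in points.items()
        if x_min <= x <= x_max and y_min <= y <= y_max
    ]


# Example usage with made-up coordinates.
points = {"img_1": (0.1, 0.2), "img_2": (0.9, 0.8), "img_3": (0.15, 0.25)}
selected = select_in_rectangle(points, (0.0, 0.3), (0.0, 0.3))
```

Here `selected` contains `img_1` and `img_3`, the two points that sit in the lower-left region of the plot.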

