Skip to main content

Finding Data Outliers

Find outliers in your dataset using Encord Active's Data Quality tab

With Encord Active, you can quickly find image outliers for pre-defined metrics, custom metrics, and label classes. Encord Active finds outliers using precomputed Interquartile ranges.

Prerequisites: Dataset

Setup

If you haven't installed Encord Active, visit installation. In this workflow we will be using the BDD validation dataset.

Steps

1. Find outliers

Navigate to the Data Quality > Summary tab. Here, the metrics will be presented as expandable panes.

Click on a metric to get deeper insight into moderate outliers and severe outliers. The most severe outliers are presented first in the pane.

Use the slider to navigate your data from most severe outlier to least severe.

data-quality-outliers.png

2. Tag outliers

When you have identified outliers of interest, use the tagging or bulk tagging feature to save a group of images.

data-quality-outliers-tagging.png

After creating a tagged image group, you can access it at the bottom of the left sidebar in the Actions tab.

3. Act on outliers

Within the Actions tab, click Filter dataframe on and select tags. Next, choose the tags you would like to export, relabel, augment, review, or delete from your dataset.

data-quality-outliers-action.png