Audio Classifications
Learn how Classifications and Audio files work using the Encord SDK.
Audio files support annotation using classifications. Classifications are applied to a range (duration in milliseconds) on the audio file. Using the Encord SDK you can import classifications directly to audio files that already exist in an Annotate Project. You can also use the SDK to view classifications that exist on an audio file.
Critical Information
Using classifications on an audio file is smilar to using labels or classifications on a video file. There are some critical differences though:
-
Only classifications are supported for audio files. Object labels are not supported on audio files. This means that
frame_view
andframe_view_metadata
are NOT supported. -
1 “frame” = 1 millisecond for audio files when importing classifications.
-
get_annotation
only works whenframe=0
. This is because audio files do not have frames. This means ALL annotations made on the classification instance share the same annotation data. -
When creating a classification for an audio file, the
classification_instance
uses therange_only=True
argument.
For example, Classification.create_instance(range_only=True)
or classification_instance = ClassificationInstance(range_only=True)
- To view the classifications present on a particular range, you MUST use
get_ranges
to loop through the classifications.
View Classifications on Audio Files
Export Classifications from Audio Files
Import Classifications to Audio Files
Was this page helpful?