Text and HTML
Learn how to label HTML and other text files in Annotate’s Label Editor.
Text files include formats such as .txt
, .md
, .rst
, .xml
, .html
, .json
, and more. However, HTML files are categorized slightly differently from other text file types. As a result, this documentation is divided into separate Text and HTML sections.
The following types of labels can be applied to all text files, and at least one must be present in your project’s ontology to enable text labeling:
- Text Region: Object labels applied to a specific region within the text file.
- Classifications: Classification labels applied to the entire file.
Label Text and HTML Files
Text
HTML
Encord supports both raw HTML files and single-extension HTML files. The key difference is that single-extension HTML files include all the necessary elements to render the webpage, such as CSS and JavaScript.
Using Relation Text Attributes - Text and HTML
Relation text attributes link an object to another object. In images or videos this could be linking a chicken’s body to its wing. In PDFs or text documents (text or HTML files) this could be linking words with measurements, or linking words with other words.
Your Ontology must MUST contain text attributes with the Relation feature enabled if you want to use relation text attributes.
While annotating, you can create a relationship between any object with a relation text attribute and any other object. After you create a relation between objects a workflow arrow points from the object with the relation text attribute to the other object.