Types of Annotation
Overview of supported annotation formats: bounding box, polygon, segmentation, and more.
Last updated
Overview of supported annotation formats: bounding box, polygon, segmentation, and more.
Last updated
JTheta supports manual bounding box creation, polygon annotation, instance segmentation, and semantic segmentation.
In the context of annotation on the JTheta platform, a bounding box refers to a rectangular annotation tool used to mark and define regions of interest (ROIs) within an image. These regions are often used for labeling objects, text, or features in an image for purposes like:
Object Detection: Identifying specific objects in an image (e.g., cars, people, animals)
Drawing the Box: You click and drag on the image to draw a rectangular region around the object or area of interest.
Labeling: Assign a specific label or tag to the bounding box to describe what it contains (e.g., "Car," "Person").
Adjustments: Resize or move the box as needed for accuracy.
Bounding boxes are an essential part of annotation workflows, enabling structured and precise data labeling for documentation or AI training purposes. Let me know if you'd like to explore the features in more detail!
Polygon
In the JTheta annotation platform, a polygon is a flexible annotation tool used to outline objects or regions in an image with irregular or non-rectangular shapes. It is especially useful for annotating objects that do not fit neatly into rectangular bounding boxes, such as:
Curved or Complex Shapes: Roads, rivers, or irregularly shaped objects.
Precise Boundaries: Objects like animals, vehicles, or building structures with unique contours.
Fine-Grained Annotation: Areas requiring detailed outlines, like facial features or product components.
Placing Points:
Click on the image to place points around the object's perimeter.
Each point becomes a vertex of the polygon.
Connecting Points:
As you place points, they are automatically connected by straight lines to form the polygon's edges.
Completing the Polygon:
Close the shape by connecting the last point to the first, forming a complete polygon.
Labeling:
Assign a label or category to the polygon to describe the annotated object or region.
Editing:
Adjust points and edges as needed to refine the shape for better accuracy.
Polygons provide higher precision compared to bounding boxes and are commonly used in applications like object segmentation, AI training datasets, and detailed documentation.
Auto Snapping Polygon - Auto-Snapping using SAM
Auto-snapping polygon is a feature in image annotation tools that helps users draw polygons more efficiently by automatically aligning polygon vertices (points) to the nearest relevant edges or boundaries in the image. This feature is particularly useful when annotating objects with well-defined edges or contours, as it reduces the manual effort of precisely placing each vertex and ensures greater accuracy.
Edge Detection: Automatically detects the edges or contours of objects in the image.
Precision: Vertices snap to the nearest boundary, improving annotation accuracy.
Time-Saving: Reduces the time required for manual adjustments of vertices.
User Control: Often includes toggles to enable or disable snapping based on user preference.
Smoothness: Provides a smoother annotation experience, especially for complex shapes.
This feature is commonly used in machine learning projects for training data preparation, where accurate object boundaries are critical for model performance.
"In jtheta, we provide an Auto-Snapping Polygon Tool feature, as shown in the image below. This tool simplifies the annotation process by automatically generating a polygon around the object with just a click. Once you point to an object and click, the tool detects the object's edges and creates a polygon, significantly reducing manual effort."
Semantic Segmentation
Semantic segmentation is a computer vision task that involves dividing an image into regions and assigning a label to every pixel based on the object or category it belongs to. This process creates a pixel-level understanding of the image, where each pixel is classified into predefined categories, such as "car," "tree," "road," or "sky."
Pixel-Level Classification: Every pixel in the image is assigned a class label.
Uniform Labeling: All pixels belonging to the same class are grouped under the same label, regardless of whether they are part of separate instances (e.g., all cars are labeled as "car").
Applications:
Medical Imaging: Segmenting regions like organs or tumors.
Satellite Imaging: Classifying land, water, vegetation, and urban areas.
In an image of a city street:
The road pixels are labeled as "road."
The car pixels are labeled as "car."
The building pixels are labeled as "building." Semantic segmentation would not differentiate between individual cars but would classify all as "car."
Instance Segmentation
Instance segmentation is a computer vision task that identifies, delineates, and classifies individual instances of objects in an image. Unlike semantic segmentation, which labels all pixels belonging to a category without differentiating between individual objects, instance segmentation provides both pixel-level masks and instance-specific information for each detected object.
Object Differentiation: Distinguishes between different objects of the same class (e.g., two cars in the same image will have separate masks and labels).
Pixel-Level Accuracy: Provides precise object boundaries for each instance.
Classification and Localization: Combines object detection (bounding boxes) and segmentation to locate and classify objects in the image.
Semantic Segmentation: Labels pixels by class (e.g., "car," "tree," "road"), without distinguishing between separate objects of the same class.
Instance Segmentation: Identifies individual objects within the same class and provides separate masks for each instance.
In an image of a parking lot with three cars:
Semantic Segmentation: All car pixels are labeled as "car."
Instance Segmentation: Each car is assigned a unique identifier (e.g., "car 1," "car 2," "car 3") and has its own mask.
Instance segmentation is a key component of many real-world AI systems, where both the identity and precise shape of objects are critical.