Introducing CholecInstanceSeg: A New Dataset for Surgical Tool Segmentation
A dataset created to improve tool segmentation in laparoscopic surgeries.
In laparoscopic and robotic surgery, accurately identifying and segmenting the tools in use is essential for computer-assisted interventions that make operations safer and easier. Although some public datasets cover routine surgical procedures, they often lack detailed tool labels. Furthermore, most existing tool segmentation datasets come from surgeries performed on pigs rather than people, which limits their usefulness for everyday clinical practice.
To fill this gap, we present CholecInstanceSeg, a large dataset created for tool instance segmentation in laparoscopic surgery. Unlike previous datasets, it provides new labels for surgeries performed on patients. CholecInstanceSeg contains 41,933 annotated frames taken from 85 clinical surgeries and features 64,400 tool instances, each labeled with a segmentation mask and an instance ID. We conducted rigorous quality checks to ensure the accuracy of our labels. The dataset is intended to support the development of computer programs that can assist in surgery.
Background
Minimally invasive surgeries (MIS) have become more common due to their advantages compared to open surgeries. These advantages include less blood loss, less pain for patients, quicker recovery times, and fewer complications after surgery. However, MIS often relies on cameras for visualization, which can limit the surgeon's view and make complex procedures challenging.
To help overcome these visualization problems, computer-assisted intervention techniques are being developed. One of the main challenges is accurately identifying surgical tools in these limited views. This capability is essential for creating technologies that can assist surgeons or even perform certain tasks autonomously.
Currently available datasets that focus on surgical tool segmentation have several limitations. Many of them focus on animal surgeries rather than human ones, which limits their usefulness in daily clinical situations. Others provide only basic segmentation information for one type of tool or do not give detailed information about individual tools.
To address these concerns, we created the CholecInstanceSeg dataset, which features 41,933 frames from 85 laparoscopic surgeries. To date, it is the largest open-access dataset for tool instance segmentation in surgery.
Data Sources
CholecInstanceSeg was created using frames from three existing laparoscopic surgery datasets: Cholec80, CholecT50, and CholecSeg8k. These datasets contain a total of 85 video sequences.
When referring to the source datasets, we separate videos from image sequences. Videos are audio-visual files, while image sequences consist of individual frames extracted from videos. The frames used in CholecInstanceSeg come from public datasets that already had the necessary ethical approvals for use.
CholecSeg8k includes 8,080 frames from 17 videos in the Cholec80 dataset. This dataset predominantly offers labels for semantic segmentation but lacks specific instance information. The CholecT50 dataset is made up of 50 videos and has more detailed labeling, including interaction details between tools and tissues; however, it also lacks the comprehensive tool labeling we aimed to provide.
Cholec80 consists of 80 videos of laparoscopic surgeries and includes some information on tool presence, but it does not provide complete tool annotations.
Segmentation Labels
CholecInstanceSeg focuses on labeling the types of tools used in laparoscopic surgeries. Based on our observations, we recorded seven distinct tool categories: Graspers, Bipolar tools, Hooks, Clippers, Scissors, Irrigators, and Snares. The Snare category is an addition that was not included in previous datasets due to its unique characteristics and functionality.
When labeling, we included all tools but excluded some items that could complicate the dataset, such as surgical needles and other items not directly categorized as surgical tools. We focused on labeling tools only when they were visible and did not label instrument ports unless they were visibly holding a tool.
We also established guidelines to maintain a high quality of annotations. While we aimed for high accuracy, we allowed for some minor imperfections in the labeled boundaries, as complete perfection wasn't feasible given time and budget constraints. Our goal was to ensure that the essential characteristics of each tool could still be identified.
Annotation Tools and Techniques
To gather annotations for CholecInstanceSeg, we used a customized version of an open-source annotation tool based on a popular project called LabelMe. This tool is tailored for interactive segmentation and facilitates identifying surgical instruments.
In some cases, the annotation tool had to be switched to manual input when the automated system encountered difficulties. This challenge often arose in complex scenes where tools and backgrounds were similar in color or when visibility was poor due to smoke or glare.
One of the datasets we used, CholecSeg8k, had some semantic segmentation labels already. We modified these to include instance IDs so we could distinguish different tools in the same image. To convert semantic segmentation to instance-level annotations, we used a technique called connected component analysis.
The process involved organizing tools by category and identifying any overlaps or occlusions that could confuse the software. We created multiple workflows to correctly assign instance IDs and to identify problems, such as errors in labeling or missing tool annotations.
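The conversion from semantic labels to instance labels described above can be sketched as follows. This is a minimal illustration of connected component analysis, not the authors' actual pipeline: the function name `semantic_to_instances`, the `min_area` parameter, the class IDs, and the toy frame are all assumptions made for the example. Each connected region of a given tool class receives its own instance ID.

```python
from collections import deque


def semantic_to_instances(semantic_mask, tool_class_ids, min_area=1):
    """Label connected components of each tool class as separate instances.

    semantic_mask: 2D list of integer class IDs (0 = background).
    Returns (instance_map, instance_to_class); instance ID 0 means "no tool".
    """
    height, width = len(semantic_mask), len(semantic_mask[0])
    instance_map = [[0] * width for _ in range(height)]
    visited = [[False] * width for _ in range(height)]
    instance_to_class = {}
    next_id = 1
    tool_classes = set(tool_class_ids)

    for row in range(height):
        for col in range(width):
            class_id = semantic_mask[row][col]
            if visited[row][col] or class_id not in tool_classes:
                continue
            # Breadth-first flood fill over 8-connected pixels of the same class.
            queue = deque([(row, col)])
            visited[row][col] = True
            pixels = []
            while queue:
                r, c = queue.popleft()
                pixels.append((r, c))
                for dr in (-1, 0, 1):
                    for dc in (-1, 0, 1):
                        nr, nc = r + dr, c + dc
                        if (0 <= nr < height and 0 <= nc < width
                                and not visited[nr][nc]
                                and semantic_mask[nr][nc] == class_id):
                            visited[nr][nc] = True
                            queue.append((nr, nc))
            if len(pixels) < min_area:
                continue  # treat tiny fragments as label noise (assumed rule)
            for r, c in pixels:
                instance_map[r][c] = next_id
            instance_to_class[next_id] = class_id
            next_id += 1
    return instance_map, instance_to_class


# Toy frame (hypothetical): two separate class-1 regions and one class-2 region.
toy = [
    [0, 0, 0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0, 1, 1, 0],
    [0, 1, 1, 0, 0, 1, 1, 0],
    [0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 2, 2, 2, 2, 0, 0],
]
instances, classes = semantic_to_instances(toy, tool_class_ids=[1, 2])
print(classes)
```

Connected components alone cannot separate two same-class tools that touch, and they split a single tool into several pieces when another object occludes it. This is consistent with the need for the additional workflows and manual checks described above. In practice, a library routine such as `scipy.ndimage.label` would likely replace the hand-written flood fill.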
To enhance annotation speed and quality, we employed a semi-automatic approach. This method combined human input and AI-driven model training to assist in generating instance segmentation labels for a larger number of images.
Annotation Process
The overall process for creating the CholecInstanceSeg dataset can be summarized in several key steps:
- Annotation of Instance-CholecSeg8k: We converted the existing dataset with semantic segmentation labels to our format with instance IDs. This took about two weeks and involved several quality checks.
- Creating the Labelling Protocol: Over three weeks, we developed guidelines to tackle challenges that arose during the initial annotation, including strategies for handling highly complex cases.
- Annotation of Instance-CholecT50-Sparse: This step took three weeks and involved using our developed guidelines to accurately label the data.
- Annotation of Instance-CholecT50-Full: This semi-automatic approach required a further three weeks. It built on the previous annotations and included iterative training and corrections to enhance quality.
- Annotation of Instance-Cholec80: To ensure diversity within the dataset, we spent one week manually annotating this partition.
- Final Quality Control: We allocated two weeks for conducting thorough reviews of all annotations to ensure accuracy and consistency.
During the annotation, a primary annotator managed the majority of tasks, while a secondary annotator with more experience provided support in quality checks and annotation guidance. An expert team also assisted in resolving any unclear situations.
Dataset Overview
CholecInstanceSeg is publicly available and is structured in a way that allows researchers to easily access and utilize the data. The dataset is arranged into training and validation directories, with images in PNG format and annotations in JSON format. The annotations follow a particular structure that indicates class labels, polygon coordinates, and instance IDs.
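As a rough illustration of how such annotations might be read, the sketch below parses one frame's JSON and groups polygons by class label and instance ID. The field names (`shapes`, `label`, `group_id`, `points`, `imagePath`) follow the LabelMe JSON layout and are an assumption here, since the annotation tool was LabelMe-based. The file name and coordinates are hypothetical, and the published schema should be checked before relying on this.

```python
import json
from collections import defaultdict

# Hypothetical annotation for one frame. The field names mirror LabelMe's
# JSON layout and are an assumption, not the confirmed dataset schema.
example_json = """
{
  "imagePath": "frame_000123.png",
  "shapes": [
    {"label": "grasper", "group_id": 1, "points": [[10, 10], [60, 10], [60, 30], [10, 30]]},
    {"label": "grasper", "group_id": 2, "points": [[100, 40], [150, 40], [150, 80]]},
    {"label": "hook", "group_id": 1, "points": [[200, 100], [240, 100], [240, 160], [200, 160]]}
  ]
}
"""


def polygon_area(points):
    """Area of a simple polygon via the shoelace formula."""
    total = 0.0
    for (x1, y1), (x2, y2) in zip(points, points[1:] + points[:1]):
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0


def load_instances(annotation_text):
    """Group polygons by (class label, instance ID)."""
    annotation = json.loads(annotation_text)
    grouped = defaultdict(list)
    for shape in annotation["shapes"]:
        key = (shape["label"], shape["group_id"])
        grouped[key].append(shape["points"])
    return dict(grouped)


frame_instances = load_instances(example_json)
for (label, instance_id), polygons in sorted(frame_instances.items()):
    area = sum(polygon_area(p) for p in polygons)
    print(f"{label} #{instance_id}: {len(polygons)} polygon(s), area {area:.0f} px")
```

Keying each instance by the pair of class label and instance ID lets an instance that is split into several polygons, for example by occlusion, still be treated as a single tool.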
To maintain consistency across our dataset and related datasets, we preserved video names and frame IDs, which will allow for easier integration between our dataset and existing resources.
Performance and Validation
To demonstrate the usefulness of the CholecInstanceSeg dataset, we trained two baseline models: Mask R-CNN and Mask2Former. These models were selected because they generalize well across segmentation tasks. We measured their performance using established metrics to provide benchmarks for future research.
In addition to performance validation through model training, we conducted label agreement analyses to confirm accuracy. Two annotators independently labeled a subset of the dataset to evaluate inter-annotator agreement. Results showed a high level of consistency, indicating that our labeling process was effective.
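One common way to quantify agreement between two annotators on the same tool is the intersection over union (IoU) of their masks. The sketch below assumes IoU as the agreement measure, which this summary does not confirm. The function name `mask_iou` and the toy masks are hypothetical.

```python
def mask_iou(mask_a, mask_b):
    """Intersection over union of two binary masks given as 2D lists."""
    intersection = 0
    union = 0
    for row_a, row_b in zip(mask_a, mask_b):
        for a, b in zip(row_a, row_b):
            intersection += int(bool(a) and bool(b))
            union += int(bool(a) or bool(b))
    if union == 0:
        return 1.0  # both masks empty: count as full agreement (assumed convention)
    return intersection / union


# Toy masks: annotator B's outline is shifted one column to the right.
annotator_a = [
    [0, 0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0, 0],
    [0, 1, 1, 1, 0, 0],
    [0, 1, 1, 1, 0, 0],
    [0, 1, 1, 1, 0, 0],
    [0, 0, 0, 0, 0, 0],
]
annotator_b = [
    [0, 0, 0, 0, 0, 0],
    [0, 0, 1, 1, 1, 0],
    [0, 0, 1, 1, 1, 0],
    [0, 0, 1, 1, 1, 0],
    [0, 0, 1, 1, 1, 0],
    [0, 0, 0, 0, 0, 0],
]
agreement = mask_iou(annotator_a, annotator_b)
print(f"IoU between annotators: {agreement:.2f}")
```

Here the two 12-pixel masks share 8 pixels and cover 16 pixels in total, so the IoU is 0.5. Averaging such scores over matched instances gives a single agreement figure for a subset of frames.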
We also compared manual and semi-automatic annotations to ensure that the speed of the automated process did not come at the cost of quality. The two approaches produced comparable annotations.
Conclusion
CholecInstanceSeg is designed to bridge gaps in existing datasets by providing a comprehensive and high-quality resource for the development of instance segmentation algorithms in laparoscopic surgeries. It offers extensive annotations across various tools and is validated to ensure reliability for research and development purposes.
This dataset is intended to support the creation and refinement of technologies that can assist surgeons, making procedures safer and more efficient. By making this dataset publicly available, we hope to encourage further innovation in the field of surgical tool segmentation and computer-assisted surgeries overall.
Title: CholecInstanceSeg: A Tool Instance Segmentation Dataset for Laparoscopic Surgery
Abstract: In laparoscopic and robotic surgery, precise tool instance segmentation is an essential technology for advanced computer-assisted interventions. Although publicly available procedures of routine surgeries exist, they often lack comprehensive annotations for tool instance segmentation. Additionally, the majority of standard datasets for tool segmentation are derived from porcine (pig) surgeries. To address this gap, we introduce CholecInstanceSeg, the largest open-access tool instance segmentation dataset to date. Derived from the existing CholecT50 and Cholec80 datasets, CholecInstanceSeg provides novel annotations for laparoscopic cholecystectomy procedures in patients. Our dataset comprises 41.9k annotated frames extracted from 85 clinical procedures and 64.4k tool instances, each labelled with semantic masks and instance IDs. To ensure the reliability of our annotations, we perform extensive quality control, conduct label agreement statistics, and benchmark the segmentation results with various instance segmentation baselines. CholecInstanceSeg aims to advance the field by offering a comprehensive and high-quality open-access dataset for the development and evaluation of tool instance segmentation algorithms.
Authors: Oluwatosin Alabi, Ko Ko Zayar Toe, Zijian Zhou, Charlie Budd, Nicholas Raison, Miaojing Shi, Tom Vercauteren
Last Update: 2024-06-23 00:00:00
Language: English
Source URL: https://arxiv.org/abs/2406.16039
Source PDF: https://arxiv.org/pdf/2406.16039
Licence: https://creativecommons.org/licenses/by-nc-sa/4.0/
Changes: This summary was created with assistance from AI and may have inaccuracies. For accurate information, please refer to the original source documents linked here.
Thank you to arXiv for use of its open access interoperability.
Reference Links
- https://github.com/labdeeman7/cholec_instance_seg
- https://github.com/orgs/cocodataset/repositories
- https://www.synapse.org/Synapse:syn60239970/wiki/628710
- https://github.com/cocodataset/cocoapi/blob/master/PythonAPI/pycocotools/cocoeval.py
- https://www.youtube.com/watch?v=puuDpgUIhqI
- https://github.com/cocodataset/panopticapi/tree/master
- https://github.com/CAMMA-public/TF-Cholec80
- https://github.com/CAMMA-public/cholect50
- https://www.kaggle.com/datasets/newslab/cholecseg8k
- https://github.com/haochenheheda/segment-anything-annotator
- https://github.com/labelmeai/labelme
- https://github.com/labdeeman7/my_samannotator
- https://phakir.re-mic.de/data/