Open Access to Medical Imaging Dataset Could Advance Computer-Aided Detection
Researchers announce the open availability of the largest CT lesion-image database accessible to the public. DeepLesion, created by a team from the National Institutes of Health (NIH) Clinical Center, could help foster the development of deep-learning approaches for computer-aided detection (CADe) and diagnosis (CADx).
DeepLesion was developed by mining historical medical data from the NIH PACS (picture archiving and communication system), using the annotations, i.e., bookmarks, of clinically meaningful findings in medical images from the archive. The characteristics of the bookmarks were analyzed, harvested, and sorted to create the database.
The ground truth and two enlarged lymph nodes are correctly detected, even though the lymph nodes are not annotated in the dataset. Courtesy of SPIE.
In addition to building the database, the team also developed a universal lesion detector based on the database. This detector could serve as an initial screening tool for radiologists or other specialist CADe systems in the future.
With over 32,000 annotated lesions from over 10,000 case studies, the DeepLesion dataset is now the largest publicly available medical image dataset. It contains multiple lesion types, including kidney lesions, bone lesions, lung nodules, and enlarged lymph nodes.
“We hope the dataset will benefit the medical imaging area just as ImageNet benefited the computer vision area,” said researcher Ke Yan.
In addition to lesion detection, the DeepLesion database could also be used to classify lesions, retrieve lesions based on query strings, or predict lesion growth in new cases based on existing patterns in the database.
Future work will include extending the database to other image modalities, including data from multiple hospitals, and improving the accuracy of the detector algorithm.
The database can be downloaded at
https://nihcc.box.com/v/DeepLesion.
The research was published in
SPIE Digital Library (
doi:10.1117/1.JMI.5.3.036501).
LATEST NEWS