Application Required

CBIS-DDSM

Breast Cancer

The CBIS-DDSM (Curated Breast Imaging Subset of DDSM) is an updated and standardized version of the Digital Database for Screening Mammography (DDSM). The DDSM is a database of 2,620 scanned film mammography studies. It contains normal, benign, and malignant cases with verified pathology information. The scale of the database, along with its ground truth validation, makes the DDSM a useful tool for developing and testing decision support systems. The CBIS-DDSM collection includes a subset of the DDSM data selected and curated by a trained mammographer. The images have been decompressed and converted to DICOM format. Updated region of interest (ROI) segmentations and bounding boxes, as well as pathologic diagnoses for the training data, are also included.

Published research results on decision support systems in mammography are difficult to replicate because there is no standard evaluation data set; most computer-aided diagnosis (CADx) and detection (CADe) algorithms for breast cancer in mammography are evaluated on private data sets or on unspecified subsets of public databases. Few well-curated public data sets have been provided for the mammography community. These include the DDSM, the Mammographic Imaging Analysis Society (MIAS) database, and the Image Retrieval in Medical Applications (IRMA) project. Although these public data sets are useful, they are limited in size and accessibility.

For example, most researchers using the DDSM do not leverage all of its images, for a variety of historical reasons. When the database was released in 1997, the computational resources needed to process hundreds or thousands of images were not widely available. Additionally, the DDSM images are saved in non-standard compression files that require decompression code that has not been updated or maintained for modern computers. Finally, the ROI annotations for the abnormalities in the DDSM were provided to indicate the general position of lesions, not a precise segmentation of them. Therefore, many researchers must implement their own segmentation algorithms for accurate feature extraction. As a result, the performance of different methods cannot be compared directly, and prior results cannot be replicated. The CBIS-DDSM collection addresses that challenge by publicly releasing a curated and standardized version of the DDSM for evaluating future CADx and CADe systems (sometimes referred to generally as CAD) in mammography research.
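To illustrate how a bounding box differs from a precise segmentation, the following minimal sketch derives a bounding box from a binary ROI mask. It is only an illustration under stated assumptions: the function name `mask_bounding_box` and the toy mask are hypothetical and are not taken from the dataset, and real CBIS-DDSM masks are stored as DICOM files that would first have to be read with a DICOM library.

```python
from typing import Optional, Sequence, Tuple


def mask_bounding_box(
    mask: Sequence[Sequence[int]],
) -> Optional[Tuple[int, int, int, int]]:
    """Return (row_min, col_min, row_max, col_max), inclusive, of the
    nonzero pixels in a 2D mask, or None if the mask is empty.

    Hypothetical helper for illustration; not part of any CBIS-DDSM tooling.
    """
    # Rows that contain at least one nonzero (lesion) pixel.
    rows = [r for r, row in enumerate(mask) if any(row)]
    if not rows:
        return None
    # Column indices of every nonzero pixel.
    cols = [c for row in mask for c, value in enumerate(row) if value]
    return rows[0], min(cols), rows[-1], max(cols)


# Toy 4x5 mask (made up): 1 marks pixels inside the segmented lesion.
toy_mask = [
    [0, 0, 0, 0, 0],
    [0, 0, 1, 1, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 1, 0, 0],
]

print(mask_bounding_box(toy_mask))  # (1, 1, 3, 3)
```

The box covers nine pixels while the mask marks only six, which is why a box alone gives the general position of a lesion but not its outline.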

For scientific inquiries about this dataset, please contact Dr. Daniel Rubin, Department of Biomedical Data Science, Radiology, and Medicine, Stanford University School of Medicine (dlrubin@stanford.edu). A manuscript describing the dataset in detail is under review in Scientific Data and will be linked here when published.

Additional Information

Last updated: December 3, 2019, 11:32 (CST)
Created: May 30, 2018, 16:17 (CST)
