Applicaiton Required

CBIS-DDSM

Breast Cancer

This CBIS-DDSM (Curated Breast Imaging Subset of DDSM) is an updated and standardized version of the Digital Database for Screening Mammography (DDSM). The DDSM is a database of 2,620 scanned film mammography studies. It contains normal, benign, and malignant cases with verified pathology information. The scale of the database along with ground truth validation makes the DDSM a useful tool in the development and testing of decision support systems. The CBIS-DDSM collection includes a subset of the DDSM data selected and curated by a trained mammographer. The images have been decompressed and converted to DICOM format. Updated ROI segmentation and bounding boxes, and pathologic diagnosis for training data are also included. Published research results from work in developing decision support systems in mammography are difficult to replicate due to the lack of a standard evaluation data set; most computer-aided diagnosis (CADx) and detection (CADe) algorithms for breast cancer in mammography are evaluated on private data sets or on unspecified subsets of public databases. Few well-curated public datasets have been provided for the mammography community. These include the DDSM, the Mammographic Imaging Analysis Society (MIAS) database, and the Image Retrieval in Medical Applications (IRMA) project. Although these public data sets are useful, they are limited in terms of data set size and accessibility.

For example, most researchers using the DDSM do not leverage all its images for a variety of historical reasons. When the database was released in 1997, computational resources to process hundreds or thousands of images were not widely available. Additionally, the DDSM images are saved in non-standard compression files that require the use of decompression code that has not been updated or maintained for modern computers. Finally, the ROI annotations for the abnormalities in the DDSM were provided to indicate a general position of lesions, but not a precise segmentation for them. Therefore, many researchers must implement segmentation algorithms for accurate feature extraction. This causes an inability to directly compare the performance of methods or to replicate prior results. The CBIS-DDSM collection addresses that challenge by publicly releasing an curated and standardized version of the DDSM for evaluation of future CADx and CADe systems (sometimes referred to generally as CAD) research in mammography.

For scientific inquiries about this dataset, please contact Dr. Daniel Rubin, Department of Biomedical Data Science, Radiology, and Medicine, Stanford University School of Medicine (dlrubin@stanford.edu). A manuscript describing the dataset in detail is under review in Scientific Data and will be linked here when published.

Data and Resources

Additional Info

Field Value
Last Updated December 3, 2019, 11:32 (CST)
Created May 30, 2018, 16:17 (CST)

推薦資料集:


  • 中長期資金運用申請及處理流程圖

    Payment instrument Free
    Update frequency Irregular
    中長期資金運用制度係依據行政院82年7月1日起實施之「振興經濟方案」,以規劃統合有效運用我國社會中長期資金於民間投資與國家建設,促進經濟穩定發展為目標,行政院並於83年6月核頒「中長期資金運用策劃及推動要點」,成立跨部會之「中長期資金運用策劃及推動小組」,並於同年11月正式實施。
  • 高雄市旗津區109年公告地價

    Payment instrument Free
    Update frequency Irregular
    109年-高雄市旗津區公告地價
  • 安平漁港水域安全告示牌位置表

    Payment instrument Free
    Update frequency Irregular
    安平漁港水域安全告示牌位置表資料
  • 臺北市傳染病預防接種完成率

    Payment instrument Free
    Update frequency Irregular
    臺北市傳染病預防接種完成率時間數列統計資料
  • 「金融卡-交易時間帶分佈」結構比統計(月報)

    Payment instrument Free
    Update frequency Irregular
    提供民眾查詢金融卡交易依時間帶分佈月統計資訊(財金資訊公司)