Applicaiton Required

CBIS-DDSM

Breast Cancer

This CBIS-DDSM (Curated Breast Imaging Subset of DDSM) is an updated and standardized version of the Digital Database for Screening Mammography (DDSM). The DDSM is a database of 2,620 scanned film mammography studies. It contains normal, benign, and malignant cases with verified pathology information. The scale of the database along with ground truth validation makes the DDSM a useful tool in the development and testing of decision support systems. The CBIS-DDSM collection includes a subset of the DDSM data selected and curated by a trained mammographer. The images have been decompressed and converted to DICOM format. Updated ROI segmentation and bounding boxes, and pathologic diagnosis for training data are also included. Published research results from work in developing decision support systems in mammography are difficult to replicate due to the lack of a standard evaluation data set; most computer-aided diagnosis (CADx) and detection (CADe) algorithms for breast cancer in mammography are evaluated on private data sets or on unspecified subsets of public databases. Few well-curated public datasets have been provided for the mammography community. These include the DDSM, the Mammographic Imaging Analysis Society (MIAS) database, and the Image Retrieval in Medical Applications (IRMA) project. Although these public data sets are useful, they are limited in terms of data set size and accessibility.

For example, most researchers using the DDSM do not leverage all its images for a variety of historical reasons. When the database was released in 1997, computational resources to process hundreds or thousands of images were not widely available. Additionally, the DDSM images are saved in non-standard compression files that require the use of decompression code that has not been updated or maintained for modern computers. Finally, the ROI annotations for the abnormalities in the DDSM were provided to indicate a general position of lesions, but not a precise segmentation for them. Therefore, many researchers must implement segmentation algorithms for accurate feature extraction. This causes an inability to directly compare the performance of methods or to replicate prior results. The CBIS-DDSM collection addresses that challenge by publicly releasing an curated and standardized version of the DDSM for evaluation of future CADx and CADe systems (sometimes referred to generally as CAD) research in mammography.

For scientific inquiries about this dataset, please contact Dr. Daniel Rubin, Department of Biomedical Data Science, Radiology, and Medicine, Stanford University School of Medicine (dlrubin@stanford.edu). A manuscript describing the dataset in detail is under review in Scientific Data and will be linked here when published.

データとリソース

追加情報

フィールド
最終更新 12月 3, 2019, 11:32 (CST)
作成日 5月 30, 2018, 16:17 (CST)

推薦資料集:


  • 桃園市應實施防火管理制度之場所

    Payment instrument Free
    Update frequency Irregular
    依「消防法」第13條規定一定規模以上供公眾使用建築物,應由管理權人,遴用防火管理人,責其製定消防防護計畫,報請消防機關核備,並 依該計畫執行有關防火管理上必要之業務。 本資料集提供應實施防火管理制度之場所相關規定。
  • 化學成分、元素、核種檢驗分析之事項及收費基準

    Payment instrument Free
    Update frequency Irregular
    行政院原子能委員會核能研究所提供產學研界申請使用及參考。
  • 商業登記(依營業項目別)-製茶業

    Payment instrument Free
    Update frequency Irregular
    提供全國製茶業(C111010)商業登記資料。
  • 綜合所得稅納稅人申報誠實程度及歸戶績效單項分配各級距申報統計表

    Payment instrument Free
    Update frequency Irregular
    綜合所得稅納稅人申報誠實程度及歸戶績效單項分配各級距申報統計表
  • 檔案支援教學網主題檔案瀏覽統計

    Payment instrument Free
    Update frequency Irregular
    本局建置「檔案支援教學網」(Archival Resources for Teaching,簡稱ART),提供高中職教師挑選適用之國家檔案影像應用於教學教材中,使檔案應用得與學校教育結合,將檔案內涵與應用融入學校課程。本網站主題檔案瀏覽統計係分年分月統計民眾點選各主題瀏覽的資料次數。