Abstract Objectives: According to the World Health Organization (WHO), breast cancer is the most frequent malignant neoplasia and leading cause of cancer death among women worldwide. Low- and middle-income countries hold the worst survival rates mainly owing to a lack of access to appropriate diagnosis and treatment related resources. For proper early diagnosis, it is established that besides the physical structure itself (e.g., mammography units), there's a need for adequate interpretation of imaging and that might be a particularly major problem in low-income societies once there is a tendency of greater education setbacks. Mammography datasets can improve this resource-driven gap by enabling the development of artificial intelligence technologies (AI) which can make breast cancer diagnosis more accurate in a cost-effective and scalable way. We aim to create a new database of high quality digital mammography images suitable for AI development and education. Methods: Our mammography database was developed by means of retrospective selection of 100 exams performed by Hospital São Paulo - Federal University of São Paulo ranging from 2019 to 2023. The project is assumed to be safe, versatile, and usable, and required an extensive search for the appropriate tool. Ambra Health, an American company, has developed cloud-based software for medical image management and stood out as a viable alternative. Their platform meets international data security criteria, they also made the intended careful customization possible, in addition to the possibility of associating image and text attachments. The categories were created in accordance with the BI-RADS® descriptors, a wide range of clinical scenarios and additional materials available, and they served as the basis for the advanced search feature, which intuitively filters exams that meet the selected criteria simultaneously. The platform was integrated with an automatic anonymization system upon upload, ensuring data privacy. After submission, the exams are retained in a restricted area for anonymization verification, categorization, and attachment management, before being released to the end-user. So as to broaden geographic coverage, the descriptors were entered in American English, respecting the origin of the BI-RADS® lexicon, as for the website structure, automatic translation to the accessing browser standard language was selected. Results: Our website is active and available at http://mamografia.unifesp.br, with access granted upon a simple registration process. 941 mammography images from 100 anonymized cases, 62% of which include 3D images, can be filtered based on the combination of 113 clinical and imaging variables, as well as attachment availability. The language is adaptable to the user's native language, and categorized searches can be accessed directly from the browser or downloaded as customized datasets. Additionally, features such as saved searches or starred exams are also available. Conclusion: We have developed an online and free mammography database that is completely innovative by integrating various resources into a single platform. We provide high-resolution and 3D digital images that can be searched using an advanced search system. Moreover, we offer supplementary clinical information in various attachment formats, favoring a rich clinical correlation. In this way, we have achieved the ambivalence of our goal, which was to promote education and research. *"images speak louder than words" Database: https://mamografia.unifesp.br Tutorials: https://www.youtube.com/@Mamografiaunifesp e-mail: acesso@mamografia.unifesp.br password: acesso@mamografia12 (valid until dec/23) Citation Format: Natália Cordeiro, Gil Facina, Afonso Nazário, Vanessa Sanvido, Joaquim Araujo Neto, Morgana Silva, Simone Elias. Towards Precision Medicine in Breast Imaging: A Novel Open Mammography Database with Tailor-Made 3D Image Retrieval for Artificial Intelligence and Teaching [abstract]. In: Proceedings of the 2023 San Antonio Breast Cancer Symposium; 2023 Dec 5-9; San Antonio, TX. Philadelphia (PA): AACR; Cancer Res 2024;84(9 Suppl):Abstract nr PO2-29-02.
Read full abstract