Analyzing huge pathology images with open source software
1 Univ Paris Diderot, Laboratoire IMNC, UMR 8165 CNRS, Univ Paris-Sud, Orsay F-91405, France
2 Univ Paris Diderot, Laboratoire de pathologie, Hôpital Saint-Louis APHP, INSERM UMR-S 728, Paris F-75010, France
3 CNRS, UMR 8165, Laboratoire IMNC, Univ Paris-Sud, Univ Paris Diderot, Orsay F-91405, France
4 Present address: CNRS, UMR 8148, Laboratoire IDES, Univ Paris-Sud, Orsay F-91405, France
5 Present address: CNRS, UMR 8608, IPN, Univ Paris-Sud, Orsay F-91405, France
6 MRI-Montpellier RIO Imaging, CRBM, Montpellier F-34293, France
7 CIRAD, Montpellier CEDEX 5 F-34398, France
Diagnostic Pathology 2013, 8:92 doi:10.1186/1746-1596-8-92Published: 6 June 2013
Digital pathology images are increasingly used both for diagnosis and research, because slide scanners are nowadays broadly available and because the quantitative study of these images yields new insights in systems biology. However, such virtual slides build up a technical challenge since the images occupy often several gigabytes and cannot be fully opened in a computer’s memory. Moreover, there is no standard format. Therefore, most common open source tools such as ImageJ fail at treating them, and the others require expensive hardware while still being prohibitively slow.
We have developed several cross-platform open source software tools to overcome these limitations. The NDPITools provide a way to transform microscopy images initially in the loosely supported NDPI format into one or several standard TIFF files, and to create mosaics (division of huge images into small ones, with or without overlap) in various TIFF and JPEG formats. They can be driven through ImageJ plugins. The LargeTIFFTools achieve similar functionality for huge TIFF images which do not fit into RAM. We test the performance of these tools on several digital slides and compare them, when applicable, to standard software. A statistical study of the cells in a tissue sample from an oligodendroglioma was performed on an average laptop computer to demonstrate the efficiency of the tools.
Our open source software enables dealing with huge images with standard software on average computers. They are cross-platform, independent of proprietary libraries and very modular, allowing them to be used in other open source projects. They have excellent performance in terms of execution speed and RAM requirements. They open promising perspectives both to the clinician who wants to study a single slide and to the research team or data centre who do image analysis of many slides on a computer cluster.
The virtual slide(s) for this article can be found here: