Professional Documents
Culture Documents
13(2), 1101-1112
http://dx.doi.org/10.13005/bbra/2139
Breast cancer is one of the most common Histopathological studies are still most
cancers in women in the world. In 2015, an reliable and effective technique in cancer research.
estimated 60,290 new cases of breast carcinoma Till now, analysis of histopathology images has
were diagnosed in situ, 83% of which were ductal been done manually via observing dysplastic
carcinoma in situ (DCIS) and 12% lobular appearances such as minute structures,
carcinoma in situ (LCIS)1-2. Cancer begins when distribution, finding of tubules, nuclei, regularities
genes in a cell become abnormal and the cell starts of cell shapes and size across the tissues by the
to growing and divide out of control. Cancerous pathologist to decide whether it is benign or
cells replicate much faster than normal healthy malignant. The distortion in the shape of cells and
cells. It divides and multiplies to form a tumor that change in the density of cluster of cells are the
may be benign (non-cancerous) and malignant signatures of the occurrence of malignancy in body
(cancerous)3. tissue4-6. Pathologists face several problems while
observing the histopathological image due to
overlapping, blurriness, artifacts, weak boundary
detection and uneven dying. Moreover, it is found
* To whom all correspondence should be addressed.
E-mail: anuranjeeta.rs.bme11@itbhu.ac.in
to be very time consuming and tedious process.
This depends on perceptions and level of expertise
1102 ANURANJEETA et al., Biosci., Biotech. Res. Asia, Vol. 13(2), 1101-1112 (2016)
of pathologists. For cancer detection, section 5 draws the conclusion of the work
morphological feature extraction is the main tool presented in this paper.
for analyzing the cellular organization, abnormality,
and changes in the physiological state of the cells7- MATERIALSAND METHODS
9
. Analysis of the cells based on their morphological
differences was applied to study the differentiation Images collection
of benign or malignant cells. A computer aided Histopathology breast cancer cell
diagnosis system is proposed in this paper as a datasets used in present work have been taken
qualitative and quantitative tool for analysis and from www.bioimage.ucsb.edu (Centre for Bio-image
classification10-11. Several types of research have Informatics, University of California, Santabarbara
analyzed histopathology images that relate image (UCSB) for analysis. Microphotographs of breast
analysis of cells morphology to the malignancy cancer histopathology of total 58 images were
detection. A. Madabhushi observed the challenges taken, 26 out of which were malignant and 32 are
in digital imaging that led to improvement in image benign. With the help of cropping histopathology
analysis techniques resulting in improved images are fragmented into single and group cells.
opportunities to the pathologist for the treatment A dataset of single cells consisting of 218 benign
of benign tissues12. A. D. Belsare et al. worked on and 233 malignant and a dataset of the group of
the tissue structure and presented cell distribution cells consisting of 72 benign and 73 malignant were
in a tissue. They described irregularities of the framed. Structural, intensity and textures based 30
shapes of cells to determine the level of malignancy features were used to distinguish between benign
and benign in histopathology images 13 . and malignant cells. The images acquired from
Bhattacharjee et al. presented a review of computer- histopathology breast cancer (UCSB) datasets
aided diagnosis system to detect cancer from were already stained to visualize various parts,
histopathology images using image processing cellular structures such as cells, nuclei, and
method14. Demir et al. presented on both tissue cytoplasm of the tissue. Certain special stains are
level and cellular level automatic diagnosis of used to bind selectively to particular components.
biopsy image using image processing techniques, The nuclei were stained blue with hematoxylin
feature extraction and classification techniques15. while cytoplasm and extra cellular components were
S. Petushi obtained the intensity of the pixels that in pink due to eosin staining.
are registered and calculate the mean of the
neighboring pixels16. Bergmeir et al. proposed a y
contrast, correlation, energy, homogeneity, gray Input cellular image data set
Pre-processing
morphological and texture features (GLCM) Gaussian filtering using unsharp masking to increase
the contrast of cytoplasm and nucleus
extracted from the segmented histopathology
images. In this work, diverse image processing I i+1 Segmentation using band thresholding
Stop
proposed algorithm. Section 3 describes the results
and Section 4 describes discussions. Finally, Fig. 1. Schematic flowchart of the proposed method
ANURANJEETA et al., Biosci., Biotech. Res. Asia, Vol. 13(2), 1101-1112 (2016) 1103
boundary respectively. Standard deviation is then cells. Moreover, the statistics computed on these
measured. After that, it is converted to gray level properties is used to identify cancer in a tissue.
image having one bounding box marked by yellow Structure-based features used in this paper are the
color [22-25]. Single cells and the group of cells area, convex area, perimeter, major axis length,
have been taken into account for segmentation minor axis length, circularity, eccentricity, and
and analysis. Figure 2 depicts the results obtained solidity are explained in Table 3.
by implementing the steps discussed above. Intensity features
Extraction of morphological features Pixel based features provide information
The most significant portion of this work about the intensity (gray-level or color) histogram
is the computation of features. To do the same, the of the pixels located in cells. These features were
total features of the particular ROI (region of extracted from the gray-level or color histogram of
interest) are extracted to distinguish different types the image. This includes max intensity, min
of cells such as benign and malignant. This based intensity, mean intensity and standard deviation
on their structural, intensity, and texture features that explained in Table 4. These types of features
in single cells and the group of cells were do not provide any information about the spatial
computed from the segmented cell images as shown division of the pixels.
in Table 1. Further, 30 features have been computed Texture features
for the cells present in the image. The texture features provide information
The quantification of these features helps about the variation in the intensity of a surface
to differentiate the malignant cells from benign and quantify properties such as regularity,
Group cells
Group cells
ANURANJEETA et al., Biosci., Biotech. Res. Asia, Vol. 13(2), 1101-1112 (2016) 1105
Feature
Structure Features Description
No.
F1. Cell Area (A) The Cell area can be represented by nucleus region containing a
total number of non-zero pixels in the region.
= (, )
=1 =1
A is cell area and B is the segmented image of i rows and columns.
F 2. Convex Area Scalar that specifies the number of pixels in Convex Image.
F 3. Cell Perimeter (P) Cell perimeter calculates the distance between each adjoining pair of
pixels around the border of the region. It is defined as:
= + 2 ( ) .
F 4. Major Axis Length It specifies the length (in pixels) of the major axis of the ellipse that
has the same normalized second central moments as the region.
= ( 1 2 )2 + ( 1 2 )2
1 , 1 and 2, 2 are end points on the major axis.
F 5. Minor Axis Length It specifies the length (in pixels) of the minor axis of the ellipse that
has the same normalized second central moments as the region.
= ( 2 1 )2 + ( 2 1 )2
1 , 1 and 2, 2 are end points on minor axis.
F 6. Circularity This dimensionless parameter is calculated by area and perimeter.
4 [ ]
=
[ ]2
F 7. Eccentricity The ratio of major axis length and minor axis length is known as
eccentricity and defined as:
=
F 8. Solidity Scalar was specifying the proportion of the pixels in the convex hull
that are also in the region, computed as Area/Convex Area.
=
9. Max Intensity Scalar was specifying the value of the pixel with the greatest intensity in
the region.
10. Min Intensity Specifying the value of the pixel with the lowest intensity in the region.
11. Mean Intensity It specifies the mean of all the intensity values in the region.
12. Standard deviation It is a measure of contrast
1
= ( )2
=1
1106 ANURANJEETA et al., Biosci., Biotech. Res. Asia, Vol. 13(2), 1101-1112 (2016)
F 13. Autocorrelation Correlation is a measure of the linear dependency of gray levels on those of
neighbouring pixels or specified points. It indicates the local gray-level
dependency on the texture image; higher values can be obtained for similar gray-
level regions.
F 14. Contrast This measure provides evidence of how sharp are the structural variations in the
image.
= | |2 ( , )
,
F 15. Correlation The correlation feature is a measure of gray level dependency of the image.
( )( ) (, )
=
,
F 16. Cluster Prominence Cluster prominence is also a measure of asymmetry. When the prominence cluster
value is high, the image is less symmetric. Also, when prominence cluster value is
low, there is a peak in the GLCM matrix around the mean values.
F 17. Energy Provides the sum of squared elements in the GLCM. Also known as uniformity
= ( , )2
,
F 18. Entropy Entropy is a statistical measure of randomness that can be used to characterize the
texture of the input image. Entropy is defined as : sum(p.*log2(p))
F 19. Homogeneity Measures the closeness of the distribution of elements in the GLCM to the GLCM
diagonal.
(, )
=
1+| |
,
F20 .. ..
magnitude of shape based features in comparison the features have an insignificant relation and
to benign cells, and there was variation in other minor difference such as standard deviation,
features values. All the malignant cells in the single minimum intensity, and correlation. These features
cells and the group have increased the size (area, are insignificant for the differentiation point of view
convex area, perimeter, major axis, minor axis) and in both cases in single cells and the group of cells
elongated shape (circularity, eccentricity) and as they are having almost analogous values, as
greater magnitude of the maximum and mean presented in Table 6. Dissimilarity, energy, entropy,
intensity. This size, shape and intensity based homogeneity, maximum probability difference
feature were significant for the differentiation point variance, difference entropy, inverse difference
of view for single cells and group cells. Some of normalized (INN) and inverse difference moment
Table 6. Comparative parameter of single cells and group cells of benign and malignant cells of breast cancer image
normalizes features are significant in single cells The results presented here are expressed
only and insignificant in group cells. Texture feature as Mean S.D. Statistical analysis has been
of group cells does not belong to each single cell performed using Graph Pad Prism software (version
and the group it takes as a whole image. Cluster 5.1). To perform unpaired, two-tailed students t-
prominence feature is insignificant in single cells tests, p-value < 0.05 was used for significance.
and significant in group cells.
Fig. 3. Variations of values of various features for (a) single cell (b) group cells for breast cancer
Fig. 4. Variations of values of various features for (a) single cell (b) group cells for breast cancer
Fig. 5. Variations of values of various features for (a) single cell (b) group cells
ANURANJEETA et al., Biosci., Biotech. Res. Asia, Vol. 13(2), 1101-1112 (2016) 1109
Fig. 6. Variations of values of various features for (a) single cell (b) group cells.
Following table represents the numerical value of the features of single and group cells
texture bio-images for classification of colon 34. Hamilton, P., Wang, Y., Crookes, D., Diamond,
cancer cells. WSEAS Transactions on Biology J., and Turner, R., 2007. Segmentation of
and Biomedicine, 8(2), pp.39-50. squamous epithelium from ultra-large cervical
32. R. M. Haralick, K. Shanmugam, and I. Dinstein., histological virtual slides. In Engineering in
1973.Textural features for image classification, Medicine and Biology Society, 2007. EMBS
IEEE Transactions on Systems, Man and 2007. 29th Annual International Conference of
Cybernetics, vol. 3, no. 6, pp. 610621. the IEEE (pp. 775-778). IEEE
33. Doyle, S., Agner, S., Madabhushi, A., Feldman, 35. Mouelhi, M., Sayadi, F., Fnaiech, K. M., and K.
M. and Tomaszewski, J., 2008. Automated B. Romdhane., 2013. Automatic image
grading of breast cancer histopathology using segmentation of nuclear stained breast tissue
spectral clustering with textural and architectural sections using color active contour model and
image features. In Biomedical Imaging: From an improved watershed method. Biomedical
Nano to Macro, 2008. ISBI 2008. 5th IEEE Signal Processing and Control, vol. 8, no. 5,
International Symposium on pp. 496-499, IEEE. pp. 421436.