Sei sulla pagina 1di 14

This article has been accepted for publication in a future issue of this journal, but has not been

fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TFUZZ.2017.2756827, IEEE
Transactions on Fuzzy Systems
1
Double-column manuscript

Symmetry Information Based Fuzzy Clustering


for Infrared Pedestrian Segmentation
Xiangzhi Bai, Yingfan Wang, Haonan Liu and Sheng Guo

 infrared images are also affected by noises and they are always
Abstract—Pedestrian detection in infrared images is always a low contrast. All of these problems make it difficult to segment
challenging task. Segmentation is an important step of pedestrian or detect pedestrians in infrared images.
detection. An accurate segmentation could provide more Segmentation is crucial for pedestrian detection. In the
information for further analysis. In this paper, an improved
process of segmentation, the regions which may contain
Fuzzy C-Means clustering method, which incorporates geometric
symmetry information, is proposed for infrared pedestrian pedestrians should be segmented as the regions of interest (ROI)
segmentation. In the proposed method, symmetry information is for further recognition. An accurate segmentation would
introduced by Markov Random Field (MRF) theory. Moreover, a evidently reduce false alarms and provide more details for
new metric is utilized to handle the weak symmetry of pedestrian. further analysis to ensure the precision of recognition. Precision
In addition, a whole procedure is proposed to extract infrared is an essential requirement in pedestrian detection. Hence,
pedestrians. The experimental results indicate that our method
pedestrian segmentation is an important step of pedestrian
performs better for infrared pedestrian segmentation and obtains
better segmentation results compared with other state-of-the-art detection.
methods. There are many image segmentation methods. Threshold
segmentation is a widely-used method for image segmentation.
Index Terms—Pedestrian segmentation, Infrared images, And Otsu method [6] is a famous threshold segmentation
Fuzzy C-Means, Symmetry. methods. However, Otsu method can not segment the image
correctly when the histogram of the image is unimodal or close
I. INTRODUCTION to unimodal. To solve this problem, Unimodal thresholding [7]

T he infrared imaging technology is useful in many fields,


especially for pedestrian detection. Pedestrians are usually
warmer than background and appear brighter, which makes
was proposed. But this method is mainly used in the unimodal
situation. Minimum error threshold [8] is another classical
threshold method. Images are supposed to obey the Gaussian
pedestrians possible to be detected. This property would be distribution in this method, which limits its application.
helpful for reducing the number of accidents and injury Marker-based watershed [9] is a morphological algorithm
fatalities, especially in intelligent transportation system at night. which imitates the process of topography immersion. But the
Hence, pedestrian detection is a key technology in automotive problem of accurate foreground object segmentation is still
safety, intelligent video surveillance and so on. challenging. Circular Shortest Path (CSP) [10] is a novel
Many methods have been proposed for pedestrian detection. method for infrared pedestrian segmentation and could produce
These methods have been proved to be effective to some degree good results. But the segmentation results of CSP are still not
[1-5]. However, infrared pedestrian detection is still a precise enough. Sometimes this method even fails to segment
challenging task. Pedestrians are non-rigid objects and they the limbs of pedestrians.
appear with a wide range of clothes, sizes and postures. These Fuzzy C-Means (FCM) is a classical clustering method.
factors make it difficult to fix a template for pedestrians. FCM methods do not assign pixels to only one class, but assign
Moreover, pedestrians are not the only warm objects in the pixels to different classes with different memberships. These
environment. Many other objects, such as lamps and vehicles, memberships effectively provide the uncertainties for image
could also be captured by infrared cameras. They will disturb segmentation. Thus, FCM and its variants have been widely
the detection of pedestrians. In addition, pedestrians may used in image segmentation [11-13]. However, not considering
distribute heat heterogeneously. Clothes and other factors may spatial information makes FCM sensitive to noises. To solve
influence the heat dissipating capacity of the pedestrian, which this problem and improve the performance of FCM, some
results in intensity inhomogeneity in infrared images. Besides, improved methods [14-19] have been proposed. In these
methods, spatial information is introduced to enhance
The authors thank the sponsorship of National Natural Science Foundation robustness of FCM. Effective improvements have been made in
of China (61271023) and Program for New Century Excellent Talents in
Universities (NCET-13-0020). (Corresponding author: Xiangzhi Bai. The first
these improved methods, but they still have some problems:
two authors contributed equally to this paper.). just considering the local information, difficult parameter
X. Bai, Y. Wang, H. Liu and S. Guo are with Image Processing Center, selection and ignoring geometric information.
Beijing University of Aeronautics and Astronautics, 100191 Beijing, China.
(E-mail: jackybxz@buaa.edu.cn).
In this paper, to segment infrared pedestrians, an improved
X. Bai is also with State Key Laboratory of Virtual Reality Technology and FCM method which incorporates geometric symmetry
Systems, Beihang University, 100191 Beijing, China.

1063-6706 (c) 2017 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TFUZZ.2017.2756827, IEEE
Transactions on Fuzzy Systems
2

information is proposed. Considering the symmetry feature of C. Gibbs Fuzzy C Means (GFCM)
pedestrians, we use symmetry information to improve the To improve the performance of FCM, Gibbs FCM (GFCM)
performance of FCM for infrared pedestrian segmentation. A is proposed in [22]. In GFCM, the neighborhood information is
weight is added to the objective function, in which symmetry introduced by Markov Random Field (MRF) theory. The
information is introduced by Markov Random Field (MRF) objective function of GFCM is:
theory. However, pedestrians are non-rigid objects and the N c

symmetry is not strict. To overcome this problem, a new J GFCM   (1  Pj (i ))uijm || x j  vi ||2 . (3)
j 1 i 1
distance metric is utilized to ensure the rationality of symmetry
information. Finally, our method is compared with eight GFCM accomplishes the clustering process by minimizing
existing segmentation methods. The experimental results show the objective function (3). 1  Pj (i ) takes a value between 0
that our method performs better for infrared pedestrian and 1. It is considered as the resistance to assign pixel x j to
segmentation compared with other segmentation methods.
The rest of this paper is organized as follow: FCM algorithm class i by its neighborhood N j . 1  Pj (i ) is defined as
and some related works are introduced briefly in section II. refusable level and represents the spatial constraints. It will be
Section III describes our improved FCM method in details and discussed in details in section III.
the whole procedure of infrared pedestrian segmentation is
shown in section IV. Experiment is performed and the
D. Symmetry Incorporated Fuzzy C Means (SymFCM)
experimental results are analyzed in section V. Some extensive
works are discussed in section VI to analyze the performance of Geometric information is also ignored in FCM. Although
our method in other application areas. many methods have been proposed to improve the performance
of FCM, they still lack robustness as a result of ignoring
II. RELATED WORKS geometric information. To further enhance robustness of FCM,
a modified FCM clustering method SymFCM is proposed [23],
A. Fuzzy C Means (FCM) in which symmetry information is incorporated. The objective
FCM algorithm was first proposed by Dunn [20] and was function of SymFCM is as below:
further generalized by Bezdek [21]. The objective function of N c

FCM is as below: J SymFCM   uijm (|| x j  vi ||2  j || x 'j  vi ||2 ). (4)


j 1 i 1
N c
J   uijm || x j  vi ||2 . (1) 1
j 1 i 1 j   ( j   )/ 
(5)
1 e
Where X  x1 , x2 , , xN  represents the gray values of  

pixels and N is the number of pixels. c is the number of  j  1 || x j  x'j || (6)


clusters. uij is the membership of x j belonging to class i and x is the mirror pixel of x j and  j is a constant weighing
'
j

vi is the clustering center of class i . m is the fuzzy factor, the symmetry of x j . x j is the mean value of the pixels in the
which is usually set as 2. 
Apparently, FCM method does not consider spatial neighborhood window with center x j . x'j is the mean value of
information of the dataset. Every pixel is considered as isolated.
the pixels in the neighborhood window with center x 'j . u is the
Therefore FCM method is sensitive to noises.
mean value of  j for all the pixels.  is the standard deviation
B. Markov Random Field (MRF) Theory of  j for all the pixels. Hence, if the symmetry of x j is strong,
Assuming that X  x1 , , xN  are random variables in a  
x j is close to x'j . Then  j is large and  j  1 . On the
dataset and S  1, , N are the sites of these variables. Each  
contrary, if the symmetry of x j is weak, x j and x'j are
variable takes a label in the set L  1, , c .
MRF theory defines the conception of neighborhood system dissimilar. Then  j is small and  j  0 . Therefore, the
N j and it should satisfy these conditions: (1) j  N j ; (2) contribution of the mirror pixel is determined by the symmetry
of pixel x j .
j  N r  r  N j . When the dataset X satisfies the following
In this way, symmetry information is introduced into FCM.
conditions, it is considered as a Markov Random Field:
However, SymFCM just takes the mirror pixel into
P( x j | xS  j )  P( x j | xN j ), P( x j )  0, x j  X . (2)
consideration. Hence, the use of symmetry information is
P ( x j ) represents the probability of the variable on the site of insufficient.
j taking a value of x j . In image processing, formula (2)
III. GIBBS SYMMETRY INCORPORATED FCM ALGORITHM
means the label of x j is influenced by its neighbors directly.
Pedestrians are highlight regions in infrared images. Thus
the pixels belonging to pedestrians present obvious clustering

1063-6706 (c) 2017 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TFUZZ.2017.2756827, IEEE
Transactions on Fuzzy Systems
3

property. Moreover, infrared images are always low-contrast the resistance to assign pixel x j to class i by its neighbors is
and have blur edges. Usually, there are transitions between the
small. If the refusable level is large, x j and its neighbors may
foreground and background instead of clear boundaries in
infrared images. All these problems above increase the be irrelevant. Then, the resistance to assign x j to class i by its
uncertainty of infrared images. Hence, fuzzy clustering would neighbors is large. Hence, the refusable level represents the
be an effective method for infrared pedestrian segmentation. relativity between x j and its neighbors. As we have analyzed,
Infrared images lack texture features and the shape of
infrared pedestrian is nearly symmetric. Therefore, vertical there is a great possibility that pixel x j and its mirror pixel x 'j
symmetry is an important feature of infrared pedestrians. are highly relevant. Hence, we can use symmetry information
Symmetry information could be introduced into FCM to to redefine the refusable level. The redefined refusable level
improve the performance of infrared pedestrian segmentation. Wij represents the relativity between x j and its mirror pixel
x 'j , along with the relativity between x j and its neighbors.
A. Improvement based on Strict Symmetry Information Symmetry information would be helpful for improving the
The Hamersley–Clifford theorem indicates that a positive results of infrared pedestrian segmentation.
probability distribution can be represented as a Markov So, we define the revised refusable level as below:
Random Field. Hence, the MRF theory and Gibbs distribution
Wij  (1  Pj (i))(1  Q j (i)). (11)
can be incorporated to describe the relativity between pixel x j
The difference between (1  Pj (i )) and Wij is that symmetry
and its neighbors. In GFCM, the calculation of 1  Pj (i ) is
information is used to define the revised refusable level. Wij
based on MRF theory and Gibbs distribution:
Pj (i)  Z 1 exp(U j (i)). (7) represents the relativity between x j and its mirror pixel x 'j ,
c along with the relativity between x j and its neighbors. It
Z = exp(U j (i)). (8)
i 1 means the resistance to assign pixel x j to class i by its
Where Z is a normalization constant and c is the number neighbors and the mirror pixel. If Wij is small, it indicates that
of clusters. U j (i ) is the energy function of pixel x j . It is
its neighbors and the mirror pixel tend to assign x j to class i .
defined as below:
U j (i )   V j (i ). (9) But if Wij is large, it means its neighbors and the mirror pixel
jC
refuse to assign x j to class i . Hence, Wij represents the spatial
Where C represents the clique which is a subset of sites S .
constraints and symmetry constraints of the pixel x j .
Every pair of sites in the clique are neighbors. V j (i ) is the
clique potentials and it is determined by the local configuration The calculation of 1  Pj (i ) is the same to GFCM and a
on the clique. In GFCM, a special multiple level logic model similar procedure is used to calculate 1  Q j (i) :
[22], which only considers two-sites cliques, is utilized to
Qj (i)  Z 1 exp(E j (i)). (12)
calculate the energy function:
U j (i )   (1  ur ,i ).
c
(10) Z = exp( E j (i)). (13)
r N j
i 1

Where N j is the neighborhood of x j . And u r ,i is the Following the form of 1  Pj (i ) , E j (i ) should be also a form
membership of its neighbor xr belonging to class i . of energy function, which is similar to U j (i ) in formula (10).
However, this method may produce misclassifications when Symmetry is an important feature of infrared pedestrians, x j
the pixel is on the boundary of the target region. To overcome
this problem, spatial shape information is used to revise the and its mirror pixel x 'j may have similar property. Shape
refusable level in [24] and [25]. But this method also neglects information and symmetry information should also be
symmetry information. considered in the revised refusable level. Hence, symmetry
In infrared images, the shape of infrared pedestrian is nearly information in four directions is utilized to define E j (i ) , as
symmetric, which means symmetry is a distinct feature of
shown in Fig. 1. Therefore, the difference between E j (i ) and
infrared pedestrian. There is a great possibility that x j and its
U j (i ) is that shape information and symmetry information are
mirror pixel x 'j belong to the same class. Hence, symmetry
considered in E j (i ) . Then, E j (i ) would effectively represents
information would help to improve the performance of FCM
for infrared pedestrian segmentation. the relativity between x j and its mirror pixel x j , along with
The refusable level 1  Pj (i ) represents the relativity the relativity between x j and its neighbors. The definition of
between pixel x j and its neighbors. If x j and its neighbors are E j (i ) is shown in formula (14).
highly relevant, the refusable level would be small. This means

1063-6706 (c) 2017 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TFUZZ.2017.2756827, IEEE
Transactions on Fuzzy Systems
4

E j (i )  (1  ui ( a, b  1))(1  ui ( a, b  1))(1  ui ( a , b  1))(1  ui ( a , b  1))


4 ' ' ' '

 4 (1  ui ( a  1, b  1))(1  ui ( a  1, b  1))(1  ui ( a  1, b  1))(1  ui ( a  1, b  1))


' ' ' '

(14)
 4 (1  ui ( a  1, b))(1  ui ( a  1, b))(1  ui ( a  1, b ))(1  u ( a  1, b ))
' ' ' '

 4 (1  ui ( a  1, b  1))(1  ui ( a  1, b  1))(1  ui ( a  1, b  1))(1  ui ( a  1, b  1)).


' ' ' '

Fig. 1. The symmetry information of four directions

(a, b) is the coordinate of x j and ( a ' , b ' ) is the coordinate on (7) and (10). Finally, we can know that W fore  Wback , which
of the mirror pixel x 'j . ui (a, b) is the membership of x j means the revised refusable level decreases the resistance to
assign pixel x j to the class of pedestrian and increases the
belonging to class i . x j and its mirror pixel x 'j are symmetric
with respect to the symmetry axis. Actually, to determine the resistance to assign pixel x j to the class of background.
mirror pixel of x j , the vertical symmetry axis of the pedestrian
which passes through the middle of the pedestrian is detected
before clustering. The method to detect the vertical symmetry
axis will be discussed in detail in section IV. Z is a
normalization constant.
The revised refusable level would be effective spatial Fig. 2. Pixels on the boundary of the pedestrian
constraints and geometric symmetry constraints. For pixel x j , (2). In the second case, the pixel x j is on the boundary of the
the revised refusable level Wij represents the resistance to pedestrian, as shown in Fig. 2. The blue part is inside the
pedestrian, and the white part is outside the pedestrian. We
assign x j to class i by its neighbors and the mirror pixel. The
define the membership of the blue part belonging to the
fourth root is to ensure the revised refusable level keeps the pedestrian as ubfore and the membership of the blue part
same order of magnitude with 1  Pj (i ) .
belonging to the background as ubback . As the blue part is inside
If the first-order derivative of U j (i ) in formula (7) is
the pedestrian, we can infer that ubfore  ubback . The membership
Pj (i )
calculated, we will find that  0 . This means 1  Pj (i ) of the white part belonging to the pedestrian is uwfore and the
U j (i )
membership of the white part belonging to the background is
decreases following the decreasing of U j (i ) . Similarly, we can uwback . As the white part is outside the pedestrian, we can infer
Q j (i) that uwfore  uwback . Then, according to formula (14), we can
infer that  0 . Then 1  Q j (i) decreases following the
E j (i) calculate E fore and Eback :
decreasing of E j (i ) . Therefore Wij decreases following the
E fore  4 (1  ubfore )4  3  4 (1  ubfore )2 (1  uwfore )2 . (15)
decreasing of 1  Pj (i ) and 1  Q j (i) based on formula (11).
We explain the effectiveness of our revised refusable level Eback  4 (1  ubback )4  3  4 (1  ubback )2 (1  uwback )2 . (16)
Wij in two situations: It is obvious that ubfore and uwback are large while ubback and
(1). In the first case, the pixel x j is inside the pedestrian. We uwfore are small. They all range from 0 to 1. In our experiment,
define the membership of its neighbors belonging to the we find that ubback is very close to uwfore and ubfore is very close
pedestrian as u fore and the membership of its neighbors
to uwback as well. This property holds for almost all the test
belonging to the background as uback , respectively. As x j is images. For example, as shown in Fig. 3, three pixels on the
inside the pedestrian, the membership of its neighbors boundary of the pedestrian are marked by red dots. The average
belonging to the pedestrian is large. This means u fore  uback . ubfore , ubback , uwfore and uwback of these three pixels are obtained
Thus, we can infer that (1  Q fore )  (1  Qback ) based on (12) and from our experiment. The values of these memberships are
(14). Similarly, (1  Pfore )  (1  Pback ) could be derived based
ubfore  0.9993, ubback  0.0007, uwfore  0.0002, uwback  0.9998

1063-6706 (c) 2017 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TFUZZ.2017.2756827, IEEE
Transactions on Fuzzy Systems
5

. We can see that ubback is very close to uwfore and ubfore is very Here the distance measure between x j and center z is the
close to uwback . Therefore, we assume that ubback  uwfore and Euclidean norm. Actually, z is the mean value of X . When
the dataset is free of noise, the mean value is a good estimation
ubfore  uwback following [25].
of clustering center. However, once noisy points are mixed, an
obvious deviation of clustering center would produce. Hence,
mean value and the Euclidean distance is not robust. The
clustering center may be affected by the mirror pixels with
weak symmetry. Ideally, the contribution of the pixels with
strong symmetry should be enhanced. The negative effect of
the pixels with weak symmetry should be suppressed. Based on
this idea, we define the followed term.
N

Fig. 3. Pixels on the boundary of the pedestrian  (1  exp( max(|| x


j 1
rN j
'
r  vi ||2 ))). (19)
Moreover, the comparison of experimental results in section V
also proves that our results are closer to the ground truth. This In this paper, the Euclidean norm || x'j  vi ||2 is replaced by
means the pixels on the boundary of the pedestrian are (1  exp(  max(|| xr'  vi ||2 ))) to handle the weak symmetry of
classified correctly in our experimental results. Hence, our r N j

assumption is reasonable and effective. Then we can infer that pedestrians. Under the new metric, we use the same procedure
E fore  Eback based on this assumption and formulas (15), (16). to minimize formula (19). Then the clustering center vi can be
Therefore, W fore  Wback is obtained according to (11) and (12), expressed as below:
which also means the revised refusable level decreases the N exp(  max(|| xr'  vi ||2 ))
vi   N
r N j
resistance to assign pixel x j to the class of pedestrian and x 'j

increases the resistance to assign pixel x j to the class of


j 1

j 1
exp(  max(|| xr  vi || ))
r N j
' 2

(20)
background. N j
By analyzing the two situations above, we prove that the = N
x 'j .
revised refusable level in our method is reasonable. For the
j 1

j 1
j

pixels belonging to pedestrians, the revised refusable level will


decrease the resistance to assign the pixels to the class of Where N j is the neighborhood of the mirror pixel x 'j .  is
pedestrian. Meanwhile, it will increase the resistance to assign a constant and xr' is the neighbors of the mirror pixel x 'j .
the pixels to the class of background.
max(|| xr'  vi ||2 ) is the maximum value of || xr  vi || .
' 2
By considering strict symmetry information, the objective r N j
function of improved FCM can be expressed as below:
N c
 j  exp(  max(|| xr'  vi ||2 )) is a form of weight. From the
r N j
J  W u (|| x j  vi ||  j || x  vi || ).
m 2 ' 2
(17)
definition of  j , we can infer that it is a decreasing function.
ij ij j
j 1 i 1

x is the mirror pixel of x j and  j is a constant weighing


'
j Hence, j increases following the decreasing of
the symmetry of x j . max(|| x  vi || ) under the new metric. Therefore, when the
'
r
2
r N j

symmetry of the mirror pixel is strong, max(|| xr'  vi ||2 ) is


r N j
B. Improvement based on Weak Symmetry Information
Pedestrians are non-rigid objects. Hence, they are not strictly small. This means the weight  j is big and the contribution of
symmetric. For example, the mirror pixel of x j may be outside the mirror pixel for determining clustering center is big. When
the pedestrian, which means the mirror pixel could be seen as the mirror pixel is outside the pedestrian, the symmetry is weak.
noise point compared with x j . In this case, max(|| xr'  vi ||2 ) is big and the weight  j is small.
r N j

Suppose that X  x1 , x2 xN  is a dataset and z is the This means the contribution of the mirror pixel for determining
clustering center of X . By minimizing the mean square error the clustering center is small. Therefore, the contribution of
N those pixels with strong symmetry is enhanced and the
 || x j  z ||2 , we can obtain a good estimation of z [26]: contribution of those pixels with weak symmetry is suppressed.
j 1
 is defined as the reciprocal of the variance in Alternative
N
FCM method (AFCM) [26]. Actually, the influence of  has
x j

z
j 1
. (18) been analyzed in [26]. If  is very small, the membership of
N AFCM will be very close to FCM membership. And if  is
very large, the AFCM membership performs well for the

1063-6706 (c) 2017 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TFUZZ.2017.2756827, IEEE
Transactions on Fuzzy Systems
6

characteristic of separation. This means the reciprocal of the N c

variance is a good estimation for  in the metric J GSFCM  Wij uijm (|| x j  vi ||2
j 1 i 1 (23)
(1  exp(  || xr'  vi ||2 )) . In this paper, we redefine  as   j (1  exp(  max(|| x  vi || )))). ' 2
r
r N j
below:
1 S fore The calculation of  has been analyzed and  j is
=  . (21)
 fore S calculated according to formulas (5) and (6). Under the
c
 fore is the variance of pedestrian region and S fore is the constraints uij  [0,1], u i 1
ij  1 , Lagrange multiplier is used to
area of pedestrian region. S is the area of the entire image. To construct a new objective function F to solve the objective
calculate  fore and S fore , FCM is utilized to segment the image function:
N c
in advance and the approximate region of the pedestrian can be
F  Wij uijm (|| x j  vi ||2
identified according to the segmentation result of FCM. We j 1 i 1
treat those pixels, which are assigned to the class of pedestrians (24)
c
in the segmentation result of FCM, as the pedestrian region.   j (1  exp(  max(|| x  vi || ))))   ( uij  1).
'
r
2
r N j
Thus,  fore and S fore can be calculated. i 1

Then we calculate the first derivations with respect to uij , vi ,


As we have used FCM to segment the image in advance and
 fore represents the variance of pedestrian region. 1/  fore is  and set them as 0 :
F
much larger than the reciprocal of the variance of the entire  mWij uijm 1 (|| x j  vi ||2
image. Moreover, infrared pedestrians used in our experiment uij (25)
are not very small. Then S fore / S takes a value between 0 and   j (1  exp(  max(|| xr'  vi ||2 ))))    0.
r N j
1. Hence, the redefined  is still larger than the reciprocal of
F N
the variance of the entire image. In brief, the redefined  is  Wij uijm (|| x j  vi ||
vi j 1 (26)
also a form of variance and is larger than that of AFCM. Hence,
it is reasonable to infer that the redefined  in the proposed     j || xr'  vi ||  exp(  max(|| xr'  vi ||2 )))  0.
rN j
method is more useful than that of AFCM. F c
In addition, it is not difficult to calculate  . We just need to   uij  1  0. (27)
 i 1
calculate the variance and area of pedestrian region before
By solving (25), (26) and (27) simultaneously, we can derive
clustering. And the pedestrian region contains those pixels
the formulas of membership uij and clustering center vi :
which are assigned to the class of pedestrians in the result of
FCM segmentation results. uij 
More relative information of x 'j are taken into consideration 1
by using max(|| x  vi || ) . Moreover, taking the maximum of
'
r
2
Wij (|| x j  vi ||  j (1  exp(  max(|| xr'  vi ||2 ))))
2
(28)
r N j rN j
c
.
|| xr  vi ||2 can further reduce the weight  j , which means the  W (|| x
1
k 1 ij j  vk ||2  j (1  exp(  max(|| xr'  vk ||2 ))))
negative effects of those pixels with weak symmetry are further r N j

suppressed. vi 
By considering weak symmetry information, the objective N
function of our proposed method can be expressed as below:
N c
W u
j 1
m
ij ij ( x j     j  x 'j  exp(  max(|| xr'  vi ||2 )))
r N j (29)
J = uijm (|| x j  vi ||2  j (1  exp(  max(|| xr'  vi ||2 )))). (22) N
.
j 1 i 1
rN j
W u
j 1
m
ij ij (1     j  exp(  max(|| x 'j  vi ||2 )))
r N s
xr' is the neighbor of the mirror pixel x 'j and  j is a
Compared with visible images, infrared images have less
constant weighing the symmetry of x j . useful features. But infrared pedestrians are highlight regions.
Thus the symmetry of infrared pedestrian is clearer and more
reliable in infrared images. Especially when the pedestrian is
C. The Final Improved FCM based on Symmetry Information
vertical, it would be almost strictly symmetric in infrared
In this part, we combine the improvements proposed in the images. Hence, symmetry is a reliable feature in infrared
two parts above to improve the performance of FCM. The final images. As we have analyzed above, the proposed method has
objective function of our improved FCM method can be considered the symmetry information flexibly. Therefore, the
expressed as below: proposed method is mainly focused on infrared images.
The robustness of our method is enhanced by introducing
symmetry information. To segment the pedestrians with weak

1063-6706 (c) 2017 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TFUZZ.2017.2756827, IEEE
Transactions on Fuzzy Systems
7

Fig. 4. The process of symmetry axis detection

symmetry, a new metric is utilized to replace the Euclidean Moreover, the two image sets are captured at day and night,
distance, which makes our method more effective for infrared respectively. Thus the image sets used in our experiment have
pedestrian segmentation. different contrasts and noise levels. The sizes of the two image
sets are 400  700 and 287  384 , respectively. The
IV. THE WHOLE PROCEDURE OF INFRARED PEDESTRIAN experiment is executed under the environment of Intel Core
SEGMENTATION i5-6500 CPU, 3.20GHz, with 8GB RAM. To prove the
As symmetry information is used in our method, symmetry efficiency of our method for infrared pedestrian segmentation,
axis detection is the first task. To detect the symmetry axis, some widely used algorithms are utilized as comparison
Hou’s method [27] is used to generate the saliency map, which methods, including unimodal thresholding [7], maker based
is used to identify the pedestrian regions. Then the pedestrian watershed [9], FCM, AFCM [26], FLICM [11], KIFCM [14],
regions in the infrared images are processed by FCM. GQFCM [25] and Circular Shortest Path (CSP) [10]. These
Afterwards, the sum of the columns of the FCM result is methods are also tested in the same datasets. The experimental
calculated and a summation curve is generated. Finally, the results demonstrate that our method performs better for infrared
locations of the peaks of the curve are considered as the pedestrian segmentation compared with these comparison
locations of the symmetry axis. The detection process is shown methods.
in Fig. 4.
After the symmetry axis is obtained, the proposed FCM A. Parameter Information
method which has been described in details in section III is used
Any infrared image utilized in our experiment contains four
to segment infrared pedestrians. The proposed method is
parts: sky, street scene, pedestrians and ground. Thus we set the
executed in the following steps:
number of clusters as 4 for FCM clustering methods.
(1). Initialize the number of clusters c , fuzzy factor m ,
To make it fair, some main parameters used in FCM, AFCM,
threshold  and the maximum number of iterations T .
FLICM, KIFCM, GQFCM and our proposed method are the
(2). Update the revised refusable level Wij . same and they are shown in table I.
(3). Update the membership metric U and clustering center TABLE I
PARAMETER INFORMATION
V .
Maximum
(4). If || J (t  1)  J (t ) ||  , stop; otherwise go to step 2). Clustering Fuzzy The
Name number
number factor threshold
of iterations
Where J represents the value of the objective function and
Symbol c m  T
t is the number of iterations. Value 4 2 0.00001 200
The clustering result is a binary image, in which the The CSP method proposed in [10] has been also used for
background is set as black and the target is set as white. But in infrared pedestrian segmentation and obtained good results.
some cases, the clustering results may contain false alarms. There is only one parameter in the CSP method, which is the
False alarms are the regions which are assigned as pedestrians threshold for saliency and is manually set as 0.25. It has been
but do not contain any pedestrian actually. False alarms are analyzed in [10] that 0.25 is an effective value for CSP.
usually very small or connected to image border, which could KIFCM method utilized the Gaussian kernel and hesitation
be easily removed if necessary. To remove these false alarms, degree to improve the performance of FCM. There are two
those regions which are very small or connected to image parameters:  for calculating the Gaussian kernel and  for
border will be set as background.
calculating the hesitation degree. In this paper, we use the
“trial-and-error” method to determine the two parameters. We
V. EXPERIMENTAL RESULTS
find that if  and  are too small or too big, KIFCM cannot
The infrared pedestrian dataset used in this paper contains
segment the infrared pedestrian correctly. Thus we test  from
two image sets with different types of infrared pedestrians and
1 to 200 and  from 0 to 100. Then we calculate the
backgrounds. The first image set contains 339 pedestrians in
186 infrared images. The second image set contains 67 quantitative measures to identify the best parameters. Finally,
pedestrians in 53 infrared images. And, the backgrounds of the we obtain  =45 and   0.75 in our experiment for
second image set are more complex compared with the first one. comparison.

1063-6706 (c) 2017 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TFUZZ.2017.2756827, IEEE
Transactions on Fuzzy Systems
8

B. Segmentation Result Comparison These comparisons demonstrate that the proposed method
To make a fair comparison, all the segmentation results of performs better for infrared pedestrian segmentation compared
the comparison methods are also processed by the same with the comparison methods.
additional processing steps as our method, which have been C. Quantitative Comparison
described in C part of section III. These additional processing
To further prove the effectiveness of our method, three
steps are helpful for reducing false alarms in the comparison
quantitative measures are utilized for quantitative comparison.
results.
Misclassification error (ME) [28], [29] reflects the ratio of
An example of segmentation results is shown in Fig. 5. We
misclassifications between foreground and background. Hence,
can see that the unimodal method can separate pedestrians from
a lower ME means better segmentation results. The relative
background in some cases. But some parts of pedestrians may
foreground area error (RAE) [30] reflects the accuracy and
lose due to the problem of intensity inhomogeneity. Marker
integrality of the segmented target. Obviously, a lower RAE
based watershed can reduce false alarms and has some
indicates that the segmented result is closer to the ground truth.
improvements. But this method is sensitive to the markers and
F-measure ( F ) [32], [33] is the harmonic mean of precision
fails to separate pedestrians in many cases. FCM can not finish
the task, and many false alarms remained in the segmentation and recall, which fully takes false alarms and missing alarms
results of FCM. Compared with FCM, there is no significant into consideration. A higher F-measure means a more effective
improvement in the results of AFCM, FLICM and KIFCM. method. The definitions of ME, RAE and F-measure are as
False alarms are still remained in the results of these methods. below [10]:
GQFCM segments infrared pedestrians successfully and | B  BS |  | FT  FS |
ME  1  T . (30)
reduces false alarms. However, in the segmentation results of | B T FT |
GQFCM, some parts of trees and street lamps are still remained
 FT  FS
in the results of the first image set. Moreover, some parts of  F if FT  FS
pedestrians may lose and there are many false alarms remained 
RAE  
T
. (31)
in the results of the second image set. The Circular Shortest  FS  FT if F  F
Path can successfully segment pedestrians and remove false  FS S T

alarms. But this method fails to segment the limbs of pedestrian


(1   )  P  R
in some cases, which leads to a partial loss of pedestrians. Our F  . (32)
method can separate pedestrians from background and remove PR
false alarms effectively. In addition, the limbs of pedestrian are In formulas (30) and (31), BT and FT are the areas of the
segmented correctly in our segmentation results. Therefore, the background and foreground of the ground truth, respectively.
segmented pedestrians in our results are more accurate. BS and FS are the areas of the background and foreground of
the segmentation results, respectively.
Suppose that there are PS positive samples and NS
negative samples in a dataset. In the experimental result, there
are TP positive samples which are correctly classified as
positive and there are FN positive samples which are
incorrectly classified as negative. Hence, PS =TP  FN .
Similarly, there are FP negative samples which are
incorrectly classified as positive and there are TN negative
samples which are correctly classified as negative. Hence,
NS =FP  TN . The definitions of P and R are as below:
TP
P= . (33)
Fig. 5. An example of segmentation result TP  FP
More examples of segmentation results and the ground truth TP
of the pedestrians are shown in Fig. 6, Fig. 7 and Fig. S1 (Fig. R= . (34)
TP  FN
S1 is supplied in the supplementary material). We can see that
P is precision. It represents the proportion of positive
false alarms are remained in the segmentation results of FCM,
samples which are correctly classified in the positive result of
AFCM, FLICM and KIFCM. Actually, these four methods can
the classifier. R is recall. It represents the proportion of
not finish this task. In the results of Unimodal, Watershed and
positive samples which are correctly classified in the positive
GQFCM, improvements are made and false alarms are reduced
samples of the whole dataset.  is a constant. Following [32]
in some cases. In the results of Circular Shortest Path, false
alarms are removed significantly. But the limbs of pedestrians and [33], we set   0.5 in our experiment.
are not well segmented in some cases. In the results of our The quantitative measures are calculated on the two image
method, the false alarms are removed and the limbs of sets. The results of quantitative measures are shown in table II.
pedestrians are segmented correctly. From table II, we can see that our method has the lowest ME
and RAE, in the meanwhile, the highest F-measure compared

1063-6706 (c) 2017 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TFUZZ.2017.2756827, IEEE
Transactions on Fuzzy Systems
9

Fig. 6. More examples of segmentation results

1063-6706 (c) 2017 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TFUZZ.2017.2756827, IEEE
Transactions on Fuzzy Systems
10

Fig. 7. More examples of segmentation results

1063-6706 (c) 2017 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TFUZZ.2017.2756827, IEEE
Transactions on Fuzzy Systems
11

TABLE II
THE VALUES OF QUANTITATIVE MEASURES
Our
Unimodal Watershed FCM AFCM FLICM KIFCM GQFCM CSP
method
ME (%) 0.5235 1.1707 2.2672 2.8366 2.2181 2.5236 0.8933 0.2014 0.1400

RAE (%) 47.0864 34.2855 52.5249 61.4967 51.5960 56.8152 36.0871 16.4881 8.6018

Fα (%) 69.0452 85.1308 90.4818 87.1914 91.0669 90.4307 88.6198 92.3489 96.4115

TABLE III
THE COMPARISON RESULT OF AVERAGE TIME
Our
Method Unimodal Watershed FCM AFCM FLICM KIFCM GQFCM CSP
Method
Time (s) 0.3427 1.3524 9.1407 9.8547 102.5202 10.9727 2.3316 1.6881 45.927

TABLE IV
THE COMPARISON RESULT OF TIME COMPLEXITY
Method Unimodal Watershed FCM AFCM FLICM KIFCM GQFCM CSP Our Method

Complexity O( N ) O( N ) O( Nc) O( Nc) O( NcSr ) O( Nc) O( Nc) O( N ) O( Nc log Sr )

Fig. 8. Segmentation of other kinds of images

with other methods. The lowest ME means the pixels which are all methods. Our method is faster than FLICM but slower than
incorrectly classified by our method are less than other other methods. Parallel programming and GPU are effective
comparison methods. The lowest RAE indicates that the ways to reduce the running time, which would be the future
pedestrian target segmented by our method is the closest to the work. However, our method performs very well for infrared
ground truth among all the methods. Moreover, the highest pedestrian segmentation and the performance is better than
F-measure indicates that the segmentation results of our other methods. In all, our method is effective for infrared
method are more accurate and holonomic. It also proves that pedestrian segmentation.
our method performs better for infrared pedestrian As the computational cost above is also influenced by the
segmentation compared with other methods. programming style, a further time complexity analysis is
Hence, the quantitative comparison results also prove that presented in this paper, as shown in table IV. N is the number
our method is more effective for infrared pedestrian of pixels. c is the number of clusters and S r represents the
segmentation compared with other comparison methods.
size of neighborhood N j . Unimodal thresholding, Watershed
D. Computational Comparison and CSP are non-iterative methods. They have the same time
Table III shows the average running time of each method. complexity O( N ) . The time complexity of FCM is O( Nc)
All the methods are tested under the same computational [11]. And, it is easy to derive that AFCM, KIFCM and GQFCM
environment of Intel Core i5-6500 CPU, 3.20GHz, with 8GB have the same complexity with FCM [11]. A fuzzy factor G
RAM and on the same image sets. Table III shows that
has been added to consider neighborhood information in
unimodal, watershed and CSP method consume less time, as
FLICM, thus we can infer that the complexity of FLICM should
these methods do not need iterative calculations. The running
be O( NcSr ) based on [11]. As shown in formula (23), the
time of clustering methods increases compared with the
non-iterative methods. FLICM consumes the most time among maximum operation is applied on the neighborhood in our

1063-6706 (c) 2017 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TFUZZ.2017.2756827, IEEE
Transactions on Fuzzy Systems
12

Fig. 9. Segmentation of noisy infrared pedestrian images

Fig. 10. Visual effect comparison

method and the complexity of the maximum operation is Fig. 8c), which also proves that our proposed method with
O(log Sr ) . Thus the time complexity of our method is symmetry information could be used for MRI brain image
O( Nc log Sr ) . Considering log Sr  Sr , the time complexity of segmentation. Obviously, our method performs well for the
segmentation of loosely symmetric objects. That is because:
our method is less than FLICM, which is consistent with the
(1). We have analyzed the parameter  j . If the symmetry of
average running time comparison shown in table III.
x j is strong,  j  1 . On the contrary, if the symmetry of
VI. EXTENSIVE WORKS AND DISCUSSIONS x j is weak,  j  0 . This means if the symmetry of the
A. Extension to Other Application Areas pixel x j is strong, then the influence of the mirror pixel
Actually, symmetry is a common feature of objects. There
will be enhanced. Otherwise, if the symmetry is weak, the
are many symmetric objects around us, such as buildings,
influence of the mirror pixel will be ignored.
vehicles, animals, clothes and even human faces. They are all
(2). As we have discussed in section III, || x'j  vi ||2 is
symmetric to some degrees. Hence, there is no doubt that
symmetry information would be a great help for segmenting replaced by (1  exp(  max(|| xr'  vi ||2 ))) in our
r N j
these objects. Thus it is possible to generalize the idea of our
method to other application areas. To prove the possibility of method. The effect of this improvement is that the
generalizing the idea of our method to other application areas, contributions of those pixels with strong symmetry are
some real-world visible images, infrared ship images and MRI enhanced and the contributions of those pixels with weak
images are used for further experiment. The segmentation symmetry are suppressed.
results are shown in Fig. 8. In all, the use of symmetry information in our method is
In Fig. 8, we can see that the targets are not strictly more flexible and reasonable. Therefore, the applicability of
symmetric, even asymmetric in vertical direction. But our our method could be generalized, not limited to strictly
method still performs well. In real-world visible images, the symmetric objects or infrared images.
targets are extracted successfully. Similarly, although infrared
ships are not strictly symmetric, they are well segmented by our B. The Analysis of Noise Effect
method. Hence, our method can also segment other infrared Infrared images are always accompanied by low resolution,
targets successfully besides infrared pedestrians. Correct low-contrast and intensity inhomogeneity. These shortcomings
segmentation results of MRI images are obtained as shown in are usually caused by noises in infrared imaging process. In the

1063-6706 (c) 2017 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TFUZZ.2017.2756827, IEEE
Transactions on Fuzzy Systems
13

meanwhile, pedestrian is not the only object which is warm in TABLE V


QUANTITATIVE COMPARISON
the environment. Many warm objects can be captured by
Method Our
infrared camera. These objects are also highlight regions in FCM_S FLICM
in [35] method
infrared images, such as lamps and vehicles. All of these could ME (%) 2.5692 2.2181 2.4808 0.1400
be recognized as noises compared with infrared pedestrians.
Thus noise tolerance is a necessary demand for an infrared RAE (%) 59.4112 51.5960 59.0261 8.6018
pedestrian segmentation method. To analyze the effect of Fα (%) 91.4662 91.0669 90.8551 96.4115
noises on our method, a further experiment is executed.
To analyze the effect of noises on infrared pedestrian
Actually, in our method, symmetry information is handled
segmentation, different percentages (standard deviation from
by adding a term into the objective function, which acts as the
0.05 to 0.2) of Gaussian noises are added in infrared pedestrian
role of regularization term.
images. Fig. 9 shows an example of infrared pedestrian image
In summary, neighborhood information is considered by
which contains Gaussian noises with mean 0 and standard
changing the fitting terms of the objective function, and
deviation 0.2. More segmentation examples of infrared
symmetry information is considered by adding term into the
pedestrian images with different percentages of noises are
objective function in our method. The symmetry information
shown in Fig. S2 (Fig. S2 is supplied in the supplementary
acts as the role of regularization term in the objective function
material). The processed results of our method in Fig. 9 show
of our method. The most important of all, as we have analyzed
that our method can remove the negative effect of noises. The
above, the experimental results have indicated that our method
infrared pedestrian can still be extracted successfully in the
is more effective for infrared pedestrian segmentation
results of our method. Moreover, some other synthetic images
compared with the regularization based FCM methods.
with the same Gaussian noises are also processed by our
method and our method can still obtain correct segmentation
VII. CONCLUSION
results as shown in Fig. S3 (Fig. S3 is supplied in the
supplementary material). All the experimental results verified Pedestrians are highlight regions in infrared images and they
that our method still performs well for noisy infrared pedestrian always present obvious clustering property. The blur edges
images and other noisy synthetic images. Based on the analysis result in blur boundaries in infrared images, which would
above, it is reasonable to believe that our method is not very increase the uncertainty of infrared images. Thus fuzzy
sensitive to noises and can handle general noisy images. clustering method would be an effective segmentation method
for infrared images. Also infrared images lack texture features,
which means the shape information would be more important in
C. Comparison with Regularization Based FCM Methods infrared images. Thus, symmetry is an important feature of
Adding regularization terms into the objective function is infrared pedestrians. It is reasonable to believe that symmetry
one of the used ways to improve the performance of FCM. information will be helpful to improve the performance of FCM
Some improved FCM methods handle the neighborhood for infrared pedestrian segmentation. In our method, symmetry
information by adding regularization terms, such as FCM_S information is introduced by Markov Random Field theory.
[34], FLICM and the method in [35]. In our method, the However, pedestrians are non-rigid objects and the vertical
neighborhood information is incorporated by the MRF theory, symmetry is weak in some cases. To overcome this problem,
which is handled as a form of weight Wij in the objective we use a new metric to enhance the contribution of those pixels
function. To prove the efficiency of our method for infrared with strong symmetry. Finally, our segmentation results are
pedestrian segmentation, our method is compared with these compared with eight existing segmentation methods from two
three regularization based FCM methods. The segmentation aspects: visual effect comparison and quantitative comparison.
results are shown in Fig. 10. To be fair for the comparison, We can see that our segmentation results are more accurate and
these results are also processed by the same additional closer to the ground truth, which means our method is more
processing steps as our method. effective for infrared pedestrian segmentation compared with
From Fig. 10, we can see that the three regularization based other segmentation methods.
FCM methods cannot extract the infrared pedestrian There are many symmetric objects around us, in vertical or
successfully. Moreover, the quantitative comparison is also horizontal or a certain direction. Symmetry information would
executed and the three quantitative measures are calculated. affect the accuracy of the segmentation. Normally, these
The comparison result is shown in table V. objects are not strictly symmetric. To handle the weak
As shown in table V, our method still has the lowest ME, symmetry of these objects, the symmetry constraints have been
RAE and the highest F-Measure. This means our method is loosened in our method. This improvement makes the proposed
more effective for infrared pedestrian segmentation compared method adapt to the weak symmetry better. Thus the proposed
with the other three regularization based FCM methods. method would be used in a wide range. In section VI, we have
Therefore the way to handle the neighborhood information in proved that our method can be generalized to other application
our method is reasonable and effective for infrared pedestrian areas and can also obtain correct segmentation results.
segmentation. Moreover, infrared pedestrian segmentation is the
foundation of infrared pedestrian detection. The performance of

1063-6706 (c) 2017 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TFUZZ.2017.2756827, IEEE
Transactions on Fuzzy Systems
14

segmentation will directly affect the accuracy of detection. [21] J. C. Bezdek, “Fuzzy Mathematics in Pattern Classification,” Ph.D. New
York, 1973.
Considering the widely-used application of pedestrian [22] Y. Feng and W. Chen, “Brain MR Image Segmentation Using Fuzzy
detection, infrared pedestrian segmentation would play an Clustering with Spatial Constraints Based on Markov Random Field
important role in many fields, such as the night driving Theory,” Lect. Notes Comput. Sci., vol. 3150, pp. 188-195, 2004.
[23] S. A. Jayasuriya and A. W. Liew, “Symmetry incorporated Fuzzy
assistance system, intelligent monitoring system and pedestrian
C-means Method for Image Segmentation,” IEEE Int Conf Fuzzy Syst., pp.
behavior analysis. Hence, the main idea of our method could be 1-7, 2013.
generalized to other areas. Symmetry information would [24] X. Bai, Z. Chen, Y. Zhang, Z. Liu and Y. Lu, “Spatial information based
improve the results in many cases. FCM for infrared ship target segmentation,” IEEE Int. Conf. Image
Process., pp. 5127-5131, 2014.
[25] X. Bai, Z. Chen, Y. Zhang, Z. Liu and Y. Lu, “Infrared Ship Target
Segmentation Based on Spatial Information Improved FCM,” IEEE
REFERENCES Trans. Cybern., vol. 46, no. 12, pp. 3259-3271. 2016.
[26] K. Wu and M. Yang, “Alternative c-means clustering algorithms,”
[1] Y. S. Lee, Y. Chan, L. Fu and P. Y. Hsiao, “Near-Infrared-Based Pattern Recognit., vol. 35, no. 10, pp. 2267-2278, 2002.
Nighttime Pedestrian Detection Using Grouped Part Models,” IEEE [27] X. Hou, J. Harel and C. Koch, “Image Signature: Highlighting Sparse
Trans. Intell. Transp. Syst., vol. 16, no. 4, pp. 1929-1940, Aug. 2015. Salient Regions,” IEEE Trans Pattern Anal Mach Intell., vol. 34, no. 1, pp.
[2] W. Liu, B. Yu, C. Duan, L. Chai, H. Yuan and H. Zhao, “A 194-201, 2012.
Pedestrian-Detection Method Based on Heterogeneous Features and [28] Z. Liu, F. Zhou, X. Chen, X. Bai and C. Sun, “Iterative infrared ship target
Ensemble of Multi-View–Pose Parts,” IEEE Trans. Intell. Transp. Syst., segmentation based on multiple features,” Pattern Recognit., vol. 47, no.
vol. 16, no. 2, pp. 813-824, Apr. 2015. 9, pp. 2839-2852. 2014.
[3] X. Li, L. Li, F. Flohr, J. Wang, H. Xiong, M. Bernhard, S. Pan, D. M. [29] W. A. Yasnoff, J. K. Mui, and J. W. Bacus, “Error measures for scene
Gavrila and K. Li, “A Unified Framework for Concurrent Pedestrian and segmentation,” Pattern Recognit., vol. 9,no. 4, pp. 217-231, 1977.
Cyclist Detection,” IEEE Trans. Intell. Transp. Syst., vol. 18, no. 2, [30] Z. Fan, C. Wang and X. Ma, “Double-threshold image segmentation
pp.269-281, 2017. method based on gray gradient,” Proc SPIE, vol. 7506, pp.
[4] S. Zhang, C. Bauckhage and A. B. Cremers, “Efficient Pedestrian 75062O-75062O-8, 2009.
Detection via Rectangular Features Based on a Statistical Shape Model,” [31] R. C. Gonzalez, et al. “Digital Image Processing,” Addison-Wesley,
IEEE Trans. Intell. Transp. Syst., vol. 16, no. 2, pp. 763-775, Apr. 2015. Massachusetts, 1992.
[5] K. Li, X. Wang, Y. Xu and J. Wang, “Density Enhancement-Based [32] T. Liu, Z. Yuan, J. Sun, J. Wang, N. Zheng, X. Tang and H. Y. Shum,
Long-Range Pedestrian Detection Using 3-D Range Data,” IEEE Trans. “Learning to Detect a Salient Object,” IEEE Trans Pattern Anal Mach
Intell. Transp. Syst., vol. 17, no.5, pp. 1368-1380, May. 2016. Intell., vol. 33, no.2, pp. 353-367. Feb. 2011.
[6] N. Otsu, “A Threshold Selection Method from Gray-Level Histograms,” [33] D. R. Martin, C. C. Fowlkes and J. Malik, “Learning to Detect Natural
IEEE Trans Syst Man Cybern., vol. SMC-9, no. 1, pp. 62-66, Jan. 1979. Image Boundaries Using Local Brightness, Color, and Texture Cues,”
[7] P. L. Rosin, “Unimodal thresholding,” Pattern Recognit., vol. 34, no. 11, IEEE Trans Pattern Anal Mach Intell., vol. 26, no. 5, pp. 530-549. May.
pp. 2083-2096, Nov. 2001. 2004.
[8] J. Kittler and J. Illingworth, “Minimum Error Thresholding,” Pattern [34] M. N. Ahmed, S. M. Yamany and N. Mohamed, “A Modified Fuzzy
Recognit., vol. 19, no. 1, pp. 41-47, 1986. C-Means Algorithm for Bias Field Estimation and Segmentation of MRI
[9] F. Zhou, X. Bai, Y. Xie and T. Jin, “Marker based watershed to segment Data,” IEEE Trans. Med. Imag., vol. 21, no. 3, pp. 193-199, March. 2002.
dim infrared target through image enhancement,” Proc SPIE, vol. 7128, [35] G. Liu, Y. Zhang and A. Wang, “Incorporating Adaptive Local
pp. 71280O-71280O-7, 2008. Information Into Fuzzy Clustering for Image Segmentation,” IEEE Trans.
[10] X. Bai, P. Wang and F. Zhou, “Pedestrian Segmentation in Infrared Image Process., vol. 24, no. 11, pp. 3990-4000, 2015.
Images Based on Circular Shortest Path,” IEEE Trans. Intell. Transp.
Syst., vol. 17, no. 8, pp. 2214-2222, 2016.
[11] S. Krinidis, V. Chatzis, “A Robust Fuzzy Local Information C-Means
Clustering Algorithm,” IEEE Trans. Image Process., vol. 19, no. 5,
pp.1328-1337, 2010.
[12] S. P. Awate, H. Zhang and J. C. Gee, “A Fuzzy, Nonparametric
Segmentation Framework for DTI and MRI Analysis: With Applications
to DTI-Tract Extraction,” IEEE Trans. Med. Imag., vol. 26, no. 11,
pp.1525-1536, 2007.
[13] M. Gong, L. Su, M. Jia and W. Chen, “Fuzzy Clustering With a Modified
MRF Energy Function for Change Detection in Synthetic Aperture Radar
Images,” IEEE Trans Fuzzy Syst., vol. 22, no. 1, pp. 98-109, Feb. 2014.
[14] K. Lin, “A Novel Evolutionary Kernel Intuitionistic Fuzzy C-means
Clustering Algorithm,” IEEE Trans Fuzzy Syst., vol. 22, no. 5, pp.
1074-1087, Oct. 2014.
[15] J. K. Parker and L. O. Hall, “Accelerating Fuzzy-C Means Using an
Estimated Subsample Size,” IEEE Trans Fuzzy Syst., vol. 22, no. 5, pp.
1229-1244, Oct. 2014.
[16] J. M. Leski, “Fuzzy (c+p)-Means Clustering and Its Application to a
Fuzzy Rule-Based Classifier: Toward Good Generalization and Good
Interpretability,” IEEE Trans Fuzzy Syst., vol. 23, no. 4, pp. 802-812, Aug.
2015.
[17] S. Chang, K. Lu and M. Yang, “Fuzzy Change-Point Algorithms for
Regression Models,” IEEE Trans Fuzzy Syst., vol. 23, no. 6, pp.
2343-2357, Dec. 2015.
[18] L. Maciel, R. Ballini and F. Geomide, “Evolving Possibilistic Fuzzy
Modeling for Realized Volatility Forecasting with Jumps,” IEEE Trans
Fuzzy Syst., vol. 25, no. 2, pp.302-314, 2017.
[19] T. C. Glenn, A. Zare and P. D. Gader, “Bayesian Fuzzy Clustering,” IEEE
Trans Fuzzy Syst., vol. 23, no. 5, pp. 1545-1561, Oct. 2015.
[20] J. C. Dunn, “A Fuzzy Relative of the ISODATA Process and Its Use in
Detecting Compact Well-Separated Clusters,” J Cybern., vol. 3, no. 3, pp.
32-57, 1973.

1063-6706 (c) 2017 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.

Potrebbero piacerti anche