Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Abstract — Computer vision techniques for traffic Various studies have been done in this field using different
surveillance have proved to be non invasive, automated, features. Viola Jones algorithm made use of haar like
and cost effective solutions to detect, count, and classify features for rapid object detection [1]. It suffers from high
vehicles, finding immense application in intelligent false positive rate. The HOG features have been used to
transportation system. Since vehicles have high intra class capture local shape characteristics [2]. But, the HOG
variation and relatively low interclass variation, features involve huge computations. LBP is a robust texture
classification poses a great challenge, when the video descriptor, which is computationally efficient [3].
resolution is low, under varying illumination, occlusion etc. Combination of HOG and LBP takes advantage of each of
To overcome these problems, a novel method for detection and the descriptor, but this approach increases the feature
classification of moving objects in traffic surveillance scenes is dimension [4]. Centre symmetric LBP reduces the
proposed. From the side view, the detected objects are classified computational load by half that of LBP [5]. CS LBP
into pedestrians, two wheelers and cars. The moving object provides a measure of intensity difference of each given
segmentation is done using background subtraction. Features are
pixel. Since orientation of each pixel is lost, a new lower
extracted from the segmented foreground using haar wavelets
and oriented centre symmetric local binary pattern (OCS LBP). dimentional feature oriented CS LBP, known as Oriented
Hierarchical SVM classifier, trained using the extracted features, Centre Symmetric Local Binary Pattern (OCS LBP) was
classifies the detected objects. The proposed algorithm is able to devised [6]. The proposed work combines the shape
classify up to 97%. features obtained from haar wavelets and texture features
extracted by OCS LBP to obtain a descriptor that provides
Index Terms — Haar wavelets, OCS LBP. Contrast ratio, better performance and reduced features..
Support Vector Machine
II. METHODOLOGY
I. INTRODUCTION
The primary step in this work is the segmentation of the
In recent years computer vision technologies have found moving objects from the video frames. For this a simple
wide applications in our day to day life for convenience background subtraction technique is used. Subsequently,
improvement. Real time moving object classification is an shadow removal is done using contrast enhancement, to extract
important issue in video surveillance with a great potential only actual foreground blobs. Shape feature and texture
for use in various applications such as motion capture features are extracted using Haar DWT sub bands and
techniques, Intelligent Transportation System (ITS), and OCSLBP. Classification is done using hierarchical binary
video interpretation. Intelligent transportation systems can SVM classifier. Fig. 1 shows the proposed methodology.
be used for monitoring traffic conditions, reducing
congestion, enhancing mobility, optimizing the use of INPUT VIDEO
transport infrastructure and improving security. The
detailed classification of moving objects can draw traffic
forecasts and advise the best departure times in different MOVING OBJECT
regions, and alternative routes for major roads. Object SEGMENTATION
classification in surveillance scenes is challenging for the
following aspects. Primarily, the region of interest in
surveillance scenes is of reduced size and low resolution SHAPE FEATURE EXTRACTION
due to the capacity of surveillance cameras. The intra class USING HAAR DWT
variance for each class is very large than inter class
variation. Object appearance is highly varied for different
view angles and illuminations. Therefore, real time TEXTURE FEATURE
performance is limited by such complexities. EXTRACTION USING OCSLBP
CLASSIFICATION USING
BINARY SVM CLASSIFIER
Fig. 1 Proposed methodology
1305
IEEE Sponsored 2nd International Conference on Innovations in Information, Embedded and Communication systems (ICIIECS)‟15
1 0 ≤ t < 1
φ(t) = (5)
0, 𝑖𝑓 𝐼𝑛 𝑥, 𝑦 − 𝐵𝑛 𝑥, 𝑦 < 𝑇ℎ 0 otherwise
𝐷𝑛 𝑥, 𝑦 = (1)
1, 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒
The above scaling and wavelet basis functions decomposes an
image into four square sub-bands, which includes the average
where Th is a predefined threshold and chosen as the sub-band(LL1), horizontal sub-band (LH1), vertical sub-band
average of the difference image Dn. After background (HL1) and diagonal sub-band (HH1). The average sub-band
subtraction, simple morphological operations are performed LL1 is further decomposed into the sub bands LL2, LH2,
for noise removal. HL2, and HH2 as shown in Figure 3. The sub bands provide
information regarding the vertical, horizontal and diagonal
B. Shadow Removal edges and are thus used to extract shape features. The sub
A cast shadow is a semi-transparent mask that has the same bands in level two are used for obtaining a better low
texture of part of background and it changes the color dimensional descriptor [9]. The LSF(local shape feature)
appearance. So, the three RGB colour components, vector is given by Eq. 6
decreases and becomes less dispersed on a shadowed
𝑖
region. This variation depends on the time of day and the 𝐹𝑠ℎ𝑎𝑝𝑒 𝑥, 𝑦 = 𝐿𝐻2𝑖 𝑥, 𝑦 + 𝐻𝐿𝑖2 𝑥, 𝑦 + 𝐻𝐻2𝑖 𝑥, 𝑦
scene geometry. By making these two distributions 1≤ x≤4,1≤y≤4 (6)
(shadow and light regions) similar, shadow removal is
done. A contrast ratio measure is used to make the Where (x,y) denotes pixel position. For each block the feature
distribution of shadow region similar to the background in size is 16.
terms of the dispersion and darkness [11]. Contrast is a
distinctive measure of the luminance distribution of an In order to make use of texture features, Oriented centre
image. The contrast is formulated as symmetric Local Binary Pattern is chosen as the descriptor
1 𝑁−1 𝑀−1 [6]. Local Binary Pattern is a robust texture descriptor, but it
𝐶= 𝑖=0 𝑗 =0 (𝐼 𝑖, 𝑗 − 𝐼𝑚 )2 (2) produces long histograms. In order to reduce the computations
𝑁𝑋𝑀
CSLBP (Centre symmetric local binary pattern) was
where N and M are the dimensions of image, is I(i,j) the developed. CSLBP reduced the computations by half that of
intensity value at the pixel (i,j) and Im is the intensity mean of LBP, but orientation of pixel is lost. Therefore, OCSLBP
image. A candidate shadow region is determined and the scheme is chosen as texture descriptor [7]. To extract an
contrast of the region without shadow Ck , the contrast of the oriented histogram of OCS-LBP from a sub block, the
region with shadow C sh is found from the above equation. gradient orientations are calculated at every pixel and
The contrast ratio C r k is then found for each color component histogram of each kth orientation in the neighborhood is
(R, G, B) as binned using Eq. 7 and Eq. 8
𝐶
𝐶𝑟 𝑘 = 𝑘 𝐶 (3)
𝑠ℎ 𝑠 𝑥 = 𝑥 𝑖𝑓 𝑥 >𝑇 (7)
By multiplying this ratio to the extracted foreground in the
respective color components, the shadow regions become 𝑘
𝑂𝐶𝑆 𝐿𝐵𝑃𝑅,𝑁 𝑥, 𝑦 = 𝑠(𝑛𝑖 − 𝑛𝑖+𝑁 ) (8)
nearly similar to the background. Foreground segmentation is 2
again performed and thus shadow is removed. Figure 2 shows 0 < i < (N/2)-1
the effect of shadow removal. K= 0, 1, 2, 3, 4, 5, 6, 7
1306
IEEE Sponsored 2nd International Conference on Innovations in Information, Embedded and Communication systems (ICIIECS)‟15
pixels is greater than a given threshold, the gradient While training, for the first linear SVM, the desired output is
orientation is confirmed. To reduce the computation time, “1”for the N training samples of the pedestrian class, and “−1”
integral histogram is used to preserve the local features [10]. for the remaining samples of other two classes. In a similar
way, other two SVMs are trained.
In the testing phase, the first linear SVM tests whether
detected object is pedestrian. If the output is -1, then the
second SVM in level two checks for two wheeler. If it is not
classified as two wheeler, then the third SVM classifies as car
or other vehicle.
Fig. 3 Sub bands of Haar DWT The performance of the proposed algorithm is tested with
the benchmark dataset collected on national highway,
The integral histogram at a given point is obtained by copying consisting of 1000 frames. The proposed approach was
the previous values and then increasing the corresponding bin implemented using MATLAB software.
with respect to the current value of the point, instead of The moving objects are segmented using background
generating all of the histograms for every given pixel. Thus subtraction. For the resulting binary image bounding box is
OCS LBP descriptor with dimension 8 is obtained. drawn and the rectangular coordinates for the moving objects
The new feature that combines the Local shape features are obtained. Shadow removal is done by multiplying the
(extracted from haar wavelet transform) and the texture foreground with the contrast ratio. Although this step
features (from OCSLBP) by concatenation is proposed to illuminates the objects of interest, this does not affect the
classify the moving object. As a result, the total number of features.
features in each block is 16 (from the LSF) + 8 (from the OCS After foreground segmentation and shadow elimination,
LBP) = 24. These features are fed to the SVM classifier for feature extraction is done, taking in to account the shape and
classification. texture of an object. In order to extract the shape features, haar
wavelets in sub band two are utilized since edge information
D. Classification can be obtained with reduced dimensions. Texture features are
Using the extracted features, classification of moving obtained using OCS LBP descriptor, from which gradient
objects in to pedestrians, two wheelers and cars is done by a magnitude and orientation can be inferred. A hierarchical SVM
linear hierarchical SVM (Support Vector Machine) classifier. classifier is trained using samples. Linear SVMs are used to
The proposed classification scheme is shown in Figure 5. reduce the computational load. The detected objects are finally
Three linear binary SVMs are used. The output of each linear classified as shown in the Fig. 7
SVM is is given as Performance is analyzed using overall accuracy rate in Eq. 10.
𝑖
𝑦𝑖 𝑥 = 𝑁 𝑘=1 𝑤𝑘 𝑥𝑘 + 𝑏𝑘 (9)
where xk= [x1, ..., xN] is the feature vector, N is the feature 𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑐𝑜𝑟𝑟𝑒𝑐𝑡𝑙𝑦 𝑐𝑙𝑎𝑠𝑠𝑖𝑓𝑖𝑒𝑑 𝑠𝑎𝑚𝑝𝑙𝑒𝑠
Accuracy rate =
dimension and wki and bk are the coefficients trained through 𝑇𝑜𝑡𝑎𝑙 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑡𝑒𝑠𝑡 𝑠𝑎𝑚𝑝𝑙𝑒𝑠
the SVM. (10)
In training phase, N samples for each of the class, like
pedestrians, cars and two wheelers are taken which totals to
3N training samples.
1307
IEEE Sponsored 2nd International Conference on Innovations in Information, Embedded and Communication systems (ICIIECS)‟15
PEDESTRIAN
OBJECT FEATURE TWO
SVM 1
IMAGE EXTRACTION WHEELER
NON
SVM2 CAR
PEDESTRIAN
NOT A TWO
SVM3
WHEELER
OTHER
VEHICLE
IV. CONCLUSION
In this paper, a combination of the new feature descriptor
(LSF+OCS LBP) with the hierarchical linear SVM classifier is
proposed for the classification of moving object. The proposed
feature descriptor provides better classification with reduced
dimensionality. Additionally, the shadow removal technique
and hierarchical SVM classifier configuration have reduced the
number of false classifications. Further improvement on this
work can be done on view independent vehicle detection in
occluded areas.
600
500
400
300
200
100
0
1308
IEEE Sponsored 2nd International Conference on Innovations in Information, Embedded and Communication systems (ICIIECS)‟15
Wavelet 3
LBP 59
ACKNOWLEDGMENT
This work has been supported by UGC under Major Research
Project Scheme entitled, „Intelligent Video Surveillance
System for Human Action Analysis‟ with reference F.No.41-
592/2012 (SR).
REFERENCES
1309