Sei sulla pagina 1di 4

New features Log in / create account

Article Discussion Read Edit Search

Template matching
From Wikipedia, the free encyclopedia

Template matching[1] is a technique in digital image processing for finding small parts of an image which match a template
Main page image. It can be used in manufacturing as a part of quality control,[2] a way to navigate a mobile robot,[3] or as a way to detect
Contents edges in images.[4]
Featured content
Current events Contents [hide]
Random article 1 Approach
Donate 2 Feature-based approach
3 Template-based approach
Interaction
4 Motion tracking and occlusion handling
About Wikipedia
5 Template-based matching and convolution
Community portal
6 Example
Recent changes
7 Implementation
Contact Wikipedia
8 Speeding up the Process
Help
9 Improving the accuracy of the matching
Toolbox 10 Similar Methods
11 Examples of Use
Print/export
12 References
Languages 13 See also
日本語 14 External links

Approach [edit]

Template matching can be subdivided between two approaches: feature-based and template-based matching. The feature-based
approach uses the features of the search and template image, such as edges or corners, as the primary match-measuring
metrics to find the best matching location of the template in the source image. The template-based, or global, approach, uses
the entire template, with generally a sum-comparing metric (using SAD, SSD, cross-correlation, etc.) that determines the best
location by testing all or a sample of the viable test locations within the search image that the template image may match up to.

Feature-based approach [edit]

If the template image has strong features, a feature-based approach may be considered; the approach may prove further useful if
the match in the search image might be transformed in some fashion. Since this approach does not consider the entirety of the
template image, it can be more computationally efficient when working with source images of larger resolution, as the alternative
approach, template-based, may require searching potentially large amounts of points in order to determine the best matching
location[5].

Template-based approach [edit]

For templates without strong features, or for when the bulk of the template image constitutes the matching image, a template-
based approach may be effective. As aforementioned, since template-based template matching may potentially require sampling
of a large number of points, it is possible to reduce the number of sampling points by reducing the resolution of the search and
template images by the same factor and performing the operation on the resultant downsized images (multiresolution, or
pyramid, image processing), providing a search window of data points within the search image so that the template does not
have to search every viable data point, or a combination of both.

Motion tracking and occlusion handling [edit]

In instances where the template may not provide a direct match, it may be useful to implement the use of eigenspaces –
templates that detail the matching object under a number of different conditions, such as varying perspectives, illuminations,
color contrasts, or acceptable matching object “poses”. For example, if the user was looking for a face, the eigenspaces may
consist of images (templates) of faces in different positions to the camera, in different lighting conditions, or with different
expressions.
It is also possible for the matching image to be obscured, or occluded by an object; in these cases, it unreasonable to provide a
multitude of templates to cover each possible occlusion. For example, the search image may be a playing card, and in some of
the search images, the card is obscured by the fingers of someone holding the card, or by another card on top of it, or any
object in front of the camera for that matter. In cases where the object is malleable or poseable, motion also becomes a problem,
and problems involving both motion and occlusion become ambiguous[6]. In these cases, one possible solution is to divide the
template image into multiple sub-images and perform matching on each subdivision.

Template-based matching and convolution [edit]

A basic method of template matching uses a convolution mask (template), tailored to a specific feature of the search image,
which we want to detect. This technique can be easily performed on grey images or edge images. The convolution output will be
highest at places where the image structure matches the mask structure, where large image values get multiplied by large mask

converted by Web2PDFConvert.com
values.
This method is normally implemented by first picking out a part of the search image to use as a template: We will call the
search image S(x, y), where (x, y) represent the coordinates of each pixel in the search image. We will call the template T(x t, y
t), where (xt, yt) represent the coordinates of each pixel in the template. We then simply move the center (or the origin) of the
template T(x t, y t) over each (x, y) point in the search image and calculate the sum of products between the coefficients in S(x,
y) and T(xt, yt) over the whole area spanned by the template. As all possible positions of the template with respect to the search
image are considered, the position with the highest score is the best position. This method is sometimes referred to as 'Linear
Spatial Filtering' and the template is called a filter mask.
For example, one way to handle translation problems on images, using template matching is to compare the intensities of the
pixels, using the SAD (Sum of absolute differences) measure.
A pixel in the search image with coordinates (xs, ys) has intensity Is(xs, ys) and a pixel in the template with coordinates (xt, yt)
has intensity It(xt , yt ). Thus the absolute difference in the pixel intensities is defined as Diff(xs, ys, x t, y t) = | Is(xs, ys) – It(x
t, y t) |.

The mathematical representation of the idea about looping through the pixels in the search image as we translate the origin of
the template at every pixel and take the SAD measure is the following:

Srows and Scols denote the rows and the columns of the search image and Trows and Tcols denote the rows and the columns
of the template image, respectively. In this method the lowest SAD score gives the estimate for the best position of template
within the search image. The method is simple to implement and understand, but it is one of the slowest methods.

Example [edit]

+ =

Implementation [edit]

In this simple implementation, it is assumed that the above described method is applied on grey images: This is why Grey is
used as pixel intensity. The final position in this implementation gives the top left location for where the template image best
matches the search image.

minSAD = VALUE_MAX;
// loop through the search image
for ( int x = 0; x <= S_rows - T_rows; x++ ) {
for ( int y = 0; y <= S_cols - T_cols; y++ ) {
SAD = 0.0;
// loop through the template image
for ( int i = 0; i < T_rows; i++ )
for ( int j = 0; j < T_cols; j++ ) {
pixel p_SearchIMG = S[x+i][y+j];
pixel p_TemplateIMG = T[i][j];
SAD += abs( p_SearchIMG.Grey - p_TemplateIMG.Grey );
}
}
// save the best found position
if ( minSAD > SAD ) {
minSAD = SAD;
// give me VALUE_MAX
position.bestRow = x;
position.bestCol = y;
position.bestSAD = SAD;
}

converted by Web2PDFConvert.com
}

One way to perform template matching on color images is to decompose the pixels into their color components and measure the
quality of match between the color template and search image using the sum of the SAD computed for each color separately.

Speeding up the Process [edit]

In the past, this type of spatial filtering was normally only used in dedicated hardware solutions because of the computational
complexity of the operation[7], however we can lessen this complexity by filtering it in the frequency domain of the image,
referred to as 'frequency domain filtering,' this is done through the use of the convolution theorem.
Another way of speeding up the matching process is through the use of an image pyramid. This is a series of images, at different
scales, which are formed by repeatedly filtering and subsampling the original image in order to generate a sequence of reduced
resolution images[8]. These lower resolution images can then be searched for the template (with a similarly reduced resolution),
in order to yield possible start positions for searching at the larger scales. The larger images can then be searched in a small
window around the start position to find the best template location.
Other methods can handle problems such as translation, scale and image rotation.[9] [10]

Improving the accuracy of the matching [edit]

Improvements can be made to the matching method by using more than one template (eigenspaces), these other templates can
have different scales and rotations.
It is also possible to improve the accuracy of the matching method by hybridizing the feature-based and template-based
approaches[11]. Naturally, this requires that the search and template images have features that are apparent enough to support
feature matching.

Similar Methods [edit]

Other methods which are similar include 'Stereo matching', 'Image registration' and 'Scale-invariant feature transform'.

Examples of Use [edit]

Template matching has various different applications and is used in such fields as face recognition (see facial recognition
system) and medical image processing. Systems have been developed and used in the past to count the number of faces that
walk across part of a bridge within a certain amount of time. Other systems include automated calcified nodule detection within
digital chest X-rays.[12]

References [edit]
1. ^ R. Brunelli, Template Matching Techniques in Computer Vision: Theory and Practice, Wiley, ISBN 978-0-470-51706-2, 2009
([1] TM book)
2. ^ Aksoy, M. S., O. Torkul, and I. H. Cedimoglu. "An industrial visual inspection system that uses inductive learning." Journal of
Intelligent Manufacturing 15.4 (August 2004): 569(6). Expanded Academic ASAP. Thomson Gale.
3. ^ Kyriacou, Theocharis, Guido Bugmann, and Stanislao Lauria. "Vision-based urban navigation procedures for verbally instructed
robots." Robotics and Autonomous Systems 51.1 (April 30, 2005): 69-80. Expanded Academic ASAP. Thomson Gale.
4. ^ WANG, CHING YANG, Ph.D. "EDGE DETECTION USING TEMPLATE MATCHING (IMAGE PROCESSING, THRESHOLD LOGIC,
ANALYSIS, FILTERS)". Duke University, 1985, 288 pages; AAT 8523046
5. ^ Li, Yuhai, L. Jian, T. Jinwen, X. Honbo. “Afast rotated template matching based on point feature.” Proceedings of the SPIE 6043
(2005): 453-459. MIPPR 2005: SAR and Multispectral Image Processing.
6. ^ F. Jurie and M. Dhome. Real time robust template matching. In British Machine Vision Conference, pages 123–131, 2002.
7. ^ Gonzalez, R, Woods, R, Eddins, S "Digital Image Processing using Matlab" Prentice Hall, 2004
8. ^ E. H. Adelson, C. H. Anderson, J. R. Bergen, P. J. Burt and J. M. Ogden, Pyramid methods in image processing
http://web.mit.edu/persci/people/adelson/pub_pdfs/RCA84.pdf
9. ^ Yuan, Po, M.S.E.E. "Translation, scale, rotation and threshold invariant pattern recognition system". The University of Texas at
Dallas, 1993, 62 pages; AAT EP13780
10. ^ H. Y. Kim and S. A. Araújo, "Grayscale Template-Matching Invariant to Rotation, Scale, Translation, Brightness and Contrast,"
IEEE Pacific-Rim Symposium on Image and Video Technology, Lecture Notes in Computer Science, vol. 4872, pp. 100-113, 2007.
11. ^ C. T. Yuen, M. Rizon, W. S. San, and T. C. Seong. “Facial Features for Template Matching Based Face Recognition.” American J. of
Engineering and Applied Sciences 3 (1): 899-903, 2010.
12. ^ AshleyAberneithy. "Automatic Detection of Calcified Nodules of Patients with Tuberculous". University College London, 2007

See also [edit]

Facial recognition system


Pattern recognition
Computer vision
Elastic Matching

External links [edit]

Template Matching
Visual Object Recognition using Template Matching
Rotation, scale, translation-invariant template matching demonstration program
perspective-invariant template matching

Categories: Image processing

converted by Web2PDFConvert.com
This page was last modified on 3 August 2010 at 14:23.
Text is available under the Creative Commons Attribution-ShareAlike License; additional terms may apply. See Terms of Use for details.
Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization.
Contact us

Privacy policy About Wikipedia Disclaimers

converted by Web2PDFConvert.com

Potrebbero piacerti anche