Share on Facebook Share on Twitter Email
Answers.com

Pattern recognition

 
Sci-Tech Dictionary: pattern recognition
(′pad·ərn ′rek·ig′nish·ən)

(computer science) The automatic identification of figures, characters, shapes, forms, and patterns without active human participation in the decision process.


Search unanswered questions...
Enter a question here...
Search: All sources Community Q&A Reference topics
Britannica Concise Encyclopedia: pattern recognition
Top

In computer science, the imposition of identity on input data, such as speech, images, or a stream of text, by the recognition and delineation of patterns it contains and their relationships. Stages in pattern recognition may involve measurement of the object to identify distinguishing attributes, extraction of features for the defining attributes, and comparison with known patterns to determine a match or mismatch. Pattern recognition has extensive application in astronomy, medicine, robotics, and remote sensing by satellites. See also speech recognition.

For more information on pattern recognition, visit Britannica.com.

World of the Mind: pattern recognition
Top
Informally, a pattern is defined by the common denominator among the multiple instances of an entity. For example, commonality in all fingerprint images defines the fingerprint pattern; the commonality in fingerprint images of John Doe's left index finger defines the John-Doe-left-index-fingerprint pattern (see Fig. 1 — showing a bunch of fingerprints of the same finger; and a bunch of impressions of arbitrary fingers in Fig. 2). Thus, a pattern could be a fingerprint image, a handwritten cursive word, a human face, a speech signal, a bar code, or a web page on the internet (see Fig. 3). Often, individual patterns may be grouped into a category based on their common properties; the resultant group is also a pattern and is often called a pattern class. Pattern recognition is the science of observing (sensing) the environment, learning to distinguish patterns of interest (e.g. animals) from their background (e.g. sky, trees, ground), and making sound decisions about the patterns (e.g. Fido) or pattern classes (e.g. a dog, a mammal, an animal).

1. Introduction
2. Pattern recognition and classification
3. Systems for automatic pattern recognition
4. Some challenges in pattern and class learning
5. Pattern recognition applications




Fig. 1. Examples of patterns: six fingerprints from the same finger of the same person.



Fig. 2. Examples of patterns: Fingerprints of different persons.



Fig. 3. Examples of patterns: sound wave, fingerprint, trees, face, bar code, and character images.

1. Introduction

Since our early childhood, we have been observing patterns in the objects around us (e.g. toys, flowers, pets, and faces). Learning patterns also reinforces, and is reinforced by, the acquisition of language. By the time children are 5 years old, most can recognize digits and letters. Small and large characters, handwritten and machine-printed characters, characters of different colours and orientations, and partially occluded letters — all are easily recognized by the young. We take this ability for granted until we face the task of teaching a machine how to recognize the characters. In spite of almost 50 years of research, design of general-purpose machines for pattern recognition remains an elusive goal.

Humans are the best pattern recognizers in most scenarios, yet we do not fully understand how we recognize patterns. Ross (1998) emphasizes the work of Nobel laureate Herbert Simon whose central finding is that pattern recognition is critical in most human decision-making tasks: 'The more relevant patterns at your disposal, the better your decisions will be. This is hopeful news to proponents of artificial intelligence, since computers can surely be taught to recognize patterns. Indeed, successful computer programs that help banks score credit applicants, help doctors diagnose diseases and help pilots land airplanes depend in some way on pattern recognition.'

We will first describe the area of pattern recognition in detail and relate it to the more restricted problem of pattern classification. This is followed by systems for automatic pattern recognition. In particular, we describe some methods for generalization, i.e. how the derived decision rules can be applied to new observations. Next, some aspects of pattern learning are discussed that may play a role in human learning as well. Finally, some applications are described that are already in use in various sectors of our society.

2. Pattern recognition and classification

Pattern recognition aims to make the process of learning and detection of patterns explicit, such that it can be partially or entirely implemented on computers. Automatic (machine) recognition, description, and classification (grouping of patterns into pattern classes) have become important problems in a variety of engineering and scientific disciplines such as biology, psychology, medicine, marketing, computer vision, artificial intelligence, and remote sensing. In almost any area of science in which observations are studied but the underlying mathematical or statistical models are not available, pattern recognition can be used to support human concept acquisition or decision making. Given a group of objects, there are two ways to build a classification or recognition system (Watanabe 1985): supervised, i.e. with a teacher, or unsupervised, without the help of a teacher (see Fig. 4).

Interest in pattern recognition has been renewed recently due to emerging applications which are not only challenging but also computationally more demanding, such as data mining, document classification, organization and retrieval of multimedia databases, and biometric authentication (i.e. face recognition and fingerprint matching).



Fig. 4. a. Supervised pattern recognition deals with classifying objects with (known) different labels. b. In unsupervised pattern recognition, classes or subclasses have to be derived from the data.

3. Systems for automatic pattern recognition

Rapid advances in computing technology not only enable us to process huge amounts of data, but also facilitate the use of elaborate and diverse methods for data analysis and classification. At the same time, demands on automatic pattern recognition systems are rising enormously due to the availability of large databases and stringent performance requirements (faster recognition speed and higher accuracy at a lower cost). In many emerging applications, it is clear that no single approach for classification is 'optimal' and multiple methods and approaches have to be used. Consequently, combining several sensing modalities and classifiers is now a common practice in pattern recognition.

The design of a pattern recognition system essentially involves the following four aspects: (i) data acquisition and pre-processing, e.g. taking a picture of an object and removing the irrelevant background; (ii) data representation, e.g. deriving relevant object properties (like its size, shape, and colour) which efficiently offer pertinent information needed for pattern recognition; (iii) training, e.g. imparting pattern class definition into the system, often by showing a few typical examples of the pattern; and (iv) decision making that involves finding the pattern class or pattern description of new, unseen objects based on a training set of examples. The application domain dictates the choice of sensor(s), pre-processing technique, representation scheme, and decision-making model. It is generally agreed that a well-defined and sufficiently constrained classification problem will lead to a compact pattern representation and a simple decision-making strategy. Learning from a set of examples (training set) is an important and desired characteristic of most pattern recognition systems, in contrast with systems consisting of handcrafted decision rules only.

The five major approaches for pattern recognition are summarized below (Jain, Duin, and Mao 2000).Template matching. Objects are directly compared with a few stored examples or prototypes that are representative of the underlying classes. Because of the large variations often encountered in these examples, template matching is not the most effective approach to pattern recognition.Geometrical classification. Classes are represented by regions in the representation space (e.g. a feature space as in Fig. 5) defined by simple functions such that the training examples are classified as correctly as possible. Suppose the average value of (height, weight) of women is (1.6 m, 57 kg) (5′5″, 125 lb) and that of men is (1.7 m, 71 kg) (5′11″, 157 lb). A simple geometric woman vs. man classifier using (height, weight) as a two-dimensional representation may simplistically divide the representation space into two triangular regions (similar to Fig. 4a). So, a person with (height, weight) = (1.5 m, 55 kg) will be classified by this classifier as a woman.Statistical classification. Continuing with the foregoing example, a statistical classifier may estimate the statistical distribution of the two features, namely height and weight of the two classes of interest (women and men), from known samples. At any coordinate or point in the representation space, one could estimate the likelihood of it being a man or a woman; depending upon which likelihood is higher, one could determine the class of an entity. This method differs from the geometrical method in that the classes are not (pre)defined in terms of any regular shapes in the representation space.Syntactic or structural matching. The height and weight representation space is too simplistic and it is conceivable that a person's body shape is a better representation for determining his or her gender. One could decompose the shape of a person into component parts and describe the shape in terms of component parts and their relationships (e.g. how they are attached to each other). Now, the determination of gender could be performed based on either the shapes of the individual body parts or their relationships, or both. In a syntactic or structural approach, a complex pattern (e.g. animal) is described in terms of component patterns (e.g. hair and head, or torso and limbs) and their relationship (e.g. articulated joints). Strategies for learning such a language (defining the structure) from examples are problematic, as it is essentially difficult to compensate for noise (see Fu 1983, also Perlovsky 1998).Artificial neural networks. These networks attempt to apply the models of biological neural systems to solve practical pattern recognition problems. This approach has become so popular that the use of neural networks for solving pattern recognition problems has become an area on its own, and is often studied outside the biological context, e.g. see the books by Bishop (1995) and Ripley (1996).

It is interesting to compare these approaches for automatic pattern recognition with the various ways the human learning process may be modelled: simulation of the neural system itself, or simulation of the processes in that system, based either on direct information from the senses (as in the statistical and the geometrical approaches) or on higher-level symbolic information (as in the structural approach). The template-matching procedure can be compared with learning by storing all facts without understanding them.



Fig. 5. Example of four objects represented by two features (area and perimeter) in a two-dimensional feature space.

4. Some challenges in pattern and class learning

Selection of training sets. If we want to learn from examples, care should be paid to the way the examples are selected. For instance, a system for the recognition of electrocardiograms (say, into normal heart vs. diseased heart) can be based on examples collected in hospitals, on examples collected in a general screening test, on typical cardiograms that are clear examples of particular classes of heart problems, or on selected cardiograms that are the border cases between these classes. The choice of such a strategy is strongly related to the learning approach to be used and to the way the recognition system can be used.Representation of objects. There are various ways to represent objects: raw data measurements (e.g. overall height, overall weight), derived measurements or features (e.g. ratio of height to weight), a structural description (e.g. height to weight ratios of parts of bodies and spatial relationship of the body parts), etc. In the statistical approach, the feature representation is the most common. For the recognition of simple real-world objects, the features can be their sizes, shapes, colours, etc. More features do not necessarily imply a better classification performance. Given a representation scheme, an objective measure (e.g. 'distance' or 'score') needs to be defined to quantify the (dis)similarity between any two representations.Inter-and intra-class distances. A direct and intuitive way to see whether a feature representation is good for a classification problem is to compare the inter-class distances (e.g. between the two sets of pictures of two different persons) with the intra-class distances (e.g. between all pictures of a single person). If the inter-class distances are much larger than the intra-class distances, the classification problem is easy. If they are of similar orders, either the classes overlap, or a more advanced procedure is needed to separate the classes. Obviously, a representation with large inter-class variability and small intra-class variability is desirable. See Fig. 7 for an illustration on inter-and intra-class distances.Invariance of representation. Some object variations may not be important for the classification task, e.g. the size of a character, the angle (pose) at which a face is observed, the speed by which a word is spoken. These variations may influence the representation so that the position of the object in feature space is changed. An important problem is how to identify and extract these so-called invariants. We can collect objects under all possible variations, which is expensive. A preferred approach is to use invariant features.The problem of overtraining. An overly complex pattern recognition system may learn unnecessary details of training samples of a pattern and, consequently, will be unable to recognize the essential commonality defining the pattern. It is necessary to adapt the complexity of the recognition system to the complexity and size of the data set under consideration.



Fig. 6. Design of a pattern recognition system.



Fig. 7. Different faces of the same person, or different persons? a. Faces belonging to two different people. b. Multiple faces of the same person.

5. Pattern recognition applications

Pattern recognition is used in any area of science and engineering that studies the structure of observations. It is now frequently used in many applications in manufacturing industry, healthcare, and the military. Examples include the following.

Optical character recognition (OCR) is becoming an integral part of document scanners, and is also used frequently in banking and postal applications. Printed characters can now be accurately recognized, and the improving performance of automatic recognition of handwritten cursive characters has diminished significantly the need of human interaction for OCR tasks.

Automatic speech recognition is very important for user interaction with machines. Commercial systems for automatic response to flight queries, telephone directory assistance, and telebanking are available. Often the systems are tuned to a specific speaker for better recognition accuracy.

Computer vision deals with the recognition of objects as well as the identification and localization of their three-dimensional environments. This capability is required, for example, by robots to operate in dynamic or unknown environments. This can be useful for applications ranging from manufacturing to household cleaning, and even for rescue missions.

Personal identification systems that use biometrics are very important for security applications in airports, ATMs, shops, hotels, and secure computer access. Recognition can be based on face, fingerprint, iris, or voice, and can be combined with the automatic verification of signatures and PIN codes.

Recognition of objects on earth from the sky (by satellites) or from the air (by aeroplanes and cruise missiles) is called remote sensing. It is important for cartography, agricultural inspection, detection of minerals and pollution, and target recognition.

Many tests for medical diagnosis utilize pattern recognition systems, from counting blood cells and recognition of cell tissues through microscopes to the detection of tumours in magnetic resonance scans and the inspection of bones and joints in X-ray images.

Many large databases are stored on the repositories accessible via the internet or otherwise in local computers. They may have a clear structure such as bank accounts, a weak structure such as consumer behaviour, or no obvious structure such as a collection of images. Procedures for finding desired items (database retrieval) as well as learning or discovering structures in databases (data mining) are becoming more and more important. Web search engines and recommender systems are two example applications.

(Published 2004)

— Anil K. Jain/Robert P. W. Duin

    Bibliography
  • Bishop, C. M. (1995). Neural Networks for Pattern Recognition.
  • Duda, R. O., Hart, P. E., and Stork, D. G. (2001). Pattern Classification and Scene Analysis (2nd edn.).
  • Fu, K. S. (1983). 'A step towards unification of syntactic and statistical pattern recognition'. IEEE Transactions on Pattern Analysis and Machine Intelligence, 5/2.
  • Jain, A. K., Duin, R. P. W., and Mao, J. (2000). 'Statistical pattern recognition: a review'. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22/1.
  • Perlovsky, L. I. (1998). 'Conundrum of combinatorial complexity'. IEEE Transactions on Pattern Analysis and Machine Intelligence, 20/6.
  • Picard, R. (1997). Affective Computing.
  • Ripley, B. (1996). Pattern Recognition and Neural Networks.
  • Ross, P. E. (1998). Flash of Genius.
  • Watanabe, S. (1985). Pattern Recognition: Human and Mechanical.


Wikipedia: Pattern recognition
Top

Pattern recognition is "the act of taking in raw data and taking an action based on the category of the pattern"[1]. Most research in pattern recognition is about methods for supervised learning and unsupervised learning.

Pattern recognition aims to classify data (patterns) based either on a priori knowledge or on statistical information extracted from the patterns. The patterns to be classified are usually groups of measurements or observations, defining points in an appropriate multidimensional space. This is in contrast to pattern matching, where the pattern is rigidly specified.

Contents

Overview

A complete pattern recognition system consists of a sensor that gathers the observations to be classified or described, a feature extraction mechanism that computes numeric or symbolic information from the observations, and a classification or description scheme that does the actual job of classifying or describing observations, relying on the extracted features.

The classification or description scheme is usually based on the availability of a set of patterns that have already been classified or described. This set of patterns is termed the training set, and the resulting learning strategy is characterized as supervised learning. Learning can also be unsupervised, in the sense that the system is not given an a priori labeling of patterns, instead it itself establishes the classes based on the statistical regularities of the patterns.

The classification or description scheme usually uses one of the following approaches: statistical (or decision theoretic) or syntactic (or structural). Statistical pattern recognition is based on statistical characterizations of patterns, assuming that the patterns are generated by a probabilistic system. Syntactical (or structural) pattern recognition is based on the structural interrelationships of features. A wide range of algorithms can be applied for pattern recognition, from simple naive Bayes classifiers to powerful neural networks.

Pattern recognition is more complex when templates are used to generate variants. For example, in English, sentences often follow the "N-VP" (noun - verb phrase) pattern, but some knowledge of the English language is required to detect the pattern. Pattern recognition is studied in many fields, including psychology, ethology, cognitive science and computer science.

Holographic associative memory is another type of pattern matching scheme where a target small patterns can be searched from a large set of learned patterns based on cognitive meta-weight.

Uses

The face was automatically detected by special software.

Within medical science, pattern recognition is the basis for computer-aided diagnosis (CAD) systems. CAD describes a procedure that supports the doctor's interpretations and findings.

Typical applications are automatic speech recognition, classification of text into several categories (e.g. spam/non-spam email messages), the automatic recognition of handwritten postal codes on postal envelopes, or the automatic recognition of images of human faces. The last two examples form the subtopic image analysis of pattern recognition that deals with digital images as input to pattern recognition systems.[2][3]

See also

Further reading

  • Keinosuke Fukunaga, (1990) Introduction to Statistical Pattern Recognition, (2nd edition), Academic Press, ISBN 0-12-269851-7.
  • Christopher M. Bishop, (2006) Pattern Recognition and Machine Learning, Springer, ISBN 0-387-31073-8.
  • Sergios Theodoridis, Konstantinos Koutroumbas, (2009) Pattern Recognition (4th edition), Elsevier, ISBN 978-1-59749-272-0.
  • Phiroz Bhagat, (2005) Pattern Recognition in Industry Elsevier, ISBN 0-08-044538-1.
  • Dietrich Paulus and Joachim Hornegger (1998) Applied Pattern Recognition (2nd edition), Vieweg. ISBN 3-528-15558-2
  • Juergen Schuermann: Pattern Classification: A Unified View of Statistical and Neural Approaches, Wiley&Sons, 1996, ISBN 0-471-13534-8
  • Sholom Weiss and Casimir Kulikowski (1991) Computer Systems That Learn, Morgan Kaufmann. ISBN 1-55860-065-5

References

  1. ^ Richard O. Duda, Peter E. Hart, David G. Stork (2001) Pattern classification (2nd edition), Wiley, New York, ISBN 0-471-05669-3
  2. ^ Richard O. Duda, Peter E. Hart, David G. Stork (2001) Pattern classification (2nd edition), Wiley, New York, ISBN 0-471-05669-3
  3. ^ R. Brunelli, Template Matching Techniques in Computer Vision: Theory and Practice, Wiley, ISBN 978-0-470-51706-2, 2009 ([1] TM book)

External links

This article was originally based on material from the Free On-line Dictionary of Computing, which is licensed under the GFDL.


 
 

 

Copyrights:

Sci-Tech Dictionary. McGraw-Hill Dictionary of Scientific and Technical Terms. Copyright © 2003, 1994, 1989, 1984, 1978, 1976, 1974 by McGraw-Hill Companies, Inc. All rights reserved.  Read more
Britannica Concise Encyclopedia. Britannica Concise Encyclopedia. © 2006 Encyclopædia Britannica, Inc. All rights reserved.  Read more
World of the Mind. The Oxford Companion to the Mind. Second Edition. Copyright © Oxford University Press, 2004. All rights reserved.  Read more
Wikipedia. This article is licensed under the Creative Commons Attribution/Share-Alike License. It uses material from the Wikipedia article "Pattern recognition" Read more