Complex Feature Space

It can happen that the variability that makes it difficult to distinguish patterns is due to complexity rather than noise. For example, it would make no sense to try to use a nearest-distance clasifier to detect syntactic errors in expressions in a programming language.

Similar problems arise in classifying sensory data. Whenever the patterns can be subject to well-defined transformations such as translation or rotation, there is a danger of introducing a large amount of complexity into a primitive feature space. In general, one should employ features that are invariant to such transformations, rather than forcing the classifier to handle them.

In human communication, larger patterns are frequently composed of smaller paterns, which in turn are composed of smaller patterns. For example, sentences are made up of words which are made up of letters which are made up of strokes. Designing classifiers to cope with this level of complexity is definitely beyond the scope of these notes.

Back to Subclasses Up to Limitations