Andy Zeng
Iris Recognition

Iris Recognition (Proposal)

For this project, we will explore and implement various computer vision techniques to achieve reasonable accuracy in iris verification and identification. A general outline of the proposed approach, by stage:
Iris localization – locate the inner and outer boundaries of the iris with edge detection algorithms (e.g. first-order gradient components), explore mathematical operators to isolate the iris and pupil respectively (recover circular geometry via homography), and achieve reasonable accuracy w.r.t. noise and computation time.
Iris segmentation – transform the iris texture from Cartesian to polar coordinates (a rectangular representation) to retrieve detailed frequency information that facilitates feature extraction. Explore other methods of delineating the analysis of circular iris geometry (e.g. octant segmentation).

Feature extraction and matching – for testing after training over a known dataset, obtain and compare feature vectors using one or more of the following methods:
- Textons (edge orientation, pixel intensity, histograms, etc.)
- Decision trees (local feature arrangements, shape)
- Hamming distance (Fourier transform, binary filters, etc.)
- Etc…
Depending on the techniques used and their performance, we will apply various machine learning algorithms to organize the data.
MMU Iris Dataset (Alternative download link: MMU1, MMU2)

Abstract

Iris recognition is a form of biometric identification performed with computer vision. The iris is an internal organ whose texture is randomly determined during embryonic gestation. It is flat, and its texture is relatively stable, as the iris is protected from external harm. All of these properties make it a very viable form of biometric recognition. We use an ellipse-fitting technique for iris segmentation, rectify the irises to rectangular coordinates, and feed the rectified images into a feature extractor. Using features such as HOG and DAISY with a random forest classifier, we obtain a 0% error rate on the MMU dataset.

Introduction and Related Work

The iris is a structure in the eye that is responsible for controlling the size of the pupil; it is also what gives the eye its color. It is unique to a given person and does not change over time. [6] Therefore, it is possible to identify individuals based solely on images of their irises. Iris recognition can be divided into several stages. Given an image of an eye, we first perform iris localization (section 4), which gives us the coordinates and dimensions of the iris in the image. The next step is iris segmentation (section 5), which extracts the iris from the image and converts it into a rectangle. This rectangular image is used in the final stage, iris recognition (section 8).

State-of-the-art iris segmentation techniques currently use Fourier methods to account for eyelid occlusions, along with statistical methods to account for eyelashes. [2] One family of features uses the Hamming distance between 2-D Gabor wavelet responses to uniquely identify irises. [1] [2] Other methods use circular symmetric filters to extract features. [3]

Dataset

We chose to work with the Multimedia University (MMU) iris database, which contains a total of 450 images: 5 images per iris, 2 irises per subject. All images were taken using the LG IrisAccess 2200 at a range of 7-25 centimeters. We chose this particular dataset over the others we found online for the following reasons:
1. It was free.
2. Due to privacy issues, most iris datasets require lengthy registration processes, official paperwork, and administrative contacts. However, we had no trouble acquiring this dataset within a few days.
3. Most datasets offer 3 or fewer images per iris. This particular dataset provides 5 images per iris, giving our machine learning algorithms more data to train and test on.
The largest drawback of this dataset was the low resolution of the iris images. After localization, irises in the MMU dataset have radii of approximately 30 pixels, whereas state-of-the-art equipment captures iris radii from 80 up to 130+ pixels. This undoubtedly affected the results obtained from the particular feature extraction methods used in this project.

Localization

We broke localization into two stages: pupil localization and iris localization. The visual consistency of pupil color and contour allows for reliable methods of computing the center of the iris. Many iris recognition papers explore a variety of pupil localization algorithms; some methods trade off accuracy for speed, and vice versa. Since running time was not a concern for our application, we approached this problem with circular Hough transforms rather than convolution. Taking advantage of the pupil's low intensity, we can reasonably filter out everything but the darkest portions of the original grayscale image (pupil, eyelashes, eyebrows, moles), generating a binary image based on a low pixel-intensity threshold. After binarization, the blob representing the pupil is typically much larger than any detected noise (eyelashes, eyebrows, moles, etc.), so the noise can be filtered out of the binary image by removing all blobs whose pixel area falls below some reasonable value. Then, applying a circular Hough transform to the binary image with a reasonable estimate of the pupil radius range, we take the circle that produces the largest accumulator peak over the isolated pupil blob, giving us a very good estimate of the pupil's radius and location.
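A minimal sketch of this pupil-localization stage, assuming OpenCV; the threshold value, blob-area cutoff, radius range, and Hough parameters below are illustrative assumptions rather than the values used in our implementation:

    import cv2
    import numpy as np

    def localize_pupil(gray, intensity_thresh=60, min_blob_area=500,
                       radius_range=(15, 45)):
        # Keep only the darkest pixels (pupil, eyelashes, eyebrows, moles).
        _, binary = cv2.threshold(gray, intensity_thresh, 255,
                                  cv2.THRESH_BINARY_INV)

        # Remove small blobs by connected-component area filtering, leaving
        # (ideally) just the pupil blob.
        n, labels, stats, _ = cv2.connectedComponentsWithStats(binary)
        cleaned = np.zeros_like(binary)
        for i in range(1, n):  # label 0 is the background
            if stats[i, cv2.CC_STAT_AREA] >= min_blob_area:
                cleaned[labels == i] = 255

        # Circular Hough transform over the cleaned binary image; the
        # strongest accumulator peak gives the pupil center and radius.
        circles = cv2.HoughCircles(cleaned, cv2.HOUGH_GRADIENT, dp=1,
                                   minDist=gray.shape[0],
                                   param1=100, param2=10,
                                   minRadius=radius_range[0],
                                   maxRadius=radius_range[1])
        if circles is None:
            return None
        x, y, r = circles[0, 0]
        return int(x), int(y), int(r)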

Following a slight Gaussian blur, we applied Canny edge detection to the original grayscale image to pick up the edges between the iris and the sclera. We then projected rays outward from the center of the pupil in the directions 0 to -45 degrees and -135 to -180 degrees (because the edges of the iris are more likely to be located in those regions). The first edge each ray hit beyond the pupil radius was saved as a point of interest. Once the points of interest were collected, the best-fit ellipse was computed mathematically. Any simple ellipse (ignoring orientation) follows the equation (x - x1)^2 / a^2 + (y - y1)^2 / b^2 = 1. Since the center (x1, y1) is fixed at the pupil center, we defined our optimization objective as a function of the two remaining parameters, a and b (the semi-axes along x and y respectively), returning the sum of squared distances between the points of interest and the projected ellipse computed from those values. This function returns a smaller value the more closely a particular ellipse fits the set of points of interest, so the a and b values at a local minimum give us the best-fit ellipse for the point set. Because the ratio of the pupil radius to the iris radius is typically 0.4 to 0.8, we penalize points of interest within this objective based on how far they fall outside this range. Across the entire MMU dataset, this method of iris localization worked very well.
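A sketch of this ellipse fit with the center held fixed at the pupil center. For tractability it approximates the point-to-ellipse distance by comparing each point of interest against the ellipse radius along that point's angular direction, and it applies the 0.4-0.8 radius-ratio constraint as a soft penalty on the fitted axes rather than on individual points; the penalty weight and choice of optimizer are illustrative assumptions:

    import numpy as np
    from scipy.optimize import minimize

    def fit_iris_ellipse(points, center, pupil_radius, a0=60.0, b0=60.0):
        cx, cy = center
        dx, dy = points[:, 0] - cx, points[:, 1] - cy
        theta = np.arctan2(dy, dx)        # direction of each interest point
        dist = np.hypot(dx, dy)           # distance from the pupil center

        def cost(params):
            a, b = np.abs(params)         # keep the semi-axes positive
            # Radius of the candidate ellipse along each point's direction.
            r_ellipse = (a * b) / np.sqrt((b * np.cos(theta)) ** 2 +
                                          (a * np.sin(theta)) ** 2)
            residual = np.sum((dist - r_ellipse) ** 2)
            # Soft penalty when the pupil/iris ratio leaves [0.4, 0.8].
            ratio = pupil_radius / max(a, b)
            penalty = 1e4 * (max(0.0, 0.4 - ratio) +
                             max(0.0, ratio - 0.8)) ** 2
            return residual + penalty

        res = minimize(cost, x0=[a0, b0], method="Nelder-Mead")
        return np.abs(res.x)              # best-fit semi-axes (a, b)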

Segmentation

Taking the computed ellipse of the iris, we first transformed the image so that the ellipse becomes a circle (as a quick hack to remove subtle camera distortions), and then unwrapped the iris from Cartesian to polar coordinates, isolating it from the rest of the image as a rectangle. Sorting the pixel intensities within the iris image and removing the extreme values also filters out additional noise caused by skin and eyelashes. This gives us a clean rectangular image of the iris to work with for feature extraction and comparison.
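A minimal sketch of the unwrapping and noise-trimming steps: sample the iris annulus over a (radius, angle) grid and lay the samples out on a rectangle, then clip extreme intensities. The output dimensions, nearest-neighbor sampling, and percentile cutoffs are illustrative assumptions:

    import numpy as np

    def unwrap_iris(gray, center, r_pupil, r_iris, n_theta=360, n_r=64):
        cx, cy = center
        thetas = np.linspace(0.0, 2.0 * np.pi, n_theta, endpoint=False)
        radii = np.linspace(r_pupil, r_iris, n_r)

        # For every (radius, angle) pair, read the nearest source pixel.
        rect = np.zeros((n_r, n_theta), dtype=gray.dtype)
        for i, r in enumerate(radii):
            xs = np.clip((cx + r * np.cos(thetas)).astype(int),
                         0, gray.shape[1] - 1)
            ys = np.clip((cy + r * np.sin(thetas)).astype(int),
                         0, gray.shape[0] - 1)
            rect[i] = gray[ys, xs]
        return rect  # rows: pupil boundary -> iris boundary; cols: angle

    def remove_intensity_extremes(rect, low_pct=5, high_pct=95):
        # Clip the brightest/darkest pixels (skin and eyelash noise).
        lo, hi = np.percentile(rect, [low_pct, high_pct])
        return np.clip(rect, lo, hi)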

Preliminary Results

Using a basic set of features, we ran some preliminary tests on the dataset with a 1-NN classifier. The results are shown below.
Features                                 Error rate
Raw pixels                               0.767
5x5 blocks of average pixel intensities  0.869
HOG                                      0.119
No rejection policy was explored in these preliminary tests.
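A sketch of the preliminary 1-NN experiment with HOG features; the HOG parameters and the scikit-image/scikit-learn calls are illustrative choices, not our exact configuration:

    import numpy as np
    from skimage.feature import hog
    from sklearn.neighbors import KNeighborsClassifier

    def one_nn_error(train_imgs, y_train, test_imgs, y_test):
        # Extract a HOG descriptor from each rectified iris image.
        extract = lambda im: hog(im, orientations=9, pixels_per_cell=(8, 8),
                                 cells_per_block=(2, 2))
        X_train = np.array([extract(im) for im in train_imgs])
        X_test = np.array([extract(im) for im in test_imgs])
        clf = KNeighborsClassifier(n_neighbors=1).fit(X_train, y_train)
        return 1.0 - clf.score(X_test, y_test)  # error rate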

Hamming Distance

Another approach we used was inspired by John Daugman's influential paper, "How Iris Recognition Works". [1] We created an IrisCode for an iris by calculating the gradient of the iris image and, for each pixel, mapping the gradient direction into two bits representing the quadrant it falls in. We generated an IrisCode for each subject using their first image; this served as the subject's unique identifier. To classify whether a given iris matched a known iris, we calculated the Hamming distance (the number of differing bits) between their IrisCodes. If the distance was below a threshold, we declared the irises a match.

To determine an appropriate threshold, we trained with two iris images per subject. We defined a utility function over pair outcomes, f = {tp: +1, tn: +1, fp: -5, fn: -1}, and found the threshold that maximized the total utility over all pairs of irises. Once we found a threshold, we tested on the remaining two iris images per subject. Using this method, we attained precision = 1.0 and recall = 0.848, for an F1 of 0.918. Importantly, we had no false positives, which would constitute a security breach. However, recall can definitely be improved. Upon inspecting the false negatives, we found that most of them had skin and eyelash artifacts from segmentation. We believe that removing these sources of noise should yield a large improvement in performance.
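A sketch of this pipeline: quantize each pixel's gradient direction into one of four quadrants (two bits per pixel), compare codes by Hamming distance, and choose the threshold that maximizes the utility function over training pairs. The Sobel-based gradient and the exact bit layout are illustrative assumptions:

    import numpy as np
    from scipy import ndimage

    def iris_code(rect_iris):
        # Gradient direction at every pixel of the rectified iris.
        gx = ndimage.sobel(rect_iris.astype(float), axis=1)
        gy = ndimage.sobel(rect_iris.astype(float), axis=0)
        angle = np.arctan2(gy, gx)                        # in (-pi, pi]
        quadrant = ((angle + np.pi) // (np.pi / 2)).astype(int) % 4
        # Two bits per pixel encoding the quadrant index.
        bits = np.stack([(quadrant >> 1) & 1, quadrant & 1], axis=-1)
        return bits.ravel().astype(np.uint8)

    def hamming_distance(code_a, code_b):
        return int(np.count_nonzero(code_a != code_b))

    def best_threshold(codes, labels, candidates):
        # Utility per pair outcome: tp +1, tn +1, fp -5, fn -1.
        def utility(t):
            total = 0
            for i in range(len(codes)):
                for j in range(i + 1, len(codes)):
                    same = labels[i] == labels[j]
                    pred = hamming_distance(codes[i], codes[j]) < t
                    if pred and same:
                        total += 1        # true positive
                    elif not pred and not same:
                        total += 1        # true negative
                    elif pred and not same:
                        total -= 5        # false positive
                    else:
                        total -= 1        # false negative
            return total
        return max(candidates, key=utility)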

Machine Learning-based Recognition

We were able to successfully localize and segment all of the iris images in the MMU dataset. These rectified images were then used as the source of features for a machine learning classifier. We experimented with several sets of features:
1. Raw pixels: we took the brightness values of the image and concatenated them into one long feature vector of length 7752.
2. Textons [4]: we used 25 5x5 texton cluster centers as a basis for generating histograms as features.
3. PHOG: we computed several histograms of oriented gradients (window sizes 1, 2, 3, 4) and concatenated them into a feature vector.
4. DAISY [5]: we computed DAISY features over the image and used them as the feature vector.
We split the dataset into 300 training images and 150 test images. The results are shown below.
Classifier                           Features              Error
SVM                                  Raw pixels            0.08
SVM                                  Raw pixels + textons  0.08
SVM                                  PHOG                  0.0267
Random Forest (500 trees, entropy)   DAISY                 0.00
The failure cases for the SVM are shown in the accompanying figure.
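A sketch of the best-performing configuration from the table above (DAISY features with a 500-tree random forest using the entropy criterion); the DAISY parameters and the flattening of the descriptor grid into a single vector are illustrative assumptions:

    import numpy as np
    from skimage.feature import daisy
    from sklearn.ensemble import RandomForestClassifier

    def daisy_features(im):
        # Dense DAISY descriptors over the rectified iris, flattened into
        # one feature vector (all images share the same dimensions).
        return daisy(im, step=8, radius=15, rings=2, histograms=6,
                     orientations=8).ravel()

    def random_forest_error(train_imgs, y_train, test_imgs, y_test):
        X_train = np.array([daisy_features(im) for im in train_imgs])
        X_test = np.array([daisy_features(im) for im in test_imgs])
        clf = RandomForestClassifier(n_estimators=500, criterion="entropy")
        clf.fit(X_train, y_train)
        return 1.0 - clf.score(X_test, y_test)  # error rate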

Conclusion and Future Work

Our machine learning methods brought our error on the dataset down to 0%. These results are comparable to current state-of-the-art techniques on this dataset and are very encouraging. The variety of features we experimented with shows that general computer vision techniques are effective for iris recognition, whereas most related work uses new, domain-specific techniques. The dataset we worked with was convenient in that its images had good lighting and not too much noise; however, it was also small. To further benchmark our methods, we could test them on larger, noisier datasets and see whether the computer vision techniques we used hold up on tougher data. Other areas to look into would be using more advanced features, such as the Fourier features discussed earlier, comparing their results to other techniques, and incorporating Hamming distance techniques into the machine learning we performed.

References

[1] J. Daugman. How iris recognition works. IEEE Transactions on Circuits and Systems for Video Technology, 14(1):21–30, 2004.
[2] J. Daugman. New methods in iris recognition. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 37(5):1167–1175, 2007.
[3] L. Ma, Y. Wang, and T. Tan. Iris recognition using circular symmetric filters. In Proceedings of the 16th International Conference on Pattern Recognition, volume 2, pages 414–417, 2002.
[4] J. Malik, S. Belongie, J. Shi, and T. Leung. Textons, contours and regions: cue integration in image segmentation. In Proceedings of the Seventh IEEE International Conference on Computer Vision, volume 2, pages 918–925, 1999.
[5] E. Tola, V. Lepetit, and P. Fua. DAISY: An Efficient Dense Descriptor Applied to Wide Baseline Stereo. IEEE Transactions on Pattern Analysis and Machine Intelligence, 32(5):815–830, May 2010.
[6] R. Wildes. Iris recognition: an emerging biometric technology. Proceedings of the IEEE, 85(9):1348–1363, 1997.