64 References [1] P. N. Belhumeur, J. P. Hespanha, and D. J. Kriegman, “Eigenfaces vs. Fisherfaces: recognition using class specific linear projection,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 19, no. 7, pp. 711–720, 1997. [2] C. Campbell and N. Cristianini, “Simple learning algorithms for training support vector machines,” Technical report, University of Bristol ,1998. [3] W. Chu and Z. Ghahramani, “Gaussian processes for ordinal regression,” Tech. report, University College London, 2004. [4] L. Csato, “Gaussian Processes: Iterative Sparse Approximations.” Ph.D. dissertation, University of Aston in Birmingham, 2002. [5] K. Delac, M. Grgic, and S. Grgic, “Independent comparative study of PCA, ICA, and LDA on the FERET data set,” International Journal of Imaging Systems and Technology, vol. 15, no. 5, pp. 252–260, 2005. [6] A. Graf, F. Wichmann, H. Bulthoff, and B. Scholkopf, “Classification of Faces in Man and Machine,” Neural Computation archive Volume 18, Issue 1, pp. 143 – 165 , 2006. [7] M. Grgic and K. Delac, “Face recognition homepage.” [Online]. Available: http://www.face-rec.org. [Accessed May 22, 2007] [8] G. Guo, S. Li, and K. Chan, “Face recognition by support vector machines,” Automatic Face and Gesture Recognition, 2000. Proceedings. Fourth IEEE International Conference on, pp. 196–201, 2000. [9] S. Haykin, Neural Networks: A Comprehensive Introduction. Prentice Hall, 1999. 65 [10] B. Heisele, P. Ho, and T. Poggio, “Face recognition with support vector machines: Global versus component-based approach,” in International Conference on Computer Vision, 2001, pp. II: 688–694. [11] R. Herbrich, Learning kernel classifiers. MIT Press Cambridge, Mass, 2002. [12] K. Jonsson, J. Matas, Y. P. Li, and J. V. Kittler, “Learning support vectors for face verification and recognition,” in International Conference on Automatic Face and Gesture Recognition, 2000, pp. 208–213. [13] H. Kim and Z. Ghahramani, “The em-ep algorithm for gaussian process classification,” In Proc. of the Workshop on Probabilistic Graphical Models for Classification, 2003. [14] M. Kuss, “Gaussian process models for robust regression, classification, and reinforcement learning,” Ph.D. dissertation, Technische Universit¨at Darmstadt, 2006. [15] M. Kuss and C. E. Rasmussen, “Assessing approximate inference for binary gaussian process classification,” Journal of Machine Learning Research, vol. 6, pp. 1679–1704, 2005. [16] N. Lawrence, “Ivm software.” [Online]. Available: http://www.dcs.shef.ac.uk/ neil/ivm/. [Accessed May 21, 2007]. [17] N. D. Lawrence and J. C. Platt, “Learning to learn with the informative vector machine,” in ICML ’04: Proceedings of the twenty-first international conference on Machine learning. New York, NY, USA: ACM Press, 2004, p. 65. [18] N. Lawrence, J. C. Platt and M. I. Jordan. “Extensions of the informative vector machine”. In J. Winkler, N. D. Lawrence and M. Niranjan (eds) Proceedings of the Sheffield Machine Learning Workshop, Springer-Verlag, Berlin. 2005. 66 [19] N. D. Lawrence, M. Seeger, and R. Herbrich, “Fast sparse Gaussian process methods: The informative vector machine,” in Advances in Neural Information Processing Systems 15, 625-632, 2003. [20] N. D. Lawrence, M. Seeger and R. Herbrich, "The informative vector machine: a practical probabilistic alternative to the support vector machine" Technical Report no CS- 04-07, Department of Computer Science, University of Sheffield. 2004. [21] A. Martinez and A. Kak, “PCA versus LDA,” Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol. 23, no. 2, pp. 228–233, 2001. [22] T. P. Minka, “A family of algorithms for approximate bayesian inference,” Ph.D. dissertation, Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2001. [23] M. Møller, A scaled conjugate gradient algorithm for fast supervised learning. Aarhus University, Computer Science Department. [24] I. T. Nabney, “Netlab neural network software.” [Online], Available: http://www.ncrg.aston.ac.uk/netlab/. [Accessed May 21, 2007]. [25] J. Platt, “Sequential minimal optimization: A fast algorithm for training support vector machines,” Advances in Kernel Methods - Support Vector Learning, B. Schölkopf, C. Burges, and A. Smola, eds., pp. 185-208, MIT Press, 1999. [26] Y. Qi, “Extending Expectation Propagation for Graphical Models,” Ph.D. dissertation, Massachusetts Institute of Technology, 2004. [27] J. Quiñonero-Candela, C. E. Rasmussen and Z. Ghahramani , “Open Problems in Gaussian Processes for Machine Learning” The NIPS*05 GP Workshop, [Online] Available :http://gp.kyb.tuebingen.mpg.de/. [Accessed May 21, 2007]. 67 [28] C. E. Rasmussen and C. Williams, “Gaussian process regression and classification.” [Online]. Available: http://www.GaussianProcess.org/gpml/code/. [Accessed May 22, 2007]. [29] M. Seeger, “Bayesian gaussian process models: PAC-bayesian generalization error bounds and sparse approximations,” Ph.D. dissertation, University of Edinburgh, July 2003. [30] M. Seeger, N. D. Lawrence, and R. Herbrich, “Efficient nonparametric bayesian modelling with sparse Gaussian process approximations,” 2006. [Online]. Available: http://www.kyb.tuebingen.mpg.de/bs/people/seeger/. [Accessed May 21, 2007]. [31] H. Seung, M. Opper, and H. Sompolinsky, “Query by committee,” Proceedings of the fifth annual workshop on Computational learning theory,pp. 287–294, 1992. [32] M. Stankovic, V. Moustakis, and S. Stankovic, “Text categorization using informative vector machines,” in The International Conference on Computer as a Tool, EUROCON 2005, vol. 6, 2005. [33] H. Tang, M. Lyu, and I. King, “Face recognition committee machine,” Acoustics, Speech, and Signal Processing, 2003. Proceedings.(ICASSP’03). 2003 IEEE International Conference on, vol. 2,2003. [34] M. Tipping, “The relevance vector machine,” in Advances in Neural Information Processing Systems, San Mateo, CA. Morgan Kaufmann, 2000. [Online]. Available: citeseer.ist.psu.edu/tipping00relevance.html. [35] V. N. Vapnik, The Nature of Statistical Learning Theory, 2nd ed. Springer, 1999. 68 [36] V. Vapnik, Statistical learning theory, Wiley, 1998. [37] M. Yang, “Face recognition using kernel methods,” Advances in Neural Information Processing Systems, vol. 14, pp. 215–220, 2002. [38] W. Zhao, R. Chellappa, P. J. Phillips, and A. Rosenfeld, “Face recognition: A literature survey,” ACM Comput. Surv., vol. 35, no. 4, pp. 399–458, 2003.