Потенциальные функции для анализа сигналов и символьных последовательностей разной длины
Диссертация
Научная новизна. В работе предложены новые вероятностные модели случайных преобразований сигналов и символьных последовательностей, в частности, модели эволюционных изменений аминокислотных последовательностей белков. На основе этих моделей впервые построен класс корректных потенциальных функций, выражающих правдоподобие гипотезы о наличии общего прародителя у пары сравниваемых сигналов либо… Читать ещё >
Список литературы
- Plamandon R., Srihari S. N. On-line and off-line handwriting recognition: A comprehensive survey // IEEE Trans. Pattern Recognition and Machine Intelligence, 2000, Vol. 22, pp. 107 131.
- Kholmatov, A., Yanikoglu, В.: Biometric authentication using online signatures // Proc. ISCIS, Springer LNCS-3280, 2004, pp. 373 380.
- Martens R., Claesen L. On-line signature verification by dynamic time-warping // IEEE: ICPR, 1996, pp. 38−42.
- Колядин Д.В. Анализ динамических кривых применительно к задаче верификации рукописной подписи // Сборник трудов 11-й всероссийской конференции Математические методы распознавания образов, 2003, с. 330 332.
- J. Richiardi, Н. Ketabdar, A. Drygajlo. Local and global feature selection for on-line signature verification // Proceedings of the Eighth International Conference on Document Analysis and Recognition (ICDAR'05), Seoul, South Korea, 2005, 625−629.
- Сулимова B.B., Моттль B.B. Идентификация личности по динамике подписи методом опорных векторов// Труды международной научной конференции ММТТ-16, Ростов-на-Дону, 2003 г, т.4, С.23−25.
- Feng Н., Wah С.С. Online signature verification using a new extreme points warping technique // Pattern Recognition Letters, 2003, Vol. 24, pp. 2943−2951.
- Zhang D., Jain A.K. Searching for an Optimal Reference System for On-Line Signature Verification Based on (x, y) Alignment // ICBA 2004, LNCS 3072, pp. 519−525, 2004.
- Ту Дж., Гонсалес P. Принципы распознавания образов. М.: МИР, 1978, 401с.
- Дуда Р., Харт П. Распознавание образов и анализ сцен. М.: Мир, 1976, 511 с. .
- Вапник В.Н., Червоненкис А. Я. Теория распознавания образов. М.: Наука, 1974,415 с.
- Патрик Э. Основы теории распознавания образов. М.: Сов. радио, 1980, 408 с.
- Загоруйко Н.Г. Прикладные методы анализа данных и знаний. Новосибирск: Изд-во Ин-та математики, 1999, 270 с.
- Martens R., Claesen L. Dynamic programming optimisation for on-line signature verification // Proceedings of the Fourth International Conference on Document Analysis and Recognition (ICDAR'97), Ulm, Germany, 1997, Vol. 2, pp. 653−656.
- Chu S., Keogh E., Hart D., Pazzani M. Iterative deeping dynamic time warping for time series. // IEEE International Conference of Data Mining. Maebashi City. Japan, 2002.
- Дмитриев Д.А., Сулимова B.B., Моттль B.B. Измерение попарного несходства подписей для идентификации личности // Сборник трудов международной конференции ММТТ-16, 2003, т.4, с. 216−218.
- Hansheng Lei, Bingyu Sun. A Study on the Dynamic Time Warping in Kernel Machines // Proceedings of the 2007 Third International IEEE Conference on Signal-Image Technologies and Internet-Based System, 2007, pp. 839−845.
- W. A. Chaovalitwongse and P. M. Pardalos. On the time series support vector machine using dynamic time warping kernel for brain activity classification // Springer Science+Business Media, Inc., 2008, pp.125−138.
- D. DeCoste and B. Scholkopf. Training invariant support vector machines //Machine Learning, 46(1/3):161, 2002.
- Bahlmann C., Haasdonk B., and Burkhardt H. On-line Handwriting Recognition with Support Vector Machines— A Kernel Approach // Proc. of the 8th IWFHR, 2002, pp. 49−54.
- B. Haasdonk and D. Keysers. Tangent distance kernels for support vector machines // Proc. of the 16th ICPR, 2002.
- Shimodaira, H., Noma, K., Nakai, M. Dynamic time-alignment kernel in Support Vector Machine. //Advances in Neural Information Processing Systems 14, Vol. 2. MIT Press, Cambridge, MA, 2002, pp. 921−928.
- Pekalska E., Paclic P., Duin R. A Generalized Kernel Approach to Dissimilarity-based Classification //Journal of Machine Learning Research 2,2001, 175−211.
- Hong-Wei Ji and Zhong-Hua Quan. Signature Verification Using Wavelet Transform and Support Vector Machine //Lecture Notes in Computer Science. Volume 3644/2005, 2005, pp. 671−678.
- Lavi Shpigelman, Yoram Singer, Rony Paz, and Eilon Vaadia. Spikernels: Predicting arm movements by embedding population spike rate patterns in inner-product spaces //Neural Computation, vol. 17, no. 3, 2005, pp. 671−690.
- Corinna Cortes, Patrick Haffner, and Mehryar Mohri, Rational kernels: Theory and algorithms // JMLR, vol. 5, 2004, pp. 1035−1062.
- Eichhorn, J. Applications of Kernel Machines to Structured Data: PhD thesis / Eichhorn, J. Berlin, 2007.
- K. R. Sivaramakrishnan, K. Karthik, and C. Bhattachaiyya. Kernels for large margin time-series classification // Neural Networks, IJCNN 2007, International Joint Conference, 2007, pp. 2746−2751.
- Vapnik V. Statistical Learning Theory. Wiley, Chichester, GB, 1998.
- Kholmatov, A., Yankolu, B. Identity Authentication Using Improved On-line Signature Verification Method // Pattern Recognition Letters, vol. 26, no. 15, 2005, pp. 2400−2408.
- Bousquet, O.- Pez-Cruz, F. Kernel methods and their applications to signal processing. // 2003 IEEE International Conference. Acoustics, Speech, and Signal Processing, Proceedings. (ICASSP '03). vol.4, 2003, IV- 860−3.
- N. Smith and M. Gales. Speech recognition using SVMs //Advances in Neural Information Processing Systems. /T. Dietterich, S. Becker, and Z. Ghahramani editors. volume 14. MIT Press, 2002.
- L. Ralaivola, F. d’Alche-Buc. Dynamical Modeling with Kernels for Nonlinear Time Series Prediction, //Advances in Neural Information Processing Systems, /L. Saul eds, MIT Press, 2004.
- Yoon, H.S.- Lee, J.Y.- Yang, H.S. An online signature verification system using hidden Markov model in polar space. Frontiers in Handwriting Recognition 2002 // Proceedings. Eighth International Workshop, 2002, pp. 329 333.
- Haussler, D. Convolution kernels on discrete structures. Technical Report UCSC-CRL-99−10, UC Santa Cruz, 1999.
- Watkins, C. Dynamic alignment kernels //Advances in Large Margin Classifiers /А. J. Smola, P. L. Bartlett, B. Sch’olkopf and D. Schuurmans eds. MIT Press, Cambridge, MA, 2000, pp. 3950.
- Clark A. Learning morphology with pair hidden markov models // Proceedings of the Student Workshop at ACL 2001, Toulouse, July 2001.
- V. Wan and S. Renals, Evaluation of kernel methods for speaker verification and identification, // Proc. ICASSP 2002, Orlando, FL, 2002.
- M. Cuturi, J. P. Vert, O. Birkenes, and T. Matsui. A Kernel for Time Series Based on Global Alignments. Proc. // ICASSP, 2007, pp. 413−416.
- Гельфанд M.C. Апология биоинформатики //Биофизика, 2005, том 50, № 4, с. 752−766.
- Needleman S.B., Wunsch C.D. A general method applicable to the search for similarities in the amino-acid sequence of two proteins //Journal of Molecular Biology: 48, 1970, pp. 443−53.
- Smith T.F., Waterman M.S. Identification of common molecular subsequences // Journal of Molecular Biology: 147, 1981, pp. 195−197.
- Altschul S.F., Madden T.L., Schaffer A.A., Zhang J., Zhang Z., Miller W., Lipman D. J. Gapped BLAST and PSI-BLAST: A new generation of protein database search programs // Nucleic Acids Research, 25: 1997, pp. 3389−3402.
- Pearson W.R. Flexible sequence similarity searching with the FASTA3 program package // Methods Mol. Biol.: 132, 2000, pp. l85−219.
- Sankoff D. and Kruskal J., Eds. Time warps, string edits, and macromolecules: The theory and practice of sequence comparison // CSLI Publications, Stanford, 1999.
- Joachims T. Learning to Align Sequences: A Maximum-Margin Approach, 2003.
- Kernel Methods in Computational Biology / B. Scholkopf, K. Tsuda and J.-P. Vert. eds. MIT Press, 2004.
- T. Hofmann, B. Scholkopf, and A. J. Smola. Kernel methods in machine learning // Ann. Statist. Volume 36, Number 3, 2008, pp.1171−1220.
- A. Ben-Hur, C. S. Ong, S. Sonnenburg, B. Scholkopf and G. Riitsch. Support Vector Machines and Kernels for Computational Biology // PLoS Comput Biol. 2008 October- 4(10).
- Bernhard Sch’olkopf and Alexander J. Smola. Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond. MIT Press, Cambridge, MA, 2002.
- Sonnenburg S, Ra" tsch G, Rieck K. Large scale learning with string kernels. /Bottou L3 Chapelle O, DeCoste D, Weston J, eds. Large scale kernel machines MIT Press., 2007, pp. 73 104.
- Zhang, S.-W., Pan, Q., Zhang, H.-C., Zhang, Y.-L., and Wang, H.-Y. Classification of protein quaternary structure with support vector machine //Bioinformatics, 2003, 19(18), pp. 2390— 2396.
- T. Hofmann, B. Scholkopf, and A. J. Smola Kernel methods in machine learning //Ann. Statist. Volume 36, Number 3, 2008, pp. 1171−1220.
- Gert R. G. Lanckrietl, Tijl De Bie3, Nello Cristianini4, Michael I. Jordan2 and William Stafford Noble5. A statistical framework for genomic data fusion //Bioinformatics, Vol. 20 no. 16, 2004, pp. 2626−2635.
- Kuang R., le E., Wang Ke and Kai, Siddiqi M., Freund Y., Leslie C. Profile-Based String Kernels for Remote Homology Detection and Motif Extraction. CSB, 2004, pp. 152−160.
- Leslie, C. S., Eskin, E., Cohen, A., Weston, J., and Noble, W. S. Mismatch, string kernels for discriminative protein classification //Bioinformatics, 2004, 20(4), pp. 467—476.
- Tsuda K, Kawanabe M, Ra" tsch G, Sonnenburg, S Mu" lier K. A new discriminative kernel from probabilistic models //Neural Computation 14, 2002, pp. 2397−2414.
- Wang, M., Yang, J., Liu, G.-P., Xu, Z.-J., and Chou, K.-C. Weighted-support vector machines for predicting membrane protein types based on pseudo-amino acid composition //Protein Eng. Des. Sel., 17(6), 2004, pp. 509−516.
- L. Liao and W. S. Noble. Combining pairwise sequence similarity and support vector machines for remote protein homology detection // Proceedings of the Sixth Annual International. Conference on Computational Molecular Biology, pp. 225−232, 2002
- Ben-Hur, A. and Brutlag, D. Remote homology detection: a motif based approach //Bioinformatics, 19(Suppl. 1), 2003, pp. 26−33.
- Kondor, R.I. and Lafferty, J. Diffusion kernels on graphs and other discrete input spaces. /Sammut, C. and Hoffmann, A. eds // Proceedings of the International Conference on Machine Learning. Morgan Kaufmann, San Francisco, CA, 2002, pp. 315−322.
- Vert, J.-P., Saigo, H., and Akutsu, T. Local alignment kernels for biological sequences /Sch'olkopf, B., Tsuda, K., and Vert, J. editors. Kernel Methods in Computational Biology -MIT Press, 2004, pp. 131−154.
- Qiu J., Hue, M., Ben-Hur A., Vert J.-P., Noble W. S. A structural alignment kernel for protein structures //Bioinformatics: 23(9), 2007, pp. 1090−1098.
- Sun L., Ji S., Ye J. Adaptive diffusion kernel learning from biological networks for protein function prediction //BMC Bioinformatics: 9,2008, 162.
- Durbin, R., Eddy, S., Krogh, A., and Mitchison, G. Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids. Cambridge University Press, 1998.
- Cuturi M., Vert. J.-P. A mutual information kernel for sequences. Neural Networks // Proc. of IEEE Int. Joint Conference, 3,2004, pp. 1905−1910.
- Cuturi, M. and Vert, J.-P. The context-tree kernel for strings // Neural Network, 2005.
- Jaakkola, T. S., Diekhans, M., and Haussler, D. Using the Fisher kernel method to detect remote protein homologies // Proceedings of the Seventh International Conference on Intelligent Systems for Molecular Biology, 1999, pp. 149−158.
- Vert, J.-P., Thurman, R., and Noble, W. S. Kernels for gene regulatory regions // Adv. Neural. Inform. Process Syst., 2006.
- Tsuda, K., Kin, Т., and Asai K. Marginalized kernels for biological sequences //Bioinformatics, 2002, vol.18, pp. 268−275.
- A. Chervonenkis, A.J. Gammerman, I.A. Shalimuradov and V.V. Solovyev. Sequence Alignment Kernel for recognition of promoter regions //Bioinformatics 19, 2003, pp. 1964−1971.
- Rangwala H., Karypis G. Profile-based direct kernels for remote homology detection and fold recognition//Bioinformatics: 21(23), 2005, pp. 4239−4247.
- Thome J.L., Kishino H., Felsenstein J. An evolutionary model for maximum likelihood alignment of DNA sequences //Journal of Molecular Evolution, 33, 1991, pp. 114−124.
- Miklos I., Lunter G.A., Holmes I. A «long indel» model for evolutionary sequence alignment //Molecular Biology and Evolution: 21(3), 2004, pp. 529−540.
- Miklos I., Novak A., Satija R., Lyngso R., Hein J. Stochastic models of sequence evolution including insertion-deletion events // Statistical methods in medical research: 29, 2008.
- Metzler, D. Statistical alignment based on fragment insertion and deletion models // Bioinformatics- 19, 2003, pp. 490−499.
- Seeger, M. Covariance kernels from bayesian generative models // Adv. Neural Inform. Process. Syst., volume 14,2002, pp. 905−912.
- Duin R.P.W, De Ridder D., Tax D.M.J. Experiments with a featureless approach to pattern recognition//Pattern Recognition Letters, vol. 18, no. 11−13, 1997, pp. 1159−1166.
- Mottl V.V., Sulimova V.V., Tatarchuk A.I. Multy-kernel approach to on-line signature verification // Proceedings of the Eighth IASTED International Conference on Signal and Image Processing, held August 14−16, 2006, Honolulu, Hawaii, USA, pp. 448−453
- Айзерман M.A., Браверман Э. М., Розоноэр Л. И. Метод потенциальных функций в теории обучения машин. М.: Наука, 1970, 384 с.
- Duin R.P.W, De Ridder D., Tax D.M.J. Featureless classification // Proceedings of the Workshop on Statistical Pattern Recognition, Prague, June 1997, pp.37−42.
- Vapnik V. Statistical Learning Theory. John-Wiley & Sons, Inc. 1998.
- Mercer T. Functions of positive and negative type and their connection with the theory of integral equations // Trans. London. Philos. Soc., 1999, A, 209, 415−416.
- Моттль B.B. Метрические пространства, допускающие введение линейных операций и скалярного произведения. //ДАН, 2003, том 67, № 1.
- Berg С., Christensen J. and Ressel P. Harmonic analysis on semigroups: Theory of positive definite and related functions. Springer, 1984.
- Vanschoenwinkel В., Manderic B. Substitution matrix based kernel functions for protein secondary structure prediction // Proceedings of the 2004 International Conference on Machine Learning and Applications. Decemberl6−18, 2004, pp. 388 396.
- Wu F., Oslon В., Dobbs D., Honavar V. Comparing kernels for predicting protein binding sites from amino acid sequence //Newral Netwarks, 2006, IJCNN’OC, pp. 1612−1616.
- Henikoff S., Henikoff J. Amino acid substitution matrices from protein blocks // Proc. Natl. Acad. Sci., 1992, 10 915−10 919.
- Altschul S.F. The Statistics of Sequence Similarity Scores, http://www.ncbi.nlm.nih.gov/ BLAST/tutorial/Altschul-1 .html.
- Корн Г., Корн Т. Справочник по математике для научных работников и инженеров. М.: Наука, 1974.
- Gotoh О. An improved algorithm for matching biological sequences //Journal of Molecular Biology, 1982, pp. 705−708.
- Attwood Т.К. The PRINTS database: a resource for identification of protein families. // Brief Bioinformatics, 3, 2002, pp. 252−263.
- Rost B. at al. PHD-an automatic server for protein secondary structure prediction. // Computational applications in biosciences. 10, 1994, pp. 53−60.
- Goebel U. at al. Correlated mutations and residue contacts in proteins. // Proteins, 18, 1994, pp. 309−317.
- Шлезингер M. И. О самопроизвольном различении образов // Сб. Читающие автоматы Киев, Наукова думка, 1965.
- Моттль В.В., Мучник И. Б. Скрытые марковские модели в структурном анализе сигналов М.: Физматлит, 1999, 352 с.
- Ю4.Вапник B.H. Восстановление зависимостей по эмпирическим данным М.: Наука, 1979, 447 с.
- Юб.Моттль B.B., Середин О. С., Красоткина О. В. Комбинирование потенциальных функций при восстановлении зависимостей по эмпирическим данным //Искусственный интеллект -2'2004, с. 134−139.
- SVC 2004: First International Signature Verification Competition. URL: http://www.cs.ust.hk/ svc2004/index.html.
- H.-W. Ji, Z.-H.Quan. Signature verification using wavelet transform and support vector machine. ICIC 2005, Springer-Verlag Berlin Heidelberg 2005, Part 1, LNCS 3644, pp.671−678.
- Kaufman, L., Rousseeuw, P. J. Finding Groups in Data. Wiley, New York, 1990.
- Mirkin B. Clustering for Data Mining: A Data Recovery Approach. Chapman and Hall/CRC, 2005.
- Virus Database at University College London (VIDA). URL: http://www.biochem.ucl.ac.uk/ bsm/virusdatabase/VIDA3/VIDA.html
- McGeoch, DJ, Rixon, FJ, and Davison, AJ: Topics in herpesvirus genomics and evolution // Virus Research 2006, 117, pp. 90−104.
- Smith TF, Waterman MS. Identification of Common Molecular Subsequence //Journal of Molecular Biology, 147: 1981, pp. 195−197.
- Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool //Journal of Molecular Biology 215 (3): 1990, pp. 40310.