Исследование конечно-линейных статистических моделей.
Оптимизация и избыточность
Диссертация
Описание некоторых систем в различных областях современной науки, таких как биология, медицина, физика и химия, зачастую содержит информацию, представленную в виде большого количества категориальных признаков. В качестве примеров таких признаков можно привести наличие или отсутствие симптомов различных заболеваний пациентов в медицине, кодирование нуклеотидных остатков при помощи двух или четырех… Читать ещё >
Список литературы
- Greenacre М. Theory and Applications of Correspondence Analysis. London, England: Academic Press, 1984.
- Giffi A. Nonlinear Multivariate Analysis. Chichester, England: Wiley, 1990.
- Cox D. R. The analysis of multivariate binary data // Applied Statistics. 1972. Vol. 21. P. 113−120.
- Bloomfield P. Linear transformations for multivariate binary data // Biometrics. 1974. Vol. 30. P. 609−617.
- Алексеева H. П., Алексеев А. О. О роли конечных геометрий в корреляционном анализе бинарных признаков // Математические модели. Теория и приложения. Под ред. чл.корр. РАЕН, проф. Чиркова М. К. НИ-ИМ СПбГУ. 2004. Т. 4. С. 102−117.
- Алексеева Н. П. Комбинаторный анализ двух форм скрытой периодичности категориальных последовательностей // Вестн. С.-Петерб. ун-та. Сер. 1: Математика, Механика, Астрономия. 2007. Вып. 3. С. 55−64.
- Алексеева Н. П., Конради А. О., Бондаренко Б. Симптомный анализ в исследовании долгосрочного клинического прогноза // Артериальная гипертензия. Том 14. 2008. Т. 1. С. 38−43.
- Ананьевская (Грачева) П. В. Метод дискретной оптимизации на основе параметризации грассманиана в многомерном структурировании дихотомических данных // Вестн. С.-Петерб. ун-та. Сер. 1: Математика, Механика, Астрономия. 2011. Вып. 4. С. 28−37.
- Мартынов Б. В., Алексеева Н. П., Ананьевская (Грачева) П. В. и др.
- Прогностические факторы у больных с глиомами: симптомно-синдро-мальный анализ // Вестн. Рос. Воен.-мед. акад. 2010. Т. 1, № 29. С. 7−14.
- Alexeyeva N., Smirnov I., Ananyevskaya (Gracheva) P., Martynov B. The finitely geometric symptom analysis in the glioma survival study // Proc. of the 2nd International Conference on BioMedical Engineering and Informatics (BMEI). 2009. P. 1−4.
- Pearson К. On Lines and Planes of Closest Fit to Systems of Points in Space // Philosophical Magazine. 1901. Vol. 2, no. 6. P. 559−572.
- Hotelling H. Analysis of a complex of statistical variables into principal components // J. Educ. Psychol. 1933. Vol. 24. P. 417−441, 498−520.
- Hotelling H. Relations between two sets of variates // Biometrica. 1936. Vol. 28. P. 321−377.
- Kettenring J. R. Canonical analysis of several sets of variables // Biometrica. 1971. Vol. 56. P. 433−451.
- Fisher R. A. The Use of Multiple Measurements in Taxonomic Problems // The Annals of Eugenics. 1936. Vol. 7, no. 2. P. 179−188.
- Nerlove M., Press S. J. Univariate and multivariate log-linear and logistic models. Santa-Monica, California: Rand Corporation, 1973.
- Friedman J. Regularized discriminant analysis // Journal of American Statistical Association. 1989. Vol. 84. P. 165−175.
- Fisher R. A. The Precision of Discriminant Functions // The Annals of Eugenics. 1938. Vol. 10. P. 422−429.
- Greenacre M., Blasius J. Multiple Correspondence Analysis and Related Methods. Boca Raton, FL: Chapman & Hall/CRC, 2006.
- Michailidis G., de Leeuw J. The Giffi System of Descriptive Multivariate Analysis // Statistical Science. 1998. Vol. 13. P. 307−336.
- Gutch H. W., Gruber P., Yeredor A., Theis F. J. ICA over finite fields -separability and algorithms // Signal Processing. 2012. Vol. 92, no. 8. P. 1796−1808.
- Yeredor A. Independent Component Analysis Over Galois Fields of Prime Order // IEEE Transactions on Information Theory. 2011. Vol. 57, no. 8. P. 5342−5359.
- Akaike H. Information theory and an extension of the maximum likelihood principle // Second International Symposium on Information Theory. Ed. by B. Petrov, B. Csaki. Academiai Kiado: Budapest. 1973. P. 267−281.
- Berger A. L., Pietra S. A. D., Pietra V. J. D. A Maximum Entropy approach to Natural Language Processing // Computational linguistics. 1996. Vol. 22. P. 39−71.
- He F. Maximum entropy, logistic regression, and species abundance // Oikos. 2010. Vol. 119. P. 572−582.
- Renyi A. On measures of dependance // Acta Mathematica Academiae Sci-entiarium Hungaricae. 1959. Vol. 10. P. 441−451.
- Joe H. Relative entropy measures of multivariate dependence // Journal of the American Statistical Association. 1989. Vol. 84. P. 157−164.
- Renyi A. An estimator for the mutual information based on a criterion for independence // Computational Statistics and Data Analysis. 1999. Vol. 32. P. 1−17.
- Green P. J. Iteratively Reweighted Least Squares for Maximum Likelihood Estimation, and Some Robust and Resistant Alternatives // Journal of the Royal Statistical Society B. 1984. Vol. 46, no. 2. P. 149−193.
- Buja A., Hastie Т., Tibshirani R. Linear Smoothers and Additive Models // The Annals of Statistics. 1989. Vol. 17, no. 2. P. 453−510.
- Gabriel K. R. Generalised bilinear regression // Biometrica. 1998. Vol. 85. P. 689−700.
- Silberstein N., Etzion T. Enumerative Coding for Grassmannian Space // IEEE Transactions on Information Theory. 2011. Vol. 57. P. 365−374.
- Borel A., Tits J. Groupes reductifs // Publ. Math. IHES 27. 1965. P. 55−152.
- Гриффите Ф., Харрис Д. Принципы алгебраической геометрии. Москва: Мир, 1982.
- Calvet J., Cardillo J., Hennet J., Szigeti F. Method of Relaxation applied to optimization of discrete systems // Electronic Journal of Differential Equations. 2003. P. 13−19.
- Raghavan P., Thompson C. Randomized rounding: A technique for provably good algorithms and algorithmic proofs // Combinatorica. 1987. Vol. 7. P. 365−374.
- Land A. H., Doig A. G. An automatic method of solving discrete programming problem // Econometrica. 1960. Vol. 28. P. 497−520.
- Du D.-Z., Pardalos P. Handbook of Combinatorial Optimization. Netherlands: Kluwer Academic Publishers, 1998.
- Owens J. D., Luebke D., Govindaraju N. et al. A survey of general-purpose computation on graphics hardware // Computer Graphics Forum. 2007. Vol. 26. P. 80−113.
- Suchard M. A., Rambaut A. Many-core algorithms for statistical phyloge-netics // Bioinformatics. 2009. Vol. 25. P. 1370−1376.
- Kirk D. B., Hwu W. M. Programming Massively Parallel Processors: A Hands-on Approach. San Francisco, CA, US: Morgan Kaufmann Publishers Inc, 2010.
- Preis T., Virnau T., Paul W., Schneider J. J. GPU accelerated Monte Carlo simulation of the 2D and 3D Ising model // Journal of Computational Physics. 2009. Vol. 228, no. 12. P. 4468−4477.
- Zhou H., Lange K., Suchard M. A. Graphics Processing Units and High-Dimensional Optimization // Statistical Science. 2010. Vol. 25, no. 3. P. 311−324.
- Suchard M. A., Wang Q., Chan C. et al. Understanding GPU Programming for Statistical Computation: Studies in Massively Parallel Massive Mixtures // Journal of Computational and Graphical Statistics. 2010. Vol. 19, no. 2. P. 419−438.
- Tibbits M. M., Haran M., Liechty J. C. Parallel Multivariate Slice Sampling // Statistics and Computing. 2011. Vol. 21, no. 3. P. 415−430.
- Zhou Y., Liepe J., Sheng X. et al. GPU accelerated biochemical network simulation // Bioinformatics. 2011. Vol. 27, no. 6. P. 874−876.
- Lee A., Caron F., Doucet A., Holmes C. Bayesian sparsity-path-analysis of genetic association signal using generalized t priors // Statistical Applications in Genetics and Molecular Biology. 2012. Vol. 11, no. 2. P. 1544−6115.
- Lingoes J. C., Guttman L. Nonmetric factor analysis: a rank reducing alternative to linear factor analysis // Multivariate Behavioral Research. 1967. Vol. 2. P. 485−505.
- Kruskal J. B., Shepard R. N. A nonmetric variety of linear factor analysis // Psychometrika. 1974. Vol. 39. P. 123−157.
- Young F. W., Takane Y., de Leeuw J. The principal components of mixed measurement level multivariate data: an alternating least squares method with optimal scaling features // Psychometrika. 1978. Vol. 45. P. 279−281.
- Horst P. Relations among m sets of measures // Psychometrika. 1961. Vol. 26. P. 129−149.
- Nelder J., Wedderburn R. Generalized Linear Models // Journal of the Royal Statistical Society A. 1971. Vol. 135, no. 3. P. 370−384.
- Hastie T., Tibshirani R. Generalized Additive Models. London, England: Chapman & Hall, 1990.
- Yee T. W., Wild C. J. Vector Generalized Additive Models // Journal of the Royal Statistical Society B. 1996. Vol. 58, no. 3. P. 481−493.
- Yee T. W., Hastie T. J. Reduced-rank Vector Generalized Linear Models // Statistical Modelling. 2003. Vol. 3, no. 2. R 15−41.
- Hastie T., Tibshirani R., J.Friedman. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. NY, USA: Springer, 2009.
- Celeux G., Mkhadri A. Discrete regularized discriminant analysis // Statistics and Computing. 1992. Vol. 2. P. 143−151.
- Tutz G. An alternative choice of smoothing for kernelbased density estimates in discrete discriminant analysis // Biometrika. 1986. Vol. 73. P. 405−411.
- Saporta G. Discriminant analysis when all the variables are nominal, a stepwise method. Murray Hill, NJ: Spring meeting of the Psychametric Society, 1976.
- Thomas L. C., Edelman D. B., Crook J. N. Credit Scoring and its Applications. Philadelphia: PA: SIAM Monographs on Mathematical Modelling and Computation, 2002.
- Celeux G., Nakache J. Discrimination sur Variables Qualitatives. Paris: Polytechnica, 1994.
- Verde R., Palumbo F. Analisi fattoriale discriminante non-simmetrica su predittori qualitativi. Rimini: Atti del Convegno delia XXXVIII Riunione Scientifica della Societa Italiana di Statistica, 1996.
- Shannon C. A mathematical theory of communication // Bell System Technical Journal. 1948. Vol. 27. P. 379−423.
- Колмогоров А. Н. Новый метрический инвариант транзитивных динамических систем и автоморфизмов пространств Лебега // Доклады АН СССР. 1958. Т. 5.
- Cercignani С. The Boltzmann equation and its applications. Berlin: Springer, 1988.
- Jaynes E. T. Information theory and statistical mechanics // Physical Review. 1957. Vol. 106, no. 4. P. 620−630.
- Jaynes E. T. Information theory and statistical mechanics II // Physical Review. 1957. Vol. 108, no. 2. P. 171−190.
- Jaynes E. T. Information theory and statistical mechanics // Statistical Physics, K. Ford (ed.), Benjamin, New York. 1963. P. 181−218.
- Джейнс Э. Т. О логическом обосновании методов максимальной энтропии // ТИИИЭР. 1982. Т. 70, № 9. С. 33−51.
- Burg J. P. Maximum Entropy Spectral Analysis: Ph.D. thesis / Stanford University, Stanford, CA. 1975.
- Hinton G. E., Sejnowski Т. J. Learning and relearning in Boltzmann machines //In Rumelhart, D. E. and McClelland, J. L., editors, Parallel Distributed Processing: Explorations in the Microstructure of Cognition. 1986. Vol. 1. P. 282−317.
- Pietra S. D., Pietra V. D., Lafferty J. Inducing features of random fields // IEEE Transactions on Pattern Analysis and Machine Intelligence. 1997. Vol. 19. P. 380−393.
- Theil H. On the estimation of relationships involving qualitative variables // American Journal of Sociology. 1970. Vol. 76. P. 103−154.
- Sarndal С. E. A comparative study of association measures // Psychometri-ka. 1974. Vol. 39. P. 165−187.
- Cover Т. M., Thomas J. Elements of information theory, 2nd edition. New York: Wiley, 2006.
- Kojadinovic I. On the use of mutual information in data analysis: an overview // Proceedings of 11th International Symposium on Applied Stochastic Models and Data Analysis. 2005. P. 738−747.
- Chellappa R., Turaga P., Veeraraghavan A. Statistical analysis on Stiefel and Grassmann Manifolds with applications in Computer Vision // IEEE Conference on Computer Vision and Pattern Recognition. 2008. P. 1−8.
- Ghorpade S., Tsfasman M. Schubert varieties, linear codes and enumerative combinatorics // Finite Fields and Their Applications. 2005. P. 684−699.
- Кострикин А. И., Манин Ю. И. Линейная алгебра и геометрия. Москва: Наука, 1986.
- Knuth D. E. Subspaces, subsets, and partitions // Journal of Combinatorial Theory. 1971. Vol. 10. P. 178−180.
- Thomas S. Designs over finite fields // Geometriae Dedicata. 1987. Vol. 21. P. 237−242.
- Martin W. J., Zhu X. J. Anticodes for the Grassman and biliniar forms graphs // Designs, Codes, and Cryptography. 1995. Vol. 6. P. 73−79.
- Schwartz M., Etzion T. Codes and anticodes in the Grassman graph // Journal of Combinatorial Theory, Series A. 2002. Vol. 97. P. 27−42.
- Manganiello F., Gorla E., Rosenthal J. Spread codes and spread decoding in network coding //In Proceedings of International Symposium on Information Theory. July 2008. P. 881−885.
- Silva D., Kschischang F. R. On metric for error correction in network coding // IEEE Trans. Information Theory. December 2009. Vol. IT-55. P. 5479−5490.
- Koetter R., Kschischang F. R. Coding for errors and erasures in random network coding // IEEE Trans. Information Theory. August 2008. Vol. 54, no. 8. P. 3579−3591.
- Guan D.-J. Generalized Gray Codes with Applications // Proc. Natl. Sei. Counc. Repub. Of China (A). 1998. Vol. 22. P. 841−848.
- Sanders J., Kandrot E. CUDA by Example An Introduction to General-Purpose GPU Programming. US: Addison-Wesley Professional, 2010.
- Bouillaguet C., Chen H. C., Cheng C. M. et al. Fast Exhaustive Search for Polynomial Systems in F2 // Cryptographic Hardware and Embedded Systems, Lecture Notes in Computer Science. 2010. Vol. 6225. P. 203−218.
- Fluck О., Aharon S., Cremers D., Rousson M. GPU histogram computation // In Proceeding SIGGRAPH ACM, Research posters. 2006.
- Shams R., Barnes N. Speeding up Mutual Information Computation Using NVIDIA CUDA Hardware //In Proceedings 9th Biennial Conference of the Australian Pattern Recognition Society on Digital Image Computing Techniques and Applications. 2007. P. 555−560.
- Shams R., Kennedy R. A. Efficient histogram algorithms for NVIDIA CUDA compatible devices //In ICSPCS. 2007.
- Koppaka S., Mudigere D., Narasimhan S., Narayanan B. Fast histograms using adaptive CUDA streams // http://arxiv.org/abs/1011.0235.