Использование вероятностных методов для анализа аминокислотных последовательностей мембранных белков
Диссертация
При разработке методов предсказания мембранной структуры важной является проблема формирования обучающей выборки, т. е. выборки последовательностей, для которых известно точное расположение мембранных сегментов, в условиях нехватки данных по известным структурам мембранных белков. Авторам сервера ТМНММ удалось сформировать выборку из 160 аминокислотных последовательностей с экспериментально… Читать ещё >
Список литературы
- Al. Arai М., Mitsuke Н., Ikeda М., Xia J.X., Kikuchi Т., Satake М., Shimizu Т. 2004. ConPred II: a consensus prediction method for obtaining transmembrane topology models with high reliability. Nucleic Acids Res. 32:390−393.
- A2. Altschul S.F., Gish W., Miller W., Myers E.W., Lipman DJ. 1990. Basic local alignment search tool. J Mol Biol. 215(3):403−410.
- A3. Altschul S.F., Madden T.L., Schaffer A.A., Zhang J., Zhang Z., Miller W., Lipman D.J. 1997. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25(17):3389−3402.
- A4. Argos P., Rao J.K., Hargrave P.A. 1982. Structural prediction of membrane-bound proteins. Eur J Biochem. 128(2−3):565−575.
- A5. Arkin I.T., Brunger A.T. 1998. Statistical analysis of predicted transmembrane alpha-helices. Biochim Biophys Acta. 1429(1):113−128.
- A6. Altschul S.F. 1991. Amino acid substitution matrices from an information theoretic perspective. J Mol Biol. 219(3):555−565.
- A7. Agre P., King L.S., Yasui M., Guggino W.B., Ottersen OP, Fujiyoshi Y, Engel A, Nielsen S. 2002. Aquaporin water channels--from atomic structure to clinical medicine. J Physiol. 542:3−16.
- A8. Alberts В., Johnson A., Lewis J., Raff M., Roberts K., Walter P. Molecular Biology of the Cell. Garland Publishing Inc. Hamden, Connecticut, 2002.1616 p.
- Bl. Bairoch A., Boeckmann B. 1994. The SWISS-PROT protein sequence databank: current status. Nucleic Acids Res. 22(17):3578−3580.
- B2. Berman H.M., Westbrook J., Feng Z., Gilliland G., Bhat T.N., Weissig H., Shindyalov I.N., Bourne P.E. 2000. The Protein Data Bank. Nucleic Acids Res. 28(l):235−242.
- C2. Cserzo M., Eisenhaber F., Eisenhaber В., Simon I. 2004. TM or not TM: transmembrane protein prediction with low false positive rate using DAS-TMfilter. Bioinformatics. 20(1):136−137.
- C8. Chattopadhyay M.K. 2006. Mechanism of bacterial adaptation to low temperature. J. Biosci. 31(1): 157−165.
- Dl. Durbin R., Eddy S., Krogh A., and Mitchison G. 1998. Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids. Cambridge University Press. 356 pp.
- D2. Dayhoff M., Schwartz R., Orcutt B. 1978. A model of evolutionary change in protein. Atlas of Protein Sequence and Structure (Natl. Biomed. Res. Found., Washington). 5(3): 345−358.
- D3. Do C.B., Mahabhashyam M.S., Brudno M., Batzoglou S. 2005. ProbCons: Probabilistic consistency-based multiple sequence alignment. Genome Res. 15,330−340.
- D4. Dowhan W. 1997. Molecular Basis For Membrane Phospholipid Diversity: Why Are There So Many Lipids? Annu. Rev. Biochem. 66:199−232.
- El. Engelman D.M., Steitz T.A., Goldman A. 1986. Identifying nonpolar transbilayer helices in amino acid sequences of membrane proteins. Annu Rev Biophys Biophys Chem. 15:321 353.
- E2. Eisenberg D., Schwarz E., Komaromy M., Wall R. 1984. Analysis of membrane and surface protein sequences with the hydrophobic moment plot. J Mol Biol. 179(1): 125−142.
- E3. El Karkouri K., Gueune H., Delamarche C. 2005. MIPDB: a relational database dedicated to МЕР family proteins. Biol Cell. 97(7):535−543.
- Fl. Frishman D., Mewes H.W. 1997. Protein structural classes in five complete genomes. Nat Struct Biol. 4(8):626−628.
- F2. Frishman D., Mokrejs M., Kosykh D., Kastenmuller G., Kolesov G., Zubrzycki I., Gruber C., Geier В., Kaps A., Albermann K., Volz A., Wagner C., Fellenberg M., Heumann K., Mewes H.W. 2003. The PEDANT genome database. Nucleic Acids Res. 31(1):207−211.
- F3. Felsenstein J. 1996. Inferring phylogenies from protein sequences by parsimony, distance, and likelihood methods. Methods Enzymol. 266:418−427.
- F4. Fu D, Libson A, Miercke LJ, Weitzman C, Nollert P, Krucinski J, Stroud RM. 2000. Structure of a glycerol-conducting channel and the basis for its selectivity. Science. 290(5491):481−486.
- Gl. Gonnet G.H., Cohen M.A., Benner S.A. 1992. Exhaustive matching of the entire protein sequence database. Science. 256(5062): 1443−1445.
- G2. Gerstein M., Sonnhammer E.L., Chothia С. 1994. Volume changes in protein evolution. J. Mol. Biol. 236:1067−1078.
- G3. Galperin M.Y., Koonin E.V. 2001. Comparative genome analysis. Methods Biochem Anal. 43:359−392.
- HI. Henikoff S., Henikoff J.G. 1992. Amino acid substitution matrices from protein blocks. Proc Natl Acad Sci USA. 89(22):10 915−10 919.
- H2. Hofmann K., Stoffel W. 1993. TMbase A database of membrane spanning proteins segments. Biol. Chem. Hoppe-Seyler. 374,166
- H3. Hirokawa Т., Boon-Chieng S., Mitaku S. 1998. SOSUI: classification and secondary structure prediction system for membrane proteins. Bioinformatics. 14(4):378−379.
- H4. Heringa J. 1999. Two strategies for sequence comparison: profile-preprocessed and secondary structure-induced multiple alignment. Comput. Chem. 23:341−364.
- H5. Hannenhalli S.S., Russell R.B. 2000. Analysis and prediction of functional sub-types from protein sequence alignments. J Mol Biol. 303(l):61−76.
- Jl. Jones D.T., Taylor W.R., Thornton J.M. 1994. A model recognition approach to the prediction of all-helical membrane protein structure and topology. Biochemistry. 33(10):3038−3049.
- J2. Jones D.T., Taylor W.R., Thornton J.M. 1992. The rapid generation of mutation data matrices from protein sequences. Comput Appl Biosci. 8(3):275−282.
- J3. Jones D.T., Taylor W.R., Thornton J.M. 1994. A mutation data matrix for transmembrane proteins. FEBS Lett. 339(3):269−275.
- J4. Juretic D., Zoranic L., Zucic D. 2002. Basic charge clusters and predictions of membrane protein topology. J Chem Inf Comput Sci. 42(3):620−632.
- J5. Jaroszewski L., Li W., Godzik A. 2002. In search for more accurate alignments in the twilight zone. Protein Sci. 11:1702−1713.
- J6. Jones D.T. 1998. Do transmembrane protein superfolds exist? FEBS Lett. 423:281−285.
- J7. Jones S., Thornton J.M. 2004. Searching for functional sites in protein structures. Curr Opin Chem Biol. 8(l):3−7.
- Kl. Krogh, A., Larsson, В., von Heijne, G., and Sonnhammer, E.L. 2001. Predicting transmembrane protein topology with a hidden Markov model: Application to complete genomes. J. Mol. Biol. 305:567−580.
- K2. Krogh A., Mian I.S., Haussler D. 1994. A hidden Markov model that finds genes in E. coli DNA. Nucleic Acids Res. 22:4768−4778.
- КЗ. Krogh A., Brown M., Mian I.S., Sjolander K., Haussler D. 1994. Hidden Markov models in computational biology. Applications to protein modeling. J Mol Biol. 235(5):1501−1531.
- K4. Klein P., Kanehisa M., DeLisi C. 1985. The detection and classification of membrane-spanning proteins. Biochim Biophys Acta. 815(3):468−476.
- K5. Kalinina O.V., Mironov A.A., Gelfand M.S., Rakhmaninova A.B. 2004. Automatedselection of positions determining functional specificity of proteins by comparative analysis of orthologous groups in protein families. Protein Sci. 13(2):443−456.
- K6. Kalinina O.V., Novichkov P. S., Mironov A.A., Gelfand M.S., Rakhmaninova A.B. 2004. SDPpred: a tool for prediction of amino acid residues that determine differences in functional specificity of homologous proteins. Nucleic Acids Res. 32: 424−428.
- K7. Kaback H.R., Voss J., Wu J. 1997. Helix packing in polytopic membrane proteins: the lactose permease of Escherichia coli. Curr Opin Struct Biol. 7(4):537−542.
- Ml. Mirus 0., Schleiff E. 2005. Prediction of beta-barrel membrane proteins by searching for restricted domains. BMC Bioinformatics. 6:254.
- M2. Martelli P.L., Fariselli P., Casadio R. 2003. An ENSEMBLE machine learning approach for the prediction of all-alpha membrane proteins. Bioinformatics. 19(1):205−211.
- M3. Muller Т., Spang R., Vingron M. 2002. Estimating amino acid substitution models: a comparison of Dayhoff s estimator, the resolvent approach and a maximum likelihood method. Mol Biol Evol. 19(1):8−13.
- M4. Muller Т., Rahmann S., Rehmsmeier M. 2001. Non-symmetric score matrices and the detection of homologous transmembrane proteins. Bioinformatics. 17:182−189.
- M5. Murzin A.G., Brenner S.E., Hubbard Т., Chothia C. 1995. SCOP: a structural classification of proteins database for the investigation of sequences and structures. J. Mol. Biol. 247:536 540.
- N1. Needleman S.B., Wunsch C.D. 1970. A general method applicable to the search for similarities in the amino acid sequence of two proteins. J Mol Biol. 48(3):443−453.
- N2. Nakai K, Horton P. 1999. PSORT: a program for detecting sorting signals in proteins and predicting their subcellular localization. Trends Biochem Sci. 24(l):34−36.
- N3. Ng P.C., Henikoff J.G., Henikoff S. 2000. PHAT: a transmembrane-specific substitution matrix. Predicted hydrophobic and transmembrane. Bioinformatics. 16(9):760−766.
- N4. Nilsson J, Persson B, von Heijne G. 2000. Consensus prediction of membrane protein topology. FEBS Letters. 267−269.
- Overbeek R., Larsen N., Pusch G.D., D’Souza M., Selkov E. Jr, Kyrpides N., Fonstein M., Maltsev N., Selkov E. 2000. WIT: integrated system for high-throughput genome sequence analysis and metabolic reconstruction. Nucleic Acids Res. 28(1):123−125.
- Okada Т., Palczewski K. Crystal structure of rhodopsin: implications for vision and beyond. Curr. Opin. Struct. Biol. 2001−11:420−426.
- Orengo С.A., Michie A.D., Jones S., Jones D.T., Swindells M.B., Thornton J.M. 1997. CATH a hierarchic classification of protein domain structures. Structure. 5:1093−1108.
- PI. Persson В., Argos P. 1997. Prediction of membrane protein topology utilizing multiple sequence alignments. J Protein Chem. 16(5):453−457.
- P3. Persson В., Argos P. 1994. Prediction of transmembrane segments in proteins utilising multiple sequence alignments. J Mol Biol. 237(2):182−192.
- P4. Paulsen I.T., Sliwinski M.K., Saier M.H. Jr. 1998. Microbial genome analyses: global comparisons of transport capabilities based on phylogenies, bioenergetics and substrate specificities. Mol. Biol. 277:573−592.
- P5. Parodi L.A., Granatir C.A., Maggiora G.M. 1994. A consensus procedure for predicting the location of alpha-helical transmembrane segments in proteins. Comput Appl Biosci. 10:527 535.
- P6. Park J.H., Saier M.H. Jr. 1996. Phylogenetic characterization of the MIP family of transmembrane channel proteins. J Membr Biol. 153(3): 171−180.
- Rl. Rost В., Fariselli P., Casadio R. 1996. Topology prediction for helical transmembrane proteins at 86% accuracy. Protein Sci. 5(8): 1704−1718.
- R2. Ren Q., Kang K.H., Paulsen I.T. 2004. TransportDB: a relational database of cellular membrane transport systems. Nucleic Acids Res. 32:284−288.
- R3. Royant A., Nollert P., Edman K., Neutze R., Landau E.M., Pebay-Peyroula E., Navarro J. 2001. X-ray structure of sensory rhodopsin II at 2.1-A resolution. Proc. Natl. Acad. Sci USA. 98:10 131−10 136.
- R4. Rost В., Liu J. 2003. The PredictProtein server. Nucleic Acids Res. 31:3300−3304.
- Saier M.H. Jr. 2000. A functional-phylogenetic classification system for transmembrane solute transporters. Microbiol. Mol. Biol. Rev. 64(2):354−411.
- Saier M.H. Jr, Tran C.V., Barabote R.D. 2006. TCDB: the Transporter Classification Database for membrane transport protein analyses and information. Nucleic Acids Res. 34:181−186.
- Smith T.F., Waterman M.S. 1981. Identification of common molecular subsequences. J Mol Biol. 147(1):195−197.
- Saier MH Jr. 1999. A functional-phylogenetic system for the classification of transport proteins. Cell Biochem. 32:84−94.
- Sutormin R.A., Rakhmaninova A.B., Gelfand M.S. 2003. BATMAS30: amino acid substitution matrix for alignment of bacterial transporters. Proteins. 51:85−95.
- Saitou N., Nei M. 1987. The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol. Biol. Evol. 4:406−425.
- Sonnhammer E.L., von Heijne G., Krogh A. 1998. A hidden Markov model for predicting transmembrane helices in protein sequences. Proc. Int. Conf. Intell. Syst. Mol. Biol. 6:175 182.
- Sutormin R.A., Mironov A.A. 2006. Membrane probability profile construction based on amino acids sequences multiple alignment. Mol Biol (Mosk). 40(3):541−545.
- Tl. Tusnady G.E., Simon 1.1998. Principles governing amino acid composition of integral membrane proteins: Application to topology prediction. J. Mol. Biol. 283:489−506.
- T2. Tusnady G.E., Dosztanyi Z., Simon 1.2005. PDBTM: selection and membrane localization of transmembrane proteins in the protein data bank. Nucleic Acids Res. 33:275−278.
- T3. Tusnady G.E., Dosztanyi Z., Simon I. 2005. TMDET: web server for detecting transmembrane regions of proteins by using their 3D coordinates. Bioinformatics. 21(7):1276−1277.
- Т4. Thompson J.D., Higgins D.G., Gibson T.J. 1994. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22:4673−4680.
- T5. Tudos E., Cserzo M., Simon I. 1990. Predicting isomorphic residue replacements for protein design. Int J Pept Protein Res. 36:236−239.
- VI. Viklund H., Elofsson A. 2004. Best alpha-helical transmembrane protein topologypredictions are achieved using hidden Markov models and evolutionary information. Protein Sci. 13(7):1908−1917.
- Wl. White S.H. 2004. The progress of membrane protein structure determination. Protein Sci. 13(7):1948−1949
- W2. Watson J.D., Laskowski R.A., Thornton J.M. 2005. Predicting protein function from sequence and structural data. Curr Opin Struct Biol. 15(3):275−284.
- Zl. Zhou C., Zheng Y., Zhou Y. 2004. Structure prediction of membrane proteins. Genomics Proteomics Bioinformatics. 2(l):l-5.
- Z2. Zhou H, Zhou Y. 2003. Predicting the topology of transmembrane helical proteins using mean burial propensity and a hidden-Markov-model-based method. Protein Sci. 12(7): 15 471 555.
- Z3. Zhang H., Cramer W.A. 2005. Problems in Obtaining Diffraction-quality Crystals of Hetero-oligomeric Integral Membrane Proteins. J. Struct. Funct. Genomics. 6:219−223.
- Z4. Zuckerkandl E., Pauling L. 1965. Molecules as documents of evolutionary history. J. Theor. Biol. 8:357−366.