ΠΠΎΠ²ΡΡΠ΅Π½ΠΈΠ΅ ΡΠ΅Π»Π΅Π²Π°Π½ΡΠ½ΠΎΡΡΠΈ ΠΏΠ΅ΡΠΈΠΎΠ΄ΠΈΡΠ΅ΡΠΊΠΎΠ³ΠΎ ΡΠ΅ΠΌΠ°ΡΠΈΡΠ΅ΡΠΊΠΎΠ³ΠΎ ΠΏΠΎΠΈΡΠΊΠ° ΠΈΠ½ΡΠΎΡΠΌΠ°ΡΠΈΠΈ Π² Web
ΠΠΈΡΡΠ΅ΡΡΠ°ΡΠΈΡ
Π ΠΎΠ±Π»Π°ΡΡΠΈ ΠΏΠΎΠΈΡΠΊΠ° ΠΈΠ½ΡΠΎΡΠΌΠ°ΡΠΈΠΈ ΠΈΡΡΠΎΡΠΈΡΠ΅ΡΠΊΠΈ Π²ΡΠ΄Π΅Π»ΡΡΡΡΡ Π΄Π²Π° ΡΠΈΠ»ΡΠ½ΠΎ ΡΠ²ΡΠ·Π°Π½Π½ΡΡ ΡΠΈΠΏΠ° Π·Π°Π΄Π°Ρ: ΠΈΠ½ΡΠΎΡΠΌΠ°ΡΠΈΠΎΠ½Π½ΠΎΠ³ΠΎ ΠΏΠΎΠΈΡΠΊΠ° (information retrieval) ΠΈ ΡΠΈΠ»ΡΡΡΠ°ΡΠΈΠΈ ΠΈΠ½ΡΠΎΡΠΌΠ°ΡΠΈΠΈ (information filtering). Π‘ΠΈΡΡΠ΅ΠΌΡ ΠΈΠ½ΡΠΎΡΠΌΠ°ΡΠΈΠΎΠ½Π½ΠΎΠ³ΠΎ ΠΏΠΎΠΈΡΠΊΠ° ΠΏΡΠΈΠΌΠ΅Π½ΡΡΡΡΡ Π² ΡΡΠ»ΠΎΠ²ΠΈΡΡ Π²ΡΡΠΎΠΊΠΎΠΉ ΠΈΠ·ΠΌΠ΅Π½ΡΠ΅ΠΌΠΎΡΡΠΈ ΠΈΠ½ΡΠΎΡΠΌΠ°ΡΠΈΠΎΠ½Π½ΠΎΠΉ ΠΏΠΎΡΡΠ΅Π±Π½ΠΎΡΡΠΈ ΠΏΠΎΠ»ΡΠ·ΠΎΠ²Π°ΡΠ΅Π»Π΅ΠΉ ΠΈ ΠΎΡΠ½ΠΎΡΠΈΡΠ΅Π»ΡΠ½ΠΎΠΉ ΡΡΠ°ΡΠΈΡΠ½ΠΎΡΡΠΈ ΠΈΡΠΏΠΎΠ»ΡΠ·ΡΠ΅ΠΌΠΎΠ³ΠΎ Ρ ΡΠ°Π½ΠΈΠ»ΠΈΡΠ° ΠΈΠ½ΡΠΎΡΠΌΠ°ΡΠΈΠΈ. Π‘ΠΈΡΡΠ΅ΠΌΡ ΡΠΈΠ»ΡΡΡΠ°ΡΠΈΠΈ ΠΈΠ½ΡΠΎΡΠΌΠ°ΡΠΈΠΈ Π½Π°ΠΏΡΠΎΡΠΈΠ²… Π§ΠΈΡΠ°ΡΡ Π΅ΡΡ >
Π‘ΠΏΠΈΡΠΎΠΊ Π»ΠΈΡΠ΅ΡΠ°ΡΡΡΡ
- ΠΠ³Π΅Π΅Π² Π. Π‘. ΠΠ΅ΡΠΎΠ΄Ρ Π°Π²ΡΠΎΠΌΠ°ΡΠΈΡΠ΅ΡΠΊΠΎΠΉ ΡΡΠ±ΡΠΈΠΊΠ°ΡΠΈΠΈ ΡΠ΅ΠΊΡΡΠΎΠ², ΠΎΡΠ½ΠΎΠ²Π°Π½Π½ΡΠ΅ Π½Π° ΠΌΠ°ΡΠΈΠ½Π½ΠΎΠΌ ΠΎΠ±ΡΡΠ΅Π½ΠΈΠΈ ΠΈ Π·Π½Π°Π½ΠΈΡΡ ΡΠΊΡΠΏΠ΅ΡΡΠΎΠ². ΠΠΈΡ. ΠΊΠ°Π½Π΄. ΡΠΈΠ·-ΠΌΠ°Ρ. Π½Π°ΡΠΊ: 05.13.11. ΠΠΎΡΠΊΠΎΠ²ΡΠΊΠΈΠΉ Π³ΠΎΡ. ΡΠ½ΠΈΠ². ΠΠΎΡΠΊΠ²Π°, 2005.
- ΠΠ°ΠΉΠ½ΡΠ²Π°ΠΉΠ³ Π.Π. ΠΠ»Π³ΠΎΡΠΈΡΠΌ ΠΎΠ±ΡΡΠ΅Π½ΠΈΡ ΡΠ°ΡΠΏΠΎΠ·Π½Π°Π²Π°Π½ΠΈΡ ΠΎΠ±ΡΠ°Π·ΠΎΠ² «ΠΠΎΡΠ°» // ΠΠ»Π³ΠΎΡΠΈΡΠΌΡ ΠΎΠ±ΡΡΠ΅Π½ΠΈΡ ΡΠ°ΡΠΏΠΎΠ·Π½Π°Π²Π°Π½ΠΈΡ ΠΎΠ±ΡΠ°Π·ΠΎΠ² / ΠΠΎΠ΄ ΡΠ΅Π΄. Π. Π. ΠΠ°ΠΏΠ½ΠΈΠΊΠ°. — Π.: Π‘ΠΎΠ². ΡΠ°Π΄ΠΈΠΎ, 1973, ΡΡΡ. 110−116.
- ΠΠ°ΠΏΠ½ΠΈΠΊ Π.Π. ΠΠΎΡΡΡΠ°Π½ΠΎΠ²Π»Π΅Π½ΠΈΠ΅ Π·Π°Π²ΠΈΡΠΈΠΌΠΎΡΡΠ΅ΠΉ ΠΏΠΎ ΡΠΌΠΏΠΈΡΠΈΡΠ΅ΡΠΊΠΈΠΌ Π΄Π°Π½Π½ΡΠΌ. Π.: ΠΠ°ΡΠΊΠ°, 1979.
- ΠΠ°Π»ΠΈΠ·Π½ΡΠΊ Π. ΠΡΠ°ΠΌΠΌΠ°ΡΠΈΡΠ΅ΡΠΊΠΈΠΉ ΡΠ»ΠΎΠ²Π°ΡΡ ΡΡΡΡΠΊΠΎΠ³ΠΎ ΡΠ·ΡΠΊΠ°. Π ΡΡΡΠΊΠΈΠΉ ΡΠ·ΡΠΊ, ΠΠΎΡΠΊΠ²Π°, 1980
- ΠΡΡΠ°Π»Π΅Π½ΠΎΠΊ Π., ΠΠ΅ΠΊΡΠ΅ΡΡΡΡΠ½ΠΎΠ² Π. ΠΡΠ΅Π½ΠΊΠ° ΡΠΈΡΡΠ΅ΠΌ ΡΠ΅ΠΊΡΡΠΎΠ²ΠΎΠ³ΠΎ ΠΏΠΎΠΈΡΠΊΠ°// ΠΡΠΎΠ³ΡΠ°ΠΌΠΌΠΈΡΠΎΠ²Π°Π½ΠΈΠ΅. — 28(4), 2002, ΡΡΡ. 226−242 ΠΠΎΡΠΆΠΎΠ² Π. ΠΡΠΎΠ³ΡΠ°ΠΌΠΌΡ Π»ΠΎΠΊΠ°Π»ΡΠ½ΠΎΠ³ΠΎ ΠΏΠΎΠΈΡΠΊΠ°// ΠΡΡΠ½Π°Π» «ΠΡΠΊΡΡΡΡΠ΅ ΡΠΈΡΡΠ΅ΠΌΡ» #11, 2005
- ΠΠ°ΠΊΡΠ°ΠΊΠΎΠ² A.B. ΠΡΡΠ»Π΅Π΄ΠΎΠ²Π°Π½ΠΈΠ΅ ΡΠΏΠΎΡΠΎΠ±ΠΎΠ² ΡΠΌΠ΅Π½ΡΡΠ΅Π½ΠΈΡ Π½Π°Π±ΠΎΡΠ° Ρ Π°ΡΠ°ΠΊΡΠ΅ΡΠΈΡΡΠΈΠΊ Π² Π°Π»Π³ΠΎΡΠΈΡΠΌΠ°Ρ ΠΊΠ»Π°ΡΡΠΈΡΠΈΠΊΠ°ΡΠΈΠΈ ΡΠ΅ΠΊΡΡΠΎΠ²// Π’ΡΡΠ΄Ρ ΠΡΠ΅ΡΠΎΡΡΠΈΠΉΡΠΊΠΎΠΉ Π½Π°ΡΡΠ½ΠΎΠΉ ΠΊΠΎΠ½ΡΠ΅ΡΠ΅Π½ΡΠΈΠΈ «ΠΠ΅ΡΠΎΠ΄Ρ ΠΈ ΡΡΠ΅Π΄ΡΡΠ²Π° ΠΎΠ±ΡΠ°Π±ΠΎΡΠΊΠΈ ΠΈΠ½ΡΠΎΡΠΌΠ°ΡΠΈΠΈ» -Π.: ΠΠ·Π΄Π°ΡΠ΅Π»ΡΡΠΊΠΈΠΉ ΠΎΡΠ΄Π΅Π» ΡΠ°ΠΊΡΠ»ΡΡΠ΅ΡΠ° ΠΠΠΈΠ ΠΠΠ£, 2003, ΡΡΡ. 234−240.
- ΠΠ°ΠΊΡΠ°ΠΊΠΎΠ² A.B. ΠΠ°ΡΡΡΠ°Π±ΠΈΡΡΠ΅ΠΌΡΠ΅ Π°Π»Π³ΠΎΡΠΈΡΠΌΡ ΠΊΠ»Π°ΡΡΠΈΡΠΈΠΊΠ°ΡΠΈΠΈ ΡΠ΅ΠΊΡΡΠΎΠ²// Π’ΡΡΠ΄Ρ 12-ΠΉ ΠΊΠΎΠ½ΡΠ΅ΡΠ΅Π½ΡΠΈΠΈ «ΠΠ°ΡΠ΅ΠΌΠ°ΡΠΈΡΠ΅ΡΠΊΠΈΠ΅ ΠΌΠ΅ΡΠΎΠ΄Ρ ΡΠ°ΡΠΏΠΎΠ·Π½Π°Π²Π°Π½ΠΈΡ ΠΎΠ±ΡΠ°Π·ΠΎΠ²» (ΠΠΠ Π-12), ΠΠΎΡΠΊΠ²Π°, 2005.106
- ΠΠ°ΠΊΡΠ°ΠΊΠΎΠ² A.B. ΠΠ±Π΅ΡΠΏΠ΅ΡΠ΅Π½ΠΈΠ΅ ΠΊΠΎΠ½ΡΠ΅ΠΊΡΡΠ½ΠΎΠ³ΠΎ ΠΏΠΎΠΈΡΠΊΠ° ΠΈΠ½ΡΠΎΡΠΌΠ°ΡΠΈΠΈ Π΄Π»Ρ Π±Π°Π· Π·Π½Π°Π½ΠΈΠΉ// ΠΡΠΊΡΡΡΡΠ²Π΅Π½Π½ΡΠΉ ΠΈΠ½ΡΠ΅Π»Π»Π΅ΠΊΡ (ΠΠΎΠ½Π΅ΡΠΊ), 2002 № 2, ΡΡΡ. 493−500
- Π’ΡΡΠ΄Ρ Π ΠΠΠΠΠ2003// ΠΠΠ Π₯ΠΈΠΌΠΈΠΈ Π‘ΠΠ±ΠΠ£ / ΠΠΎΠ΄ ΡΠ΅Π΄. Π. Π‘. ΠΠ΅ΠΊΡΠ΅ΡΡΡΡΠ½ΠΎΠ²Π° — Π‘Π°Π½ΠΊΡ-ΠΠ΅ΡΠ΅ΡΠ±ΡΡΠ³, 2003 — 132 ΡΡΡ.
- Baeza-Yates R., Ribeiro-Beto Π. Modern Information retrieval. New York: ACM Press, 1999.
- Baker L., McCallum A. Distributional Clustering of Words for text Classification// Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York: ACM Press, 1998. p. 96−103
- Barfourosh A., Nezhad H., Anderson M., Perils D. Information Retrieval on the World Wide Web and Active Logic: A Survey and Problem Definition// Technical report CS-TR-429. College Park: University of Maryland, 2002. p. 1−45.
- Baudisch P., Dynamic Information Filtering. GMD Research series 2001 No 16, Darmstadt Technology University, 2001, Germany
- Belkin N., Croft W. Information filtering and information retrieval: two sides of the same coin?// Communications of the ACM, Volume 35, Issue 12. New York: ACM Press, 1992. p. 29 38
- Bergman M. Deep Web: Surfacing Hidden Value// The Journal of Electronic Publishing 7(1), 2001.
- Binkley J., Young L. Rama: An Architecture for internet information filtering// Journal of Intelligent information systems #5. Hingham: Kluwer academic publishers, 1996. p.81−99.
- Brake D. Lost in Cyberspace // New Scientist. London: Red Business, 1997. p. 12−13.
- Brin S., Page L. The anatomy of a large-scale hypertextual Web search engine// Computer Networks 30(1−7). London: Elsevier, 1998. p. 107−117.
- Bun K., Ishizuka M. Emerging Topic Tracking System// Proceedings of Web Intelligence Conference. London: SpringerVerlag, 2001. p. 125−130.
- Chakrabarti S. Data Mining for hypertext: A tutorial survey// SIGKDD Explorations, vol.1, issue 2. New York: ACM Press, 2000. p. 1−10
- Chakrabarti S. Mining The Web Discovering Knowledge From Hypertext Data. San Francisco: Morgan Kaufmann Publishers, 2004
- Chakrabarti S., Berg M., Dom B. Focused Crawling: A New
- Approach to Topic-Specific Web Resource Discovery// In Proc. ofththe 8 conference on WWW. New York: Elseiver North Holland, 1999. p. 1623−1640.
- Chen H., Dumais S. Bringing Order to the Web: Automatically Categorizing Search // Proceedings of ACM SIGCHI Conference on Human Factors in Computing Systems, Vol. 1. New York: ACM Press, 2000. p. 145−152
- Cohen W., Fast Effective Rule Induction// Proceeding of 20th International Conference on Machine Learning. Tahoe: Morgan Kaufmann Publishers, 1995. p. 115−123.
- Diao Y., Lu H., Wu D. A comparative study of classification-based personal E-Mail filtering// In proceedings of 4th Pacific-Asia Conference on Knowledge Discovery and Data Mining. Kyoto: Springer Verlag, 2000. p.408−419.
- Douglis F., Ball T., Chen Y., Koutsofos E. The At&T Internet Difference Engine: Tracking and Viewing changes on the Web// World Wide Web #1: 1998. p. 27−44.
- Driori O., Aron N. Using documents classification for displaying search results list// Journal of Information Science, 29, vol. 2. London: Chartered Institute of Library and Information Professionals, 2003. p. 97−106.
- Dumais S., Cutrell E., Chen H. Optimizing search by showing results in contextII CHI '01: Proceedings of the SIGCHI conferenceon Human factors in computing systems. New York: ACM Press, 2001. p. 277−284.
- Dumais S., Platt J., Heckerman D., Sahami M. Inductive learning algorithms and representations for text categorization. // In Proceedings of International Conference on Information and Knowledge Management. New York: ACM Press, 1998. p. 148 155.
- Eichler K. Automatic Classification of Swedish Email Messages. Master thesis, Eberhard-Karls-University, Sweden, 2005. http://www.sfs.uni-tuebingen.de/iscl/Theses/eichler.pdf
- Filman R. Searching the Internet// IEEE Internet Computing, July 1998. p. 21−23
- Fisher R. The use of multiple measurements in taxonomic problems// Eugenics, 7: 1936. p. 179−188
- Frakes W., Baeza-Yates R. Information retrieval: Data structures and algorithms. Prentis Hall, Upper Saddle River, NJ, USA: 1992.
- Francopoulo G. Experiments with Chunker and Lucene// Advances in Cross-Language Information Retrieval, Third Workshop of the Cross-Language Evaluation Forum, CLEF 2002. Heidelberg: Springer, 2002. p. 336−337.
- Furnkranz J. A study using n-gram features for text categorization// Technical report OEFAI-TR-98−30. Vienna: Austrian Institute for Artificial Intelligence, 1998. p. 1−10.
- Gulli A., Signorini A. The Indexable Web is more than 11.5 billion pages// Special interest tracks and posters of the 14th international conference on World Wide Web. New York: ACM Press, 2005. p. 902−903.
- Haykin S. Neural Networks: A Comprehensive Foundation. New York: Macmillan College Publishing, 1994
- Hersh W. OHSUMED: An Interactive Retrieval Evaluation and New Large Test Collection for Research// Proceedings of the 17th Annual International Conference on Research and Development in Information Retrieval/ New York: Springer-Verlag, 1994. p. 192 201.
- Hofmann T. Probabilistic latent semantic indexing// In Proc. of the SIGIR'99. NY: ACM Press, 1999. p. 50−57. Ingwersen P. Information retrieval interaction. London: Taylor Graham Publishing: 1992.
- Joachims. A Probabilistic Analysis of the Rocchio algorithm with TFIDF for text categorization// Proc. of Int. Conf. on Machine Learning (ICML). San Francisco: Morgan Kaufmann Publishers, 1997. p. 143−151.
- Joachims T. Estimating the Generalization Performance of a SVM Efficiently. // Proceedings of the International Conference on Machine Learning. San Francisco: Morgan Kaufman, 2000. p. 431 438.
- Joachims T. Making large-scale SVM learning practical// Advances in kernel methods: Support vector learning. Cambridge: MIT-Press, 1999. p. 169−184.
- Joachims T. Text Categorization with Support Vector Machines: Learning with Many Relevant Features. // Proceedings of ECML-98, 10th European Conference on Machine Learning. Heidelberg: Springer, 1998. p. 137−142.
- Juan, A., Ney, H.: Reversing and Smoothing the Multinomial Naive Bayes Text Classifier. In: Proc. of the 2nd Int. Workshop on Pattern Recognition in Information Systems (PRIS 2002). Alacant (Spain): 2002. p. 200−212
- Kahle B. Preserving the Internet// Scientific American: March 1997. p. 82−83
- Khare R., Cutting D., Sitaker K., Rifkin A. Nutch: A Flexible and Scalable Open-Source Web Search Engine// CommerceNet Labs Technical Report #04−04. May 10, 2005. http://www.master.netseven.it/files/262-Nutch.pdf
- Koch T., Ardo A., Bremmer A., Lundberg S. The building and maintenance of robot based internet search services: A review of current indexing and data collection methods// Technical report, Lund University Library, Sweden, 1996
- Kobayashi M., Takeda K. Information retrieval on the Web// ACM Computing Surveys, vol.32, 2. New York: ACM Press, 2000. p. 144−173.
- Kolesnikov O., Lee W., Lipton R. Filtering spam using search engines//Georgia Tech Technical Report, GIT-CC-03−58, 2003. ftp://ftp.cc.gatceh.edu/pub/coc/tech reports/GIT-CC-03−58.pdf
- Kullback S., Leibler R. On Information and Sufficiency// The Annals of Mathematical Statistics, Vol. 22, No. 1 (Mar., 1951). p. 79−86
- Kwon J., Rao P., Moon B., Lee S. FiST: scalable XML document filtering by sequencing twig patterns// Proceedings of the 31st international conference on Very large data bases. New York: ACM Press, 2005. p. 217−228
- Lang K. Newsweeder: Learning to filter netnews// Proceedings oftfithe 12 International Conference on Machine Learning. San Mateo: Morgan Kaufmann, 1995. p. 331−339.
- Lawrence S., Giles C. Accessibility of information on the Web// Nature, 400 (July 8, 1999), p. 107−109
- Lawrence S, Giles C. Context and page analysis for improved web search// IEEE Internet computing, vol.2, issue 4, 1999. p. 38−46
- Lawrence S., Giles C. Inquirus, The {NECI} Meta Search Engine// Proc. Of 7th International World Wide Web Conference. Brisbane: Elsevier Science, 1998. p. 95−105.
- Lawrence S., Giles C. Searching the World Wide Web// Science, 280(5360), 1998. p. 98−100.
- Lewandowski D., Wahlig H., Gunnar M. The freshness of Web search engines' database// Journal of Information Science vol. 32, issue 2,2006. p. 131−146.
- Lewis D. An Evaluation of Phrasal and Clustered Representation on a Text Categorization Task// Proceedings of International ACM SIGIR Conference on Research and Development in Information Retrieval. New York: ACM Press, 1992. p. 37−50.
- Lewis D. Representation and learning in information retrieval. Dissertation, Dept. Of Computer and Information Science, Univ. of Massachusetts, 1992
- Lewis D. Reuters-21 578 text categorization test collection. Distribution 1.0. http://www.daviddlewis.com/resources/testcollections/reuters21578 /readme.txt
- Liao C., Alpha S., Dixon P. Feature preparation in text categorization// Proc. of Australian Data Mining Workshop. Sydney: University of Technology, 2003. p. 23−34
- Liu L. Query routing in large-scale digital library systems// In proc. th
- Of the 15 conference on Data Engineering. IEEE press, 1999. p. 154−163.
- Mackassy S. New Techniques in Intellegent Information Filtering. Ph.D. dissertation thesis, New Brunswick, New Jersey, 2003.
- Marchionini G. Information Seeking in Electronic Environments. Cambridge series on human-computer interactions, 9, Cambridge University Press, 1995.
- Marshall R. Generation of Boolean classification rules. // Proceedings of Computational Statistics 2000. Heidelberg: Springer-Verlag, 2000. p. 355- 360.
- Mladenic D. Turning Yahoo to Automatic Web-Page Classifier// Proceedings of the 13 European Conference on Artificial Intelligence (ECAI'98) Brighton, UK: ECCAI Press, 1998. p. 473 474
- Mostafa J. A multilevel approach to intelligent information filtering: model, system and evaluation// ACM transactions on information systems. New York: ACM Press, 1997. p. 368−399.
- Menczer F., Belew R. Adaptive retrieval agents: Internalizing local context and scaling up to the Web// Machine learning, vol. 39, issue 2−3. Boston: Kluwer academic publishers, 2000. p. 203−242.
- Nigam K., McCallum A., Thrun S., and Mitchell T. Learning to classify text from labeled and unlabeled documents// Proc. of the 15th National Conf. on Artificial Intelligence. Menlo Park: AAAI Press, 1998. p. 729−799.
- Ntoulas A., Cho J., Olston C. What’s new on the Web? Theevolution of the Web from a Search Engine perpective// Inthproceedings of the 13 International World Wide Web Conference. New York: ACM Press, 2004. p. 1−12.
- Oard D. The State of the Art in Text Filtering// User Modeling and User-Adapted Interaction, Volume 7, Issue 3. Hingham: Kluwer Academic Publishers, 1997. p. 147−178
- Osinski S., Weiss, D. Carrot2: Design of a flexible and efficient Web information retrieval framework// In Proceedings A WIC 2005. Heidelberhg: Springer, 2005. p. 439−444.
- Pazzani M., Nguyen L., Mantik S. Learning from hotlists and coldlists: Towards a WWW information filtering and seekingagentII 7th International Conference on Tools with Artificial Intelligence. Washington: IEEE Computer Society, 1995. p. 492.
- Provost J. Naive-Bayes vs. rule-learning in classification of Email// Technical Report AI-TR-99−284. Austin: The University of Texas, Department of Computer Sciences, 1999. p. 1−4.
- Porter M. (1980). An algorithm for suffix stripping// Program, 14(3), p. 130−137.
- Quinlan R. C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, 1993
- Rainie L. Search Engine use November 2005. PDF] http://www.pewinternet.org/pdfs/PIP SearchData 1105. pdf96. van Rijsbergen C. Information Retrieval. Butterworth’s and Co. — London, 1979 — 2nd edition.
- Sahami M., Dumais S., Heckerman D., Horvitz E. A Bayesian Approach to Filtering Junk E-Mail// In proceedings of the AAAI'98 Workshop on Learning for Text Categorization, p. 55−62.
- Salton G., Wong A., and Yang C. A vector space model for automatic indexingII Communications of the ACM, 18(11), 1988. p. 613−620.
- Salton G., Buckley C. Term-weighting approaches in automatic text retrieval// Information Processing and Management, 24(5), 1988. p. 513−523.
- Salton G., McGill M. Introduction to modern Information Retrieval. McGraw-Hill Computer Science Series. McGraw-Hill, New York, 1983
- Sarawagi S., Nagaralu S. Data mining models as services on the internet// ACM SIGKDD explorations newletter, vol. 2, issue 1. New York: ACM Press, 2000. p. 24−28
- Sebastiani F., Machine Learning in Automated Text Categorization//ACM Computing Surveys, vol.1,2002. p. 1−47 115
- Sebastiani F. Text Categorization// Text Mining and its Applications to Intelligence, CRM and Knowledge Management. Southampton: WIT Press, 2005. p. 109−129.
- Sheth B. A Learning Approach to Personalized Information Filtering// Master thesis, Department of Electrical Engineering and Computer Science, 1994. 75 p.
- Temperley D, Lafferty J., Sleator D. 1995. Link Grammar Parser. http://www.link.cs.cmu.edu/link
- Twidale M., Nichols D., Smith G., Trevor J. Supporting collaborative learning during information searching// In proc. of 1st international conf. on collaborative learning. Mahwah: Lawrence Erlbaum associates, 1995. p. 367−370.
- The Twelfth Text Retrieval Conference (TREC 2003). Appendix 1. Common Evaluation Measures. http://trec.nist.gov/pubs/trecl2/appendices/measures.ps
- Vapnik V. The Nature of Statistical Learning Theory. SpringerVerlag, New York, 1995.
- Voorhees E., Harman D. Overview of the seventh Text Retrieval Conference TREC 7// In Proceedings of the seventh Text Retrieval Conference TREC 7. Gaithersburg: NIST, 1998.
- Wasson M. Classification Technology at LexisNexis // SIGIR 2001 Workshop on Operational Text Classification.
- Wiener E., Pedersen, J., Weigend, A. A neural network approach to topic spotting// In Proceedings of SDAIR-95, 4th Annual Symposium on Document Analysis and Information Retrieval. Las Vegas, 1995. p. 317−332.
- Wong S., Yao Y. An information-theoretic measure of term specificity// Journal of the American Society for Information Science, 43(1): p. 45−61, 1992.
- Yan T., Garcia-Molina H. SIFT A tool for Wide-Area Information Dissemination// In proceedings of USENIX Technical conference. Berkley: USENIX association, 1995. p. 177−186.
- Yang Y., Liu X. A re-examination of text categorization methods// Proc. of International ACM Conf. on Research and Development in Information Retrieval (SIGIR-99). New York: ACM Press, 1999. p. 42−49.
- Yang Y., Pedersen J. A comparative study on feature selection in text categorization. // In: Proc. of ICML-97, 14th International Conf. On machine Learning. San Francisco: Morgan Kaufmann publishers, 1997. p. 412−420.
- Zipf G. Human behaviour and the principle of least effort. Addison Wesley, 1949.