Solving the Problem of Topical Information Retrieval on the Runet
Dissertation
The properties of the problem described above make it impossible to directly apply existing results from traditional information retrieval (IR) to topical IR on the Internet. As a result, new specialized methods of topical information retrieval on the Internet are needed, ones that take the specifics of the problem into account and provide greater search effectiveness than existing…
References
- Aho A., Ullman J., The Theory of Parsing, Translation, and Compiling, Moscow, Mir, 1978.
- Kryukov D.V. The Turtle Search Engine: Physiology and Anatomy. http://www.turtle.ru/db/architecture/
- Malkovsky M.G., Gratsianova T.Yu., Polyakova I.N., Applied Software: Automatic Text Processing Systems, textbook for students of the Faculty of Computational Mathematics and Cybernetics (VMK), Moscow State University, Moscow, MSU, 2000
- Meadow C., Analysis of Information Retrieval Systems, Moscow, Mir, 1970
- Nekrestyanov I. Topic-Oriented Methods of Information Retrieval. Dissertation. Saint Petersburg, 2000
- Ridings C. PageRank Explained. http://digits.ru/articles/promotion/pagerank.html. (transl. Sadovsky A.)
- Rosseeva O., Zagorulko Yu. Organizing Effective Search Based on Ontologies. Proceedings of the International Workshop Dialog'2001.
- Salton G., Dynamic Library and Information Systems, Moscow, Mir, 1979.
- Salton G., Automatic Processing, Storage, and Retrieval of Information, Moscow, Sovetskoe Radio, 1973
- Cherny A.I., Introduction to the Theory of Information Retrieval, Moscow, Nauka, 1975
- Henzinger M. Hyperlink Analysis on the Web. Otkrytye Sistemy (Open Systems), 2001, N10, http://www.osp.ru/2001/10/050.htm.
- Bates M., The design of browsing and berrypicking techniques for the online search interface. Online Review 13(5), 1989
- Belkin N., Cool C., Stein A., Thiel U. Cases, Scripts, and Information-Seeking Strategies: On the Design of Interactive Information Retrieval Systems, Expert Systems with Applications, 9(3): 379−395, 1994
- Bergman K., The Deep Web: Surfacing Hidden Value, BrightPlanet.com LLC, http://www.completeplanet.com/Tutorials/DeepWeb/index.asp
- Bharat K. SearchPad: Explicit Capture of Search Context to Support Web Search, Proceedings of the WWW9 Conference.
- Davison B. Topical locality in the Web. Proceedings of the ACM SIGIR'2000 Conference, 2000.
- Etzioni O., The World Wide Web: quagmire or gold mine?, Communications of the ACM, November 1996.
- O’Day V., Jeffries R. Orienteering in an Information Landscape: How Information Seekers Get From Here to There. Proceedings of INTERCHI '93, 1993.
- Selberg E., Etzioni O., The MetaCrawler Architecture for Resource Aggregation on the Web, http://www.cs.washington.edu/research/metacrawler. 1998
- Inktomi Corp., Web Surpasses One Billion Documents, press release issued January 18, 2000, http://www.inktomi.com/new/press/billion.html
- Mladenic D. Turning Yahoo Into an Automatic Web-Page Classifier. European Conference on Artificial Intelligence, 1998
- Gruber T. Towards Principles for the Design of Ontologies Used for Knowledge Sharing. International Workshop on Formal Ontology, 1993.
- Koenemann J. Supporting Interactive Information Retrieval Through Relevance Feedback, Proceedings of ACM CHI'96 Conference.
- Pirolli P., Pitkow J., Rao R., Silk from a Sow’s Ear: Extracting Usable Structures from the Web, Proc. ACM Conf. Human Factors in Computing Systems, CHI'96, 1996
- Flake G., Lawrence S., Giles C., Coetzee F. Self-Organization of the Web and Identification of Communities. IEEE Computer, 35(3), pp. 66−71, 2002.
- Lawrence S., Bollacker K., Giles C., Digital Libraries and Autonomous Citation Indexing, IEEE Computer, pp. 67−71, June 1999
- Lawrence S., Bollacker K., Giles C., Indexing and Retrieval of Scientific Literature, Proceedings of CIKM 1999 Conference, pp. 139−146
- Lawrence S., Giles C., Accessibility of Information on the Web, Nature, vol. 400, pp. 107−109, 1999
- Lawrence S., Giles C., Context and page analysis for improved web search, IEEE Internet Computing, July 1998, pp. 38−46
- Lawrence S., Giles C., Inquirus: The NECI Search Software, http://www.neci.nj.nec.com/homepages/lawrence/inquirus.html
- Lawrence S., Giles C., Searching the Web: General and Scientific Information Access, IEEE Communications Magazine, January 1999, pp. 116−122
- Bollacker K., Lawrence S., Giles C., CiteSeer: An Autonomous Web Agent for Automatic Retrieval and Identification of Interesting Publications, Proceedings of the Second International Conference on Autonomous Agents, ACM Press, 1998, pp. 116−123
- Glover E., Lawrence S., Birmingham W., Giles C., Architecture of a Metasearch Engine that Supports User Information Needs, Proceedings of CIKM-99 Conference, pp. 210−216, ACM, 1999
- Research Index Computer Science Directory, http://citeseer.nj.nec.com/directory.html
- Rijsbergen C., Information Retrieval, London, Butterworths, 1979
- Salton G., Historical Note: The Past Thirty Years in Information Retrieval, Cornell University Technical Report 87−827
- Salton G., Mathematics and information retrieval Cornell University Technical Report 78−332
- Salton G., Buckley C., Term Weighting Approaches in Automatic Text Retrieval, Cornell University Technical Report 87−881
- Salton G., Fox E., Wu H., Extended Boolean Information Retrieval, Cornell University Technical Report 82−511
- Salton G., Buckley C. Improving retrieval performance by relevance feedback. Journal of the American Society for Information Science 41:288−297, 1990
- Brin S., Page L. The anatomy of a large-scale hypertextual Web search engine. Proceedings of the WWW7 Conference, 1998.
- Brin S., Page L., Motwani R., Winograd T. The PageRank citation ranking: Bringing order to the Web. Stanford Digital Library Technologies, Working Paper 1999−0120, 1998.
- Bharat K., Henzinger M. Improved Algorithms for Topic Distillation in a Hyperlinked Environment. ACM SIGIR conference on Research and Development in IR, 1998.
- Henzinger M. Link Analysis in Web Information Retrieval. IEEE Bulletin on Data Engineering, Vol. 23, N3, 2000.
- Chakrabarti S., Dom B., Kumar S., Raghavan P., Tomkins A., Gibson D., Kleinberg J. Mining the Web’s link structure. IEEE Computer, 32(8), pp. 60−67, 1999
- Chakrabarti S., Dom B., Gibson D., Kleinberg J., Raghavan P., Rajagopalan S. Automatic resource list compilation by analyzing hyperlink structure and associated text. Proc. 7th International World Wide Web Conference, 1998.
- Chakrabarti S. Integrating the Document Object Model with Hyperlinks for Enhanced Topic Distillation and Information Extraction. Proceedings of WWW10 Conference, 2001
- Chakrabarti S., Van Den Berg M., Dom B. Focused crawling: A new approach to topic-specific Web resource discovery. Eighth World Wide Web Conference, Toronto, May 1999.
- Kleinberg J. Authoritative sources in a hyperlinked environment. Proc. 9th ACM-SIAM Symposium on Discrete Algorithms, 1998.
- Gibson D., Kleinberg J., Raghavan P. Inferring Web communities from link topology. Proc. 9th ACM Conference on Hypertext and Hypermedia, 1998.
- Kleinberg J., Kumar S., Raghavan P., Rajagopalan S., Tomkins A. The Web as a graph: Measurements, models and methods. Invited survey at the International Conference on Combinatorics and Computing, 1999.
- Kumar S., Raghavan P., Rajagopalan S., Tomkins A. Trawling the Web for emerging cyber-communities. Eighth World Wide Web Conference, Toronto, Canada, May 1999.
- Terveen L., Hill W., Amento B. Constructing, Organizing, and Visualizing Collections of Topically Related Web Resources. ACM Transactions on Computer-Human Interaction, Vol. 6, No. 1, March 1999, pp. 67−94
- Amento B., Terveen L., Hill W. Does "Authority" Mean Quality? Predicting Expert Quality Ratings of Web Documents. ACM SIGIR conference on Research and Development in IR, 2000.
- Lempel R., Moran S. The Stochastic Approach for Link-Structure Analysis (SALSA) and the TKC Effect. Ninth World Wide Web Conference, 2000
- Borodin A., Roberts G., Rosenthal J., Tsaparas P. Finding Authorities and Hubs From Link Structures on the World Wide Web. Tenth World Wide Web Conference, Hong Kong, 2001
- Shivakumar N., Garcia-Molina H. Finding Near-Replicas of Documents on the Web. Proceedings of WebDB'99, 1999.
- Broder A. Z., Kumar S. R., Maghoul F., Raghavan P., Rajagopalan S., Stata R., Tomkins A., Wiener J. L. Graph structure in the web. In Proc. of the WWW9, pp. 309−320, 2000.
- Hermans B., Intelligent Software Agents on the Internet: an inventory of currently offered functionality in the information society and a prediction of future developments, http://www.hermans.org/agents, 1996
- Cavnar W., Trenkle J., N-gram-based text categorization, in Proceedings of SDAIR-94, 3rd Annual Symposium on Document Analysis and Information Retrieval, pp.161−175, 1994.
- TREC-2002 Web Track Guidelines, TREC, 2002.
- Segalovich I.V. How Search Engines Work, www.yandex.ru/papers/
- Open Directory Project, http://www.dmoz.org, a project to build a non-commercial directory.
- Google, www.google.com
- Yandex, www.yandex.ru
- Snowball stemmer, http://snowball.tartarus.org
- Davison B., Recognizing nepotistic links on the Web. AAAI-2000 Workshop on Artificial Intelligence for Web Search, 2000.
- Berge C. The Theory of Graphs and Its Applications. Moscow: Foreign Literature Publishing House, 1962
- Informix DBMS, http://www.informix.com
- Jakarta Project, http://jakarta.apache.org