Методы группировки и структуризации поисковых запросов и их реализация
Диссертация
Также большое внимание уделяется методам, которые позволяют преобразовывать неструктурированный запрос пользователя1 с «ключевыми словами» (keyword queries) в структурированный. Основная причина популярности подобных методов заключается в том, что большая часть интернет-данных изначально содержатся в структурированных базах данных. И знание структуры запроса значительно облегчает поиск… Читать ещё >
Список литературы
- И. Некрестьянов, М. Некрестьянова, А. Нозик. К вопросу об эффективности метода «общего котла». Труды 7-ой Всероссийской научной конференции «Электронные библиотеки: перспективные методы и технологии, электронные коллекции «RCDL'2005. — 2005.
- М. Агеев, И. Кураленок, И. Некрестьянов. Официальные метрики. Труды Российского семинара по Оценке Информационного поиска РОМИП. 2007.- Приложение А.
- Некрестьянов И. С. Кураленок И.Е. Оценка систем текстового поиска. Программирование.-№ 28.- 2002, — С. 226−242.
- Некрестьянов И.С. Тематико-ориентированные методы информационного поиска. Рукопись. — 2000.
- Юлия Киселева. Автоматическое сегментирование запросов интернет-магазинов. Программные продукты и системы. № 3 (91). — 2010.
- Юлия Киселева. Группировка пользователей интернета, основанная на истории их веб-сессий. Труды 10-ой Всероссийской научной конференции «Электронные библиотеки: перспективные методы и технологии, электронные коллекции «RCDL'2008. 2008.- С.405−407.
- Agichtein Е., Ganti V. Mining Reference tables for automatic Text Segmentation. In processing of the Eleventh ACMSIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'2004). 2004. — Pp. 20−29.
- Agirre, Eneko and David Martinez. «Integrating selectional preferences in WordNet.» In: roceedings of the first International WordNet Conference, Mysore, India, 21−25 Januaiy 2002
- Andrea Esuli, Fabrizio Sebastiani. PageRanking WordNet Synsets: An Application to Opinion Mining. In processing of 45th Annual Meeting of the Association of Computation Linguistic (ACL '2007) 2007.
- Arasu, Garcis-Molina H. Extraction information from webpages. In processing of the ACM SIGMOD International Conference on Management Data. -2003.
- B. Shipley. Cause and Correlation in Biology: A User’s Guide to Path Analysis, Structural Equations and Causal Inference. Cambridge. — 2000.
- B.Mobasher, H. Dai, T. Luo, and M.Nakagawa. Effective personalization based on association rule discovery from the web usage data. In Processing of 3rd International Workshop on Web Information and Data Management (WIDM'2001). 2001.
- Baeza-Yates R., Hurtado C., Mendoza M. Query recommendation using query logs in search engines. In processing of Current Trends in Database Technology (EDBT'2004). Springer-Verlag GmbH. — 2004. — Pp. 588−596.
- Barr C., Jones R. and Regelson M. The linguistic structure of English web-search queries. In Processing of Empirical Methods in Natural Language Processing (ENLP '2008). — 2008. Pp. 1021−1030.
- Bernard J. Jansen, Danielle L. Booth, Amanda Spink. Determining the informational, navigational and transactional intent of Web queries. In Processing Information Processing and Management. — 44. 2008. — Pp. 1251−1266.
- Broder, A. A taxonomy of Web search. In Processing of SIGIR Forum. -36(2).-2002.-Pp. 3−10.
- C. Buchwalter, M. Ryan, and D. Martin. The state of online advertising: data covering the fourth Quarter. In Processing of TR Adrelevance. 2001.
- C.W. Cleverdon. The Cranfield tests on index language devices. In Aslib Proceedings. Volume 19. 1967. — Pp. 173−192. (Reprinted in Readings in Information Retrieval, K. Sparck-Jones and P. Willett, editors, 1997)
- Cansius, S., Spoleder, C.: Bootstraping information extraction from the field books. In Processing of Conference on Empirical Methods in Natural Language Processing (EMNLP'2007). 2007. Pp. 827−836.
- Christopher D. Mining, Prabhakar Raghavan, Hinrich Schutze. An introduction to Informational Retrieval. Cambridge University Press Cambridge -England.- 2009
- Crenager T., Klein D. and Manning C.: Unsupervised learning of field segmentation models for information extraction. In Processing of the Meeting of the ACL. 2005. — pp. 371−378.
- D. Edwards. Introduction to Graphical Modelling. 2nd ed. — SpringerVerlag. — 2000.
- Data mining for hypertext: A tutorial survey. SIGKDD Explorations, 1(2). -2000. Pp. 1−11.
- E. M. Voorhees. The philosophy of Information Retrieval Evaluation. Revised Papers from the Second Workshop of the Cross-Language Evaluation
- Forum on Evaluation of Cross-Language Information Retrieval Systems. 2001.-Pp. 355−370.
- E. Voorhees. TREC 2007 Introduction (slides). Gaithersburg, Maryland, USA. — 2007
- G.Salton and M.J. McGill. Introduction to the modern Informational Retrieval. McGraw-Hill Computer Science Series. — McGraw-Hill, New-York. — 1983.
- Grandvalt, Y., Bengio, Y.: Semi-supervised Learning by Entropy Minimization. In processing of CA?. 2005. — Pp. 281−296.
- Grcar M. User Profiling: Web Usage Mining. In Proceedings of the 7th International Multiconference Information Society IS 2004. October 9−15. -Ljubljana, Slovenia. 2004, — Pp. 79−82
- Gui-Rong Xue, Hua-Jun Zeng, Zheng Chen, Yong Yu, Wei-Ying Ma, WenSi Xi, WeiGuo Fan. Optimizing web search using web-click through data. In processing of 13th ACM Conference on Information and Knowledge Management (CIKM'2004). 2004. — Pp. 118−126.
- Hammersley J., Clifford, P. Markov fields on finite graphs and lattices. Unpublished manuscript. 1971
- I. Soboroff. On evaluating Web Search With Very Few Relevant Documents. In Processing of the Annual International ACM SIGIR conference on Research and Development in Informational Retrieval (, SIGIR'04').- 2004. -Pp. 530−531.
- J. Borges and M. Levene. Detecting Concept Drift in Web Usage Mining. In Proceeding of the Workshop on Web Mining and Web Usage Analysis. -2008.-Pp. 98−110.
- J. Lafferty, A. McCallum and F.Pereira. Conditional random fields: Probabilistic models for segmenting and labeling sequences data. In Processing of the International Conference of Machine Learning. Williamstown, MA, USA. — 2001.-Pp. 282−289.
- J. Pearl. Causality: Models, Reasoning and Inference. Cambridge Univ. Press. — 2000.
- J. Zobel. How reliable are the Results of Large-Scale Information Retrieval Experiment?. In Processing of the 21st Annual ACM SIGIR Conference on Research and Development in Informational Retrieval (SIGIR'98). 1998. -Pp. 307−314.
- Jiao F., Wang S., Lee C.-H., Greiner R., Schuumians D. Semi-Supervised Conditional Random Fields for Improved Sequence Segmentation and Labeling. In processing of 47th Annual Meeting of the Association of Computation Linguistic (ACL'2007). 2009
- Johannes L., Gareth J., Jones F. Queiy recovexy of short user queries: on query expansion with stopwords. In Processing of the 33rd Annual ACM SI95
- GIR Conference on Research and Development in Informational Retrieval (SIGIR 2010). 2010 — Pp. 733−734.
- Joseph K. Bradley, Carlos Guestrin. Learning Tree Conditional Random Fields. In Processing of the 27th International Conference on Machine Learning (ICML 2010). 2010. Pp. 127−134.
- Julia Kiseleva, Eugene Agichtein, Qi Guo, Daniel Billsus, Wei Chai. Unsupervised Query Segmentation Using Click Data: Preliminary Result. In Processing of 19th International World Wide Web Conference (WWW2010). -2010.- Pp. 1131−1132.
- Julia Kiseleva. Grouping Web Users based on Query Log. In processing of 12th East European Conference Advances in Databases and Information Systems (ADBIS'2008). -2008. Pp. 184−190.
- Kevin P.Murphy. An introduction to graphical models. MIT Press. 2001.
- Li X., Wang Y.-Y., Acero A. Learning query intent from regularized click graph. In Processing of the 31st Annual ACM SIGIR Conference on Research and Development in Informational Retrieval. — 2008. Pp. 339−346.
- M. I. Jordan, editor. Learning in Graphical Models. MIT Press. -1999.
- Mikhail Kalinkin, Julia Kiseleva, Nikolay Vyahhi. Bernhard Lang. Comparison of Machine Learning Techniques for Document Ranking Problem. In processing of Workshop Distributed Intelligent Systems and Technologies proceedings. 2009. — Pp. 85−92.
- Olfa Nasraoui, Myra Spiliopoulou, Jaideep Srivastava, Bamshad Mobash-er, Brij M. Masand. Advances in Web Mining and Web Usage Analysis. In Processing of 8th International Workshop on Knowledge Discovery on the Web (WebKDD). 2006.
- P. Spirtes, C. Glymour, and R. Schemes. Causation, Prediction and Search. MIT Press. 2nd edition. — 2000.
- Pinto D., McCallum A., Wei X., Croft W.B. Table extraction using conditional random fields. In Processing of the 26th Annual ACM SIGIR Conference on Research and Development in Informational Retrieval. 2003. Pp. 235−241,
- Q. Yang, H.H. Zhang, and T. Li. Mining web logs for prediction models in www caching and prefetching. In Processing of International Conference on Computer Networks and Mobile Computing (ICCNMC'01). 2001.
- R. Baeza-Yates, B. Ribeiro-Neto. Modern Information Retrieval. Addison-Wesley, Wokingham, UK 1999.
- Robins D. Interactive Information Retrieval. Context and Basic Notions. Informing Science, 3(2). -2000. Pp. 57−62.
- S. Russell and P. Norvig. Artificial Intelligence: A Modern Approach -Prentice Hall, Englewood, NJ. 1995.
- Sanda M. Harabagiu. An Application of WordNet to Prepositional Attachment. In Processing of 34th Annual Meeting of the Association of Computation Linguistic (ACL '1996). -1996. Pp. 360−362.
- Shen D., Li Y., Li X. Dengyong Zhou. Product query classification. In Processing o/18th ACM Conference on Information and Knowledge Management (CIKM 2009). 2009. Pp. 741−750.
- T.Li, Q. Yang, K.Wang. Classification pruning for web-request prediction. In processing of 10th International World Wide Web Conference (WWW'2001). 2001.
- X. Yu and H. Shi. Query Segmentation Using Conditional Random Fields. In Processing of The first International Workshop on Keyword Search on structured data. Providence, Rhode Island, USA. — 2009. Pp. 21−26
- Yanhong Zhai, Bing Liu. Extracting Web Data Using Instance-Based Learning. In Processing of International World Wide Web Conference (WWW'2007). 2007. Pp. 113−132.
- Yanhong Zhai, Bing Liu. Web data extraction based on partial tree alignment. In Processing of World Wide Web Conference (WWW'2005). 2005. Pp. 76−85.
- Yongge Shi, Yiqun Zhou. An Improved Apriori Algorithm. In Processing of Gordon Research Conference (GRC' 2010). 2010. Pp. 759 762.
- Yves Grandvalet, Yoshua Bengio. Semi-supervised Learning by Entropy Minimization. In Processing of CAP. 2005. Pp. 281−296.
- Zhao C., Mahmud J. and Ramakrishna I. Exploiting structured reference data for unsupervised text segmentation with conditional random fields. In Processing of the SI AM International Conference on Data Mining. 2008.
- Zhu J., Zhang B., Nie Z., Wen J.-R. and Hon H.-W. Webpage understanding: and integrated approach. In Processing of the Thirteenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2007. Pp. 903−912.