Categorizing queries by topic directory
|Categorizing queries by topic directory|
|Author(s)||He M., Cutler M., Wu K.|
|Published in||Proceedings - The 9th International Conference on Web-Age Information Management, WAIM 2008|
|Keyword(s)||Unknown (Extra: Administrative data processing, Conformal mapping, Information science, Ketones, Management information systems, World Wide Web, Deep web, Fact-finding, International conferences, Mapping systems, Precision and recall, Question-answering systems, Topic directory, User queries, Web resources, Web sources, Web users, Wikipedia, Information management)|
|Article||BASE, CiteSeerX, Google Scholar|
|Web||Ask, Bing, Google (PDF), Yahoo!|
|Download and mirrors|
|Local copy||Not available|
|Remote mirror(s)||Not available|
|Export and share|
|BibTeX, CSV, RDF, JSON|
|Browse properties · List of conference papers|
Categorizing queries by topic directory is a 2008 conference paper written in English by He M., Cutler M., Wu K. and published in Proceedings - The 9th International Conference on Web-Age Information Management, WAIM 2008.
The categorization of a web user query by topic or category can be used to select useful web sources that contain the required information. In pursuit of this goal, we explore methods for mapping user queries to category hierarchies under which deep web resources are also assumed to be classified. Our sources for these category hierarchies, or directories, are Yahoo! Directory and Wikipedia. Forwarding an unrefined query (in our case a typical fact finding query sent to a question answering system) directly to these directory resources usually returns no directories or incorrect ones. Instead, we develop techniques to generate more specific directory finding queries from an unrefined query and use these to retrieve better directories. Despite these engineered queries, our two resources often return multiple directories that include many incorrect results, i.e., directories whose categories are not related to the query, and thus web resources for these categories are unlikely to contain the required information. We develop methods for selecting the most useful ones. We consider a directory to be useful if web sources for any of its narrow categories are likely to contain the searched for information. We evaluate our mapping system on a set of 250 TREC questions and obtain precision and recall in the 0.8 to 1.0 range.
- This section requires expansion. Please, help!
Probably, this publication is cited by others, but there are no articles available for them in WikiPapers. Cited 1 time(s)