OpenAIR OpenAIR
 
 

OpenAIR @ RGU >
Design and Technology >
Computing >
Book chapters (Computing) >

Please use this identifier to cite or link to this item: http://hdl.handle.net/10059/351
This item has been viewed 8 times in the last year. View Statistics

Files in This Item:

File Description SizeFormat
Song WISE 2005 34.pdf237.71 kBAdobe PDFView/Open
Title: Document re-ranking by generality in bio-medical information retrieval.
Authors: Yan, Xin
Li, Xue
Song, Dawei
Editors: Ngu, Anne
Kitsuregawa, Masaru
Neuhold, Erich
Keywords: Document ranking
Information retrieval
Bio-medical
Issue Date: 2005
Publisher: Springer
Citation: YAN, X., LI, X. and SONG, D., 2005. Document re-ranking by generality in bio-medical information retrieval. In: A. NGU, M. KITSUREGAWA, E. NEUHOLD, J.-Y. CHUNG and Q. SHENG. Eds. Web Information Systems Engineering – WISE 2005: 6th International Conference on Web Information Systems Engineering, New York, NY, USA, November 20-22, 2005; Proceedings. Berlin: Springer. pp. 376-389.
Series/Report no.: Lectur Notes in Computer Science
3806
Abstract: Document ranking is well known to be a crucial process in information retrieval (IR). It presents retrieved documents in an order of their estimated degrees of relevance to query. Traditional document rank- ing methods are based on different measurements of similarity between documents and query. Due to information explosion and the popularity of WWW information retrieval, the increased variety of information and users makes it insu±cient to consider similarity alone in the ranking pro- cess. In some cases, there is a need for user to retrieve documents which are generally or broadly describing a certain topic. This is particularly the case in some specific domains such as bio-medical IR. To satisfy the stringent requirement of generality based retrieval, we propose a novel ap- proach to re-rank the retrieved documents by considering their generality as a compliment. By analyzing the semantic cohesion of text, document generality can be quantified. The retrieved documents are then re-ranked by their combined scores of similarity and the closeness of documents' generality to the query's. Results show an encouraging performance on a large scale bio-medical text corpus, OHSUMED, which is a subset of MEDLINE collection containing 348,566 medical journal references and 101 test queries.
ISBN: 9783540300175
Appears in Collections:Book chapters (Computing)

All items in OpenAIR are protected by copyright, with all rights reserved.

 

 
   Disclaimer | Freedom of Information | Privacy Statement |Copyright ©2012 Robert Gordon University, Schoolhill, Aberdeen, AB10 1FR, Scotland, UK: a Scottish charity, registration No. SCO13781