Keyword-based search engines are widely used today for Web-based information retrieval. Although such systems seem deceptivel...
The eXtensible Markup Language (XML) is fast emerging as the dominant standard for describing and interchanging data among various systems and databas...
Modern large retrieval environments tend to overwhelm their users with the volume of their output. Since not all documents are of equal relevance to their users...
In this article we consider methods for automatic query expansion from top retrieved documents (i.e., retrieval feedback) that make use of various fun...
We present a new algorithm for duplicate document detection that uses collection statistics. We compare our approach with the state-of-the-art approac...