Page 185 Concepts and similar pages

Concepts

Similarity Concept
Nearly decomposable system
Automatic document classification
Document clustering
Retrieval effectiveness
Automatic classification
Document representative
Generality
Data retrieval systems
Automatic keyword clustering
Classification methods

Similar pages

Similarity Page Snapshot
186 behaviour of any one of the components depends in only an aggregate way on the behaviour of the other components ...2 ...On the file structure chosen and the way it is used depends the efficiency of an information retrieval system ...Inverted files have been rather popular in IR systems ...There are many more problems in this area which are of interest to IR systems ...3 ...So far fairly simple search strategies have been tried ...
184 Eight THE FUTURE Future research In the preceding chapters I have tried to bring together some of the more elaborate tools that are used during the design of an experimental information retrieval system ...1 ...Substantial evidence that large document collections can be handled successfully by means of automatic classification will encourage new work into ways of structuring such collections ...It is therefore of some importance that using the kind of data already in existence,that is using document descriptions in terms of keywords,we establish that document clustering on large document collections can be both effective and efficient ...
8 The process may involve structuring the information in some appropriate way,such as classifying it ...Finally,we come to the output,which is usually a set of citations or document numbers ...IR in perspective This section is not meant to constitute an attempt at an exhaustive and complete account of the historical development of IR ...Since the emphasis in this book is on a particular approach to document representation,I shall restrict myself here to a few remarks about its history ...At this point,it may be convenient to elaborate on the use of keyword ...The use of statistical information about distributions of words in documents was further exploited by Maron and Kuhns [11]who obtained statistical associations between keywords ...