Page 36 Concepts and similar pages

Concepts

Similarity Concept
Classification definition
Diagnosis definition
Numerical taxonomy
Information retrieval definition
Operational information retrieval
Document clustering
Experimental information retrieval
Automatic document classification
Information retrieval system
Automatic classification

Similar pages

Similarity Page Snapshot
3 that in IR we are searching for relevant documents as opposed to exactly matching items ...Many automatic information retrieval systems are experimental ...Many of the techniques I shall discuss will not have proved themselves incontrovertibly superior to all other techniques,but they have promise and their promise will only be realised when they are understood ...My aim throughout has been to give a complete coverage of the more important ideas current in various special areas of information retrieval ...
185 approaching this problem of speeding up clustering is to look for what one might call almost classifications ...A big question,that has not yet received much attention,concerns the extent to which retrieval effectiveness is limited by the type of document description used ...Document classification is a special case of a more general process which would also attempt to exploit relationships between documents ...An argument parallel to the one in the last paragraph could be given for automatic keyword classification,which in the more general context might be called automatic content unit classification ...H ...
37 influence the choice of [classification]method and the results obtained ...There are two main areas of application of classification methods in IR:1 keyword clustering;2 document clustering ...The first area is very well dealt with in a recent book by Sparck Jones [5]...Good [6]:We define the organisation as the grouping together of items e ...The efficiency of document clustering has been emphasised by
8 The process may involve structuring the information in some appropriate way,such as classifying it ...Finally,we come to the output,which is usually a set of citations or document numbers ...IR in perspective This section is not meant to constitute an attempt at an exhaustive and complete account of the historical development of IR ...Since the emphasis in this book is on a particular approach to document representation,I shall restrict myself here to a few remarks about its history ...At this point,it may be convenient to elaborate on the use of keyword ...The use of statistical information about distributions of words in documents was further exploited by Maron and Kuhns [11]who obtained statistical associations between keywords ...
61 document clustering,search strategies,and such like to work inside a computer ...Bibliographic remarks In recent years a vast literature on automatic classification has been generated ...A book and a report on cluster analysis with a computational emphasis are Anderberg [59]...Two papers worth singling out are Sibson [65]...Much of the early work in document clustering was done on the SMART project ...There are a number of areas in IR where automatic classification is used which have not been touched on in this chapter ...One further interesting area of application of clustering techniques is in the clustering of citation graphs ...
62 documents can be stored as closely together as possible ...Finally,the reader may be interested in pursuing the use of cluster methods in pattern recognition since some of the ideas developed there are applicable to IR ...References 1 ...2 ...3 ...4 ...5 ...6 ...7 ...8 ...9 ...10 ...11 ...12 ...13 ...14 ...
195 DAMERAU,F ...DAMERAU,F ...DATE,C ...DATTOLA,R ...DATTOLA,R ...DE FINETTI,B ...DENNIS,S ...DISISS,Design of information systems in the social sciences ...DODD,G ...DOROFEYUK,A ...DOYLE,L ...DOYLE,L ...DOYLE,L ...DOYLE,L ...DOYLE,L ...DOYLE,L ...DUDA,R ...EDMUNDSON,H ...EFRON,B ...EIN DOR,P ...ELLIS,B ...ETZWEILER,L ...EVERITT,B ...FAIRTHORNE,R ...
9 Sparck Jones has carried on this work using measures of association between keywords based on their frequency of co occurrence that is,the frequency with which any two keywords occur together in the same document ...The term information structure for want of better words covers specifically a logical organisation of information,such as document representatives,for the purpose of information retrieval ...The organisation of these files is produced by an automatic classification method ...Evaluation of retrieval systems has proved extremely difficult ...
35 34 ...35 ...36 ...37 ...38 ...39 ...40 ...41 ...42 ...43 ...44 ...45 ...46 ...47 ...48 ...49 ...50 ...51 ...52 ...53 ...54 ...55 ...