Concepts
Similar pages
Similarity |
Page |
Snapshot |
| 46 |
relevant-relevant (R-R) and relevant-non-relevant (R-N-R) associations of a collection
...From these it is apparent: (a) that the separation for collection X is good while for Y it is poor; and (b) that the strength of the association between relevant documents is greater for X than for Y
...Figure 3
...It is this separation between the distributions that one attempts to exploit in document clustering
...I should add that these conclusions can only be verified, finally, by experimental work on a large number of collections
... |
| 160 |
system so that if we were to adopt Δ as a measure of effectiveness we could be throwing away vital information needed to make an extrapolation to the performance of other systems
...The Cooper model: expected search length. In 1968, Cooper [20] stated: "The primary function of a retrieval system is conceived to be that of saving its users to as great an extent as is possible, the labour of perusing and discarding irrelevant documents, in their search for relevant ones"
...(a) only one relevant document is wanted; (b) some arbitrary number n is wanted; (c) all relevant documents are wanted; (d) a given proportion of the relevant documents is wanted, etc
...Thus,the index is a measure of performance for a query of given type
...The output of a search strategy is assumed to be a weak ordering of documents
... |
| 47 |
that a search strategy will infallibly find the class of documents containing the relevant documents
...Note that the Cluster Hypothesis refers to given document descriptions
...As can be seen from the above, the Cluster Hypothesis is a convenient way of expressing the aim of such operations as document clustering
...The use of clustering in information retrieval. There are a number of discussions in print now which cover the use of clustering in IR
...In choosing a cluster method for use in experimental IR, two, often conflicting, criteria have frequently been used
...(1) the method produces a clustering which is unlikely to be altered drastically when further objects are incorporated, i
...(2) the method is stable in the sense that small errors in the description of the objects lead to small changes in the clustering; (3) the method is independent of the initial ordering of the objects
...These conditions have been adapted from Jardine and Sibson [2]... |
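The page 160 snapshot describes Cooper's expected search length for a query that wants a given number of relevant documents, with the search output taken to be a weak ordering. A minimal sketch of how that quantity might be computed is given below. It assumes the usual formulation, in which the non-relevant documents in all fully examined levels are counted, and the final level, examined in random order, contributes i·s/(r+1) expected non-relevant documents (r relevant and i non-relevant documents in that level, s relevant documents still needed). The function name and the representation of levels as (relevant, non_relevant) count pairs are illustrative choices, not taken from the source.

```python
from fractions import Fraction


def expected_search_length(levels, wanted):
    """Expected number of non-relevant documents examined before
    `wanted` relevant documents have been found.

    levels: list of (relevant, non_relevant) counts, one pair per level
            of the weak ordering, best level first (hypothetical layout).
    wanted: number of relevant documents the user wants.
    Returns a Fraction, or None if the ordering holds fewer than
    `wanted` relevant documents.
    """
    if wanted <= 0:
        return Fraction(0)

    skipped = 0            # non-relevant documents in fully examined levels
    still_needed = wanted  # relevant documents not yet found

    for relevant, non_relevant in levels:
        if relevant >= still_needed:
            # Final level: documents are met in random order, so the
            # expected number of non-relevant ones seen before the
            # still_needed-th relevant one is i * s / (r + 1).
            return skipped + Fraction(non_relevant * still_needed, relevant + 1)
        # The whole level must be examined before moving on.
        still_needed -= relevant
        skipped += non_relevant

    return None  # the request cannot be satisfied


# Two levels: (1 relevant, 2 non-relevant), then (2 relevant, 4 non-relevant).
example_levels = [(1, 2), (2, 4)]
print(expected_search_length(example_levels, 1))
print(expected_search_length(example_levels, 2))
```

Exact fractions are used so that values for different request types (one document, n documents, all documents) can be compared without rounding.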