Page 97 Concepts and similar pages

Concepts

Similarity Concept
Matching function
Simple matching coefficient
Coordination level
Document clustering
Document representative
Association Measures
Term clustering
Measures of association
Term
E measure

Similar pages

Similarity Page Snapshot
39 There are five commonly used measures of association in information retrieval ...The simplest of all association measures is X [[intersection]]Y Simple matching coefficient which is the number of shared index terms ...These may all be considered to be normalised versions of the simple matching coefficient ...then X 1 1 Y 1 1 X 1 [[intersection]]Y 2 1 >S 1 1 S 2 1 X 2 10 Y 2 10 X 2 [[intersection]]Y 2 1 >S 1 1 S 2 1 10 S 1 X 1,Y 1 S 1 X 2,Y 2 which is clearly absurd since X 1 and Y 1 are identical representatives whereas X 2 and Y 2 are radically different ...Doyle [17]hinted at the importance of normalisation in an amusing way:One would regard the postulate All documents are created equal as being a reasonable foundation for a library description ...