| 39 |
There are five commonly used measures of association in information retrieval
...The simplest of all association measures is X [[intersection]]Y Simple matching coefficient which is the number of shared index terms
...These may all be considered to be normalised versions of the simple matching coefficient
...then X 1 1 Y 1 1 X 1 [[intersection]]Y 2 1 >S 1 1 S 2 1 X 2 10 Y 2 10 X 2 [[intersection]]Y 2 1 >S 1 1 S 2 1 10 S 1 X 1,Y 1 S 1 X 2,Y 2 which is clearly absurd since X 1 and Y 1 are identical representatives whereas X 2 and Y 2 are radically different
...Doyle [17]hinted at the importance of normalisation in an amusing way:One would regard the postulate All documents are created equal as being a reasonable foundation for a library description
... |