Similar concepts
Pages with this concept
Similarity |
Page |
Snapshot |
| 29 |
subsets differing in the extent to which they are about a word w then the distribution of w can be described by a mixture of two Poisson distributions
...here p 1 is the probability of a random document belonging to one of the subsets and x 1 and x 2 are the mean occurrences in the two classes
...Although Harter [31]uses function in his wording of this assumption,I think measure would have been more appropriate
...assumption 1 we can calculate the probability of relevance for any document from one of these classes
...that is used to make the decision whether to assign an index term w that occurs k times in a document
...Finally,although tests have shown that this model assigns sensible index terms,it has not been tested from the point of view of its effectiveness in retrieval
...Discrimination and or representation There are two conflicting ways of looking at the problem of characterising documents for retrieval
... |
| 2 |
Table 1
...Data Retrieval DR Information Retrieval IR Matching Exact match Partial match,best match Inference Deduction Induction Model Deterministic Probabilistic Classification Monothetic Polythetic Query language Artificial Natural Query specification Complete Incomplete Items wanted Matching Relevant Error response Sensitive Insensitive between the two is a vague one
...Let us now take each item in the table in turn and look at it more closely
...The inference used in data retrieval is of the simple deductive kind,that is,a R b and b R c then a R c
...Another distinction can be made in terms of classifications that are likely to be useful
...The query language for DR will generally be of the artificial kind,one with restricted syntax and vocabulary,in IR we prefer to use natural language although there are some notable exceptions
... |
|
|