
Retrieval models

A retrieval model M is embedded in a universe of objects. An object is a 2-tuple tex2html_wrap_inline1229, where tex2html_wrap_inline1231 is the object's unique id, and tex2html_wrap_inline1233 is the object's unique representation. The exact definition of a representation depends on the specific retrieval task (Kulyukin 1999a; Kulyukin 1999b). The universe in which M is embedded is the set of all objects, and is denoted by tex2html_wrap_inline1237. The finite set of objects retrievable by M is denoted by tex2html_wrap_inline1241. The set tex2html_wrap_inline1243 contains the ids of objects in tex2html_wrap_inline1245. Since there is a bijection between tex2html_wrap_inline1245 and tex2html_wrap_inline1249, objects are referred to by their ids when the context permits. M's primitives are called tokens. The precise definition of a token depends on the context (Kulyukin 1998a; Kulyukin 1998b). Tokens can be keywords, keyword collocations, nodes in a semantic network, etc. The set of all possible tokens is denoted by tex2html_wrap_inline1253.

If tex2html_wrap_inline1255 is M's set of representations, M's representation function is tex2html_wrap_inline1261 . If tex2html_wrap_inline1263 , tex2html_wrap_inline1265 . The token weight function tex2html_wrap_inline1267 assigns weights to tokens in objects. The object similarity function tex2html_wrap_inline1269 computes the similarity between two objects in tex2html_wrap_inline1237 . The rank function tex2html_wrap_inline1273 imposes an ordering on tex2html_wrap_inline1245 's objects. The rank of tex2html_wrap_inline1277 with respect to tex2html_wrap_inline1279 is denoted by tex2html_wrap_inline1281 . If tex2html_wrap_inline1283 , then tex2html_wrap_inline1285 , and tex2html_wrap_inline1287 . Thus, the ranking of objects is determined by tex2html_wrap_inline1289 and their initial ordering in tex2html_wrap_inline1245 .
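As a rough illustration of how a similarity function and the initial ordering of objects can together fix a ranking, the following Python sketch models objects as (id, representation) pairs. Everything concrete in it is an assumption made for the example and is not part of the model's definition: representations are taken to be sets of keyword tokens, `similarity` is plain token overlap, and `rank_key` and `rank_objects` are hypothetical helper names. Larger similarity is assumed to rank first, with ties broken by position in the object list.

```python
from typing import FrozenSet, List, Tuple

# An object is an (id, representation) pair. Assumption for this sketch:
# a representation is a set of keyword tokens.
Obj = Tuple[int, FrozenSet[str]]


def similarity(a: Obj, b: Obj) -> float:
    """Illustrative similarity: the number of shared tokens (an assumption)."""
    return float(len(a[1] & b[1]))


def rank_key(position: int, obj: Obj, query: Obj) -> Tuple[float, int]:
    """Sort key: similarity first (negated so larger sorts earlier),
    then the object's initial position as the tie-breaker."""
    return (-similarity(obj, query), position)


def rank_objects(objects: List[Obj], query: Obj) -> List[Obj]:
    """Order objects by decreasing similarity; ties keep the initial order."""
    indexed = list(enumerate(objects))
    indexed.sort(key=lambda pair: rank_key(pair[0], pair[1], query))
    return [obj for _, obj in indexed]


objects = [
    (0, frozenset({"semantic", "network"})),
    (1, frozenset({"vector", "space"})),
    (2, frozenset({"semantic", "retrieval"})),
]
query = (99, frozenset({"semantic", "retrieval", "model"}))
ranked = rank_objects(objects, query)
print([obj_id for obj_id, _ in ranked])
```

Making the initial position an explicit part of the sort key keeps the ordering deterministic even when several objects are equally similar to the query.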

A retrieval sequence returned by M in response to tex2html_wrap_inline1279 is denoted by tex2html_wrap_inline1297 , and is a permutation tex2html_wrap_inline1299 of the ids of objects in tex2html_wrap_inline1245 such that tex2html_wrap_inline1303 . Let tex2html_wrap_inline1305 and tex2html_wrap_inline1307 . tex2html_wrap_inline1309 and tex2html_wrap_inline1311 are equivalent under ranked retrieval ( tex2html_wrap_inline1313 ) iff tex2html_wrap_inline1315 = tex2html_wrap_inline1319 , tex2html_wrap_inline1321 , tex2html_wrap_inline1323 , ... , tex2html_wrap_inline1325 , tex2html_wrap_inline1327 tex2html_wrap_inline1329 , tex2html_wrap_inline1331 , tex2html_wrap_inline1333 = tex2html_wrap_inline1319 , tex2html_wrap_inline1339 tex2html_wrap_inline1341 , tex2html_wrap_inline1323 , ... , tex2html_wrap_inline1325 , tex2html_wrap_inline1339 tex2html_wrap_inline1329 , tex2html_wrap_inline1331 , and tex2html_wrap_inline1353 .
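The idea of a retrieval sequence, and of comparing two models by the sequences they return, can be sketched in Python as follows. This is only one reading of the definition above, under stated assumptions: two models are treated as agreeing when they hold the same ids and return identical retrieval sequences for each query in a finite sample. The names `retrieval_sequence`, `agree_on_queries`, `overlap`, `scaled_overlap`, and `reversed_overlap` are hypothetical, representations are assumed to be token sets, and larger similarity is assumed to come first.

```python
from typing import Callable, FrozenSet, Iterable, List, Tuple

Obj = Tuple[int, FrozenSet[str]]          # (id, representation)
Similarity = Callable[[Obj, Obj], float]  # similarity between two objects


def retrieval_sequence(objects: List[Obj], sim: Similarity, query: Obj) -> List[int]:
    """Return the ids of `objects` ordered by decreasing similarity to `query`.

    sorted() is stable, so equally similar objects keep their initial order.
    """
    ordered = sorted(objects, key=lambda obj: -sim(obj, query))
    return [obj_id for obj_id, _ in ordered]


def agree_on_queries(objs1: List[Obj], sim1: Similarity,
                     objs2: List[Obj], sim2: Similarity,
                     queries: Iterable[Obj]) -> bool:
    """True if both models hold the same ids and return the same
    retrieval sequence for every sampled query."""
    if {i for i, _ in objs1} != {i for i, _ in objs2}:
        return False
    return all(
        retrieval_sequence(objs1, sim1, q) == retrieval_sequence(objs2, sim2, q)
        for q in queries
    )


def overlap(a: Obj, b: Obj) -> float:
    return float(len(a[1] & b[1]))


def scaled_overlap(a: Obj, b: Obj) -> float:
    return 2.0 * overlap(a, b)   # different scores, same ordering


def reversed_overlap(a: Obj, b: Obj) -> float:
    return -overlap(a, b)        # inverts the ordering


objects = [
    (0, frozenset({"semantic", "network"})),
    (1, frozenset({"vector", "space"})),
    (2, frozenset({"semantic", "retrieval"})),
]
queries = [
    (100, frozenset({"semantic", "retrieval"})),
    (101, frozenset({"vector"})),
]
print(agree_on_queries(objects, overlap, objects, scaled_overlap, queries))
print(agree_on_queries(objects, overlap, objects, reversed_overlap, queries))
```

A check over a finite sample of queries can only refute agreement, not establish it for every object in the universe; establishing it in general requires an argument about the two similarity functions themselves.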





Vladimir Kulyukin
Fri Oct 29 09:32:57 CDT 1999