Document (#5449)

Author
Soergel, D.
Title
Mathematical analysis of documentation systems : an attempt to a theory of classification and search request formulation
Source
Information storage and retrieval. 3(1967), S.129-173
Year
1967
Abstract
As an attempt to make a general structural theory of information retrieval, a documentation system (DS) is defined as a formal system consisting of (a) a set o of objects (documents); (b) a set A++ of elementary attributes (key-words), from which further attributes may be constructed: A++ generates A; (c) a set of axioms of the form X++(x)=m (m¯M, M a set of constant connecting attributes with objects: from the axioms further theorems (=true statements) may be constructed. By use of the theorems, different mappings O -> P(o) (P(o) set of all subsets of o) (search question -> set of documents retrieved) are defined. The type of a DS depends on two basic decisions: (1) choice of the rules for the construction of attributes and theorems, e.g., logical product in coordinate indexing; links. (2) choice of M; M may consist of the two constants 'applicable' and 'not applicable', or some positive integers, ...; Further practical decisions: A++ hierarchical or not; kind of mapping; introduction of roles (=further attributes). The most simple case - ordinary two-valued Coordinate Indexing - is discusssed in detail; o is a free distributive (but not Boolean) lattice, the homographic image a ring of subsets of o; instead of negation which is not useful, a useful retrieval operation 'praeternagation' is introduced. Furthermore these are discussed: a generalized definition of superimposed coding, some functions for the distance of objects or attributes; optimization and automatic derivation of classifications. The model takes into account term-term relations and document-document relations. It may serve as a structural framework in terms of which the functional problems of retrieval theory may be expressed more clearly

Similar documents (author)

  1. Soergel, D.E.: Organizing information : principles of database and retrieval systems (1985) 5.02
    5.023691 = sum of:
      5.023691 = weight(author_txt:soergel in 867) [ClassicSimilarity], result of:
        5.023691 = fieldWeight in 867, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.037906 = idf(docFreq=38, maxDocs=44421)
          0.625 = fieldNorm(doc=867)
    
  2. Soergel, D.: ¬The Broad System of Ordering : a critique (1979) 5.02
    5.023691 = sum of:
      5.023691 = weight(author_txt:soergel in 1863) [ClassicSimilarity], result of:
        5.023691 = fieldWeight in 1863, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.037906 = idf(docFreq=38, maxDocs=44421)
          0.625 = fieldNorm(doc=1863)
    
  3. Soergel, D.: Software support for thesaurus construction and display (1994) 5.02
    5.023691 = sum of:
      5.023691 = weight(author_txt:soergel in 504) [ClassicSimilarity], result of:
        5.023691 = fieldWeight in 504, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.037906 = idf(docFreq=38, maxDocs=44421)
          0.625 = fieldNorm(doc=504)
    
  4. Soergel, D.: Information structure management : a unified framework for indexing and searching in database, expert, information-retrieval, and hypermedia systems (1994) 5.02
    5.023691 = sum of:
      5.023691 = weight(author_txt:soergel in 3052) [ClassicSimilarity], result of:
        5.023691 = fieldWeight in 3052, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.037906 = idf(docFreq=38, maxDocs=44421)
          0.625 = fieldNorm(doc=3052)
    
  5. Soergel, D.: Framework for data element standardization (1995) 5.02
    5.023691 = sum of:
      5.023691 = weight(author_txt:soergel in 4642) [ClassicSimilarity], result of:
        5.023691 = fieldWeight in 4642, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.037906 = idf(docFreq=38, maxDocs=44421)
          0.625 = fieldNorm(doc=4642)
    

Similar documents (content)

  1. Srinivasan, P.: Intelligent information retrieval using rough set approximations (1989) 0.15
    0.14897631 = sum of:
      0.14897631 = product of:
        0.7448815 = sum of:
          0.04424845 = weight(abstract_txt:term in 2594) [ClassicSimilarity], result of:
            0.04424845 = score(doc=2594,freq=1.0), product of:
              0.098438315 = queryWeight, product of:
                1.0970167 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.018714935 = queryNorm
              0.44950435 = fieldWeight in 2594, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.09375 = fieldNorm(doc=2594)
          0.025300447 = weight(abstract_txt:retrieval in 2594) [ClassicSimilarity], result of:
            0.025300447 = score(doc=2594,freq=1.0), product of:
              0.07762733 = queryWeight, product of:
                1.19312 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.018714935 = queryNorm
              0.3259219 = fieldWeight in 2594, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.09375 = fieldNorm(doc=2594)
          0.09693984 = weight(abstract_txt:theory in 2594) [ClassicSimilarity], result of:
            0.09693984 = score(doc=2594,freq=3.0), product of:
              0.1317924 = queryWeight, product of:
                1.5546118 = boost
                4.529811 = idf(docFreq=1301, maxDocs=44421)
                0.018714935 = queryNorm
              0.73554957 = fieldWeight in 2594, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.529811 = idf(docFreq=1301, maxDocs=44421)
                0.09375 = fieldNorm(doc=2594)
          0.09807623 = weight(abstract_txt:objects in 2594) [ClassicSimilarity], result of:
            0.09807623 = score(doc=2594,freq=1.0), product of:
              0.1915601 = queryWeight, product of:
                1.8742578 = boost
                5.4611917 = idf(docFreq=512, maxDocs=44421)
                0.018714935 = queryNorm
              0.51198673 = fieldWeight in 2594, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4611917 = idf(docFreq=512, maxDocs=44421)
                0.09375 = fieldNorm(doc=2594)
          0.48031655 = weight(abstract_txt:attributes in 2594) [ClassicSimilarity], result of:
            0.48031655 = score(doc=2594,freq=3.0), product of:
              0.48259613 = queryWeight, product of:
                4.2071085 = boost
                6.1293135 = idf(docFreq=262, maxDocs=44421)
                0.018714935 = queryNorm
              0.99527645 = fieldWeight in 2594, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1293135 = idf(docFreq=262, maxDocs=44421)
                0.09375 = fieldNorm(doc=2594)
        0.2 = coord(5/25)
    
  2. Rorissa, A.: Relationships between perceived features and similarity of images : a test of Tversky's contrast model (2007) 0.13
    0.12646675 = sum of:
      0.12646675 = product of:
        0.45166695 = sum of:
          0.009930809 = weight(abstract_txt:which in 1520) [ClassicSimilarity], result of:
            0.009930809 = score(doc=1520,freq=1.0), product of:
              0.054531377 = queryWeight, product of:
                2.9137893 = idf(docFreq=6552, maxDocs=44421)
                0.018714935 = queryNorm
              0.18211183 = fieldWeight in 1520, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9137893 = idf(docFreq=6552, maxDocs=44421)
                0.0625 = fieldNorm(doc=1520)
          0.029214436 = weight(abstract_txt:retrieval in 1520) [ClassicSimilarity], result of:
            0.029214436 = score(doc=1520,freq=3.0), product of:
              0.07762733 = queryWeight, product of:
                1.19312 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.018714935 = queryNorm
              0.37634215 = fieldWeight in 1520, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=1520)
          0.053322382 = weight(abstract_txt:structural in 1520) [ClassicSimilarity], result of:
            0.053322382 = score(doc=1520,freq=1.0), product of:
              0.14607167 = queryWeight, product of:
                1.3363314 = boost
                5.8406816 = idf(docFreq=350, maxDocs=44421)
                0.018714935 = queryNorm
              0.3650426 = fieldWeight in 1520, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8406816 = idf(docFreq=350, maxDocs=44421)
                0.0625 = fieldNorm(doc=1520)
          0.05661377 = weight(abstract_txt:attempt in 1520) [ClassicSimilarity], result of:
            0.05661377 = score(doc=1520,freq=1.0), product of:
              0.15202244 = queryWeight, product of:
                1.3632798 = boost
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.018714935 = queryNorm
              0.37240404 = fieldWeight in 1520, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.0625 = fieldNorm(doc=1520)
          0.06538415 = weight(abstract_txt:objects in 1520) [ClassicSimilarity], result of:
            0.06538415 = score(doc=1520,freq=1.0), product of:
              0.1915601 = queryWeight, product of:
                1.8742578 = boost
                5.4611917 = idf(docFreq=512, maxDocs=44421)
                0.018714935 = queryNorm
              0.34132448 = fieldWeight in 1520, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4611917 = idf(docFreq=512, maxDocs=44421)
                0.0625 = fieldNorm(doc=1520)
          0.18306646 = weight(abstract_txt:axioms in 1520) [ClassicSimilarity], result of:
            0.18306646 = score(doc=1520,freq=1.0), product of:
              0.33242893 = queryWeight, product of:
                2.0159538 = boost
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.018714935 = queryNorm
              0.5506935 = fieldWeight in 1520, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.0625 = fieldNorm(doc=1520)
          0.054134965 = weight(abstract_txt:further in 1520) [ClassicSimilarity], result of:
            0.054134965 = score(doc=1520,freq=1.0), product of:
              0.18590377 = queryWeight, product of:
                2.132015 = boost
                4.6591816 = idf(docFreq=1143, maxDocs=44421)
                0.018714935 = queryNorm
              0.29119885 = fieldWeight in 1520, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6591816 = idf(docFreq=1143, maxDocs=44421)
                0.0625 = fieldNorm(doc=1520)
        0.28 = coord(7/25)
    
  3. Khoo, S.G.; Na, J.-C.: Semantic relations in information science (2006) 0.12
    0.11813339 = sum of:
      0.11813339 = product of:
        0.42190495 = sum of:
          0.0070221424 = weight(abstract_txt:which in 2978) [ClassicSimilarity], result of:
            0.0070221424 = score(doc=2978,freq=2.0), product of:
              0.054531377 = queryWeight, product of:
                2.9137893 = idf(docFreq=6552, maxDocs=44421)
                0.018714935 = queryNorm
              0.12877251 = fieldWeight in 2978, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.9137893 = idf(docFreq=6552, maxDocs=44421)
                0.03125 = fieldNorm(doc=2978)
          0.014607218 = weight(abstract_txt:retrieval in 2978) [ClassicSimilarity], result of:
            0.014607218 = score(doc=2978,freq=3.0), product of:
              0.07762733 = queryWeight, product of:
                1.19312 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.018714935 = queryNorm
              0.18817107 = fieldWeight in 2978, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.03125 = fieldNorm(doc=2978)
          0.027033681 = weight(abstract_txt:defined in 2978) [ClassicSimilarity], result of:
            0.027033681 = score(doc=2978,freq=2.0), product of:
              0.11701452 = queryWeight, product of:
                1.1960547 = boost
                5.2275767 = idf(docFreq=647, maxDocs=44421)
                0.018714935 = queryNorm
              0.23102844 = fieldWeight in 2978, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2275767 = idf(docFreq=647, maxDocs=44421)
                0.03125 = fieldNorm(doc=2978)
          0.12068475 = weight(abstract_txt:relations in 2978) [ClassicSimilarity], result of:
            0.12068475 = score(doc=2978,freq=28.0), product of:
              0.13163212 = queryWeight, product of:
                1.2685632 = boost
                5.5444884 = idf(docFreq=471, maxDocs=44421)
                0.018714935 = queryNorm
              0.9168336 = fieldWeight in 2978, product of:
                5.2915025 = tf(freq=28.0), with freq of:
                  28.0 = termFreq=28.0
                5.5444884 = idf(docFreq=471, maxDocs=44421)
                0.03125 = fieldNorm(doc=2978)
          0.06538415 = weight(abstract_txt:objects in 2978) [ClassicSimilarity], result of:
            0.06538415 = score(doc=2978,freq=4.0), product of:
              0.1915601 = queryWeight, product of:
                1.8742578 = boost
                5.4611917 = idf(docFreq=512, maxDocs=44421)
                0.018714935 = queryNorm
              0.34132448 = fieldWeight in 2978, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.4611917 = idf(docFreq=512, maxDocs=44421)
                0.03125 = fieldNorm(doc=2978)
          0.027067482 = weight(abstract_txt:further in 2978) [ClassicSimilarity], result of:
            0.027067482 = score(doc=2978,freq=1.0), product of:
              0.18590377 = queryWeight, product of:
                2.132015 = boost
                4.6591816 = idf(docFreq=1143, maxDocs=44421)
                0.018714935 = queryNorm
              0.14559942 = fieldWeight in 2978, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6591816 = idf(docFreq=1143, maxDocs=44421)
                0.03125 = fieldNorm(doc=2978)
          0.16010553 = weight(abstract_txt:attributes in 2978) [ClassicSimilarity], result of:
            0.16010553 = score(doc=2978,freq=3.0), product of:
              0.48259613 = queryWeight, product of:
                4.2071085 = boost
                6.1293135 = idf(docFreq=262, maxDocs=44421)
                0.018714935 = queryNorm
              0.33175883 = fieldWeight in 2978, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1293135 = idf(docFreq=262, maxDocs=44421)
                0.03125 = fieldNorm(doc=2978)
        0.28 = coord(7/25)
    
  4. Huibers, T.W.C.; Bruza, P.D.: Situations, a general framework for studying information retrieval (1996) 0.11
    0.11247863 = sum of:
      0.11247863 = product of:
        0.46866095 = sum of:
          0.021500831 = weight(abstract_txt:which in 32) [ClassicSimilarity], result of:
            0.021500831 = score(doc=32,freq=3.0), product of:
              0.054531377 = queryWeight, product of:
                2.9137893 = idf(docFreq=6552, maxDocs=44421)
                0.018714935 = queryNorm
              0.39428368 = fieldWeight in 32, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.9137893 = idf(docFreq=6552, maxDocs=44421)
                0.078125 = fieldNorm(doc=32)
          0.042167407 = weight(abstract_txt:retrieval in 32) [ClassicSimilarity], result of:
            0.042167407 = score(doc=32,freq=4.0), product of:
              0.07762733 = queryWeight, product of:
                1.19312 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.018714935 = queryNorm
              0.5432031 = fieldWeight in 32, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=32)
          0.04778925 = weight(abstract_txt:defined in 32) [ClassicSimilarity], result of:
            0.04778925 = score(doc=32,freq=1.0), product of:
              0.11701452 = queryWeight, product of:
                1.1960547 = boost
                5.2275767 = idf(docFreq=647, maxDocs=44421)
                0.018714935 = queryNorm
              0.40840444 = fieldWeight in 32, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2275767 = idf(docFreq=647, maxDocs=44421)
                0.078125 = fieldNorm(doc=32)
          0.046640206 = weight(abstract_txt:theory in 32) [ClassicSimilarity], result of:
            0.046640206 = score(doc=32,freq=1.0), product of:
              0.1317924 = queryWeight, product of:
                1.5546118 = boost
                4.529811 = idf(docFreq=1301, maxDocs=44421)
                0.018714935 = queryNorm
              0.3538915 = fieldWeight in 32, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.529811 = idf(docFreq=1301, maxDocs=44421)
                0.078125 = fieldNorm(doc=32)
          0.08173019 = weight(abstract_txt:objects in 32) [ClassicSimilarity], result of:
            0.08173019 = score(doc=32,freq=1.0), product of:
              0.1915601 = queryWeight, product of:
                1.8742578 = boost
                5.4611917 = idf(docFreq=512, maxDocs=44421)
                0.018714935 = queryNorm
              0.4266556 = fieldWeight in 32, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4611917 = idf(docFreq=512, maxDocs=44421)
                0.078125 = fieldNorm(doc=32)
          0.22883306 = weight(abstract_txt:axioms in 32) [ClassicSimilarity], result of:
            0.22883306 = score(doc=32,freq=1.0), product of:
              0.33242893 = queryWeight, product of:
                2.0159538 = boost
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.018714935 = queryNorm
              0.6883669 = fieldWeight in 32, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.078125 = fieldNorm(doc=32)
        0.24 = coord(6/25)
    
  5. Rijsbergen, C.J. van; Lalmas, M.: Information calculus for information retrieval (1996) 0.11
    0.10834682 = sum of:
      0.10834682 = product of:
        0.4514451 = sum of:
          0.014044285 = weight(abstract_txt:which in 4269) [ClassicSimilarity], result of:
            0.014044285 = score(doc=4269,freq=2.0), product of:
              0.054531377 = queryWeight, product of:
                2.9137893 = idf(docFreq=6552, maxDocs=44421)
                0.018714935 = queryNorm
              0.25754502 = fieldWeight in 4269, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.9137893 = idf(docFreq=6552, maxDocs=44421)
                0.0625 = fieldNorm(doc=4269)
          0.029214436 = weight(abstract_txt:retrieval in 4269) [ClassicSimilarity], result of:
            0.029214436 = score(doc=4269,freq=3.0), product of:
              0.07762733 = queryWeight, product of:
                1.19312 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.018714935 = queryNorm
              0.37634215 = fieldWeight in 4269, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=4269)
          0.06621872 = weight(abstract_txt:defined in 4269) [ClassicSimilarity], result of:
            0.06621872 = score(doc=4269,freq=3.0), product of:
              0.11701452 = queryWeight, product of:
                1.1960547 = boost
                5.2275767 = idf(docFreq=647, maxDocs=44421)
                0.018714935 = queryNorm
              0.56590176 = fieldWeight in 4269, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.2275767 = idf(docFreq=647, maxDocs=44421)
                0.0625 = fieldNorm(doc=4269)
          0.06462656 = weight(abstract_txt:theory in 4269) [ClassicSimilarity], result of:
            0.06462656 = score(doc=4269,freq=3.0), product of:
              0.1317924 = queryWeight, product of:
                1.5546118 = boost
                4.529811 = idf(docFreq=1301, maxDocs=44421)
                0.018714935 = queryNorm
              0.4903664 = fieldWeight in 4269, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.529811 = idf(docFreq=1301, maxDocs=44421)
                0.0625 = fieldNorm(doc=4269)
          0.09246716 = weight(abstract_txt:objects in 4269) [ClassicSimilarity], result of:
            0.09246716 = score(doc=4269,freq=2.0), product of:
              0.1915601 = queryWeight, product of:
                1.8742578 = boost
                5.4611917 = idf(docFreq=512, maxDocs=44421)
                0.018714935 = queryNorm
              0.4827057 = fieldWeight in 4269, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4611917 = idf(docFreq=512, maxDocs=44421)
                0.0625 = fieldNorm(doc=4269)
          0.18487394 = weight(abstract_txt:attributes in 4269) [ClassicSimilarity], result of:
            0.18487394 = score(doc=4269,freq=1.0), product of:
              0.48259613 = queryWeight, product of:
                4.2071085 = boost
                6.1293135 = idf(docFreq=262, maxDocs=44421)
                0.018714935 = queryNorm
              0.3830821 = fieldWeight in 4269, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1293135 = idf(docFreq=262, maxDocs=44421)
                0.0625 = fieldNorm(doc=4269)
        0.24 = coord(6/25)