Document (#7154)

Author
Ottaviani, J.S.
Title
¬The fractal nature of relevance : a hypothesis
Source
Journal of the American Society for Information Science. 45(1994) no.4, S.263-272
Year
1994
Abstract
This article proposes a new model, based on fractal geometry, for clusters of relevant documents. It reflects the relatively simple iterative search process used by interactive onlinesearchers. The untested model has the additional sttractive features of high-lighting the logarithmis growth of clusters, which produces complexities in relevance judgements and document clusters not realized by typical models. It indicates that clusters formed using dynamic search strategies appear topoligical distinct, indecomposable, and result from chaotic processes. The model also provides an intuitive definition and representation of cluster dimension which differentiates, where typical models do not, between them. The fractal model, then, gives an indication of what I believe are the limits on clustering relevant documents

Similar documents (content)

  1. Yang, C.C.; Wang, F.L.: Hierarchical summarization of large documents (2008) 0.17
    0.16739938 = sum of:
      0.16739938 = product of:
        1.0462462 = sum of:
          0.029907536 = weight(abstract_txt:documents in 2719) [ClassicSimilarity], result of:
            0.029907536 = score(doc=2719,freq=2.0), product of:
              0.0820613 = queryWeight, product of:
                1.2526281 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.01588799 = queryNorm
              0.3644536 = fieldWeight in 2719, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0625 = fieldNorm(doc=2719)
          0.042154707 = weight(abstract_txt:models in 2719) [ClassicSimilarity], result of:
            0.042154707 = score(doc=2719,freq=2.0), product of:
              0.10316095 = queryWeight, product of:
                1.404464 = boost
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.01588799 = queryNorm
              0.40863046 = fieldWeight in 2719, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.0625 = fieldNorm(doc=2719)
          0.053904932 = weight(abstract_txt:model in 2719) [ClassicSimilarity], result of:
            0.053904932 = score(doc=2719,freq=2.0), product of:
              0.1531253 = queryWeight, product of:
                2.4198666 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.01588799 = queryNorm
              0.35203153 = fieldWeight in 2719, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.0625 = fieldNorm(doc=2719)
          0.92027897 = weight(abstract_txt:fractal in 2719) [ClassicSimilarity], result of:
            0.92027897 = score(doc=2719,freq=6.0), product of:
              0.6395693 = queryWeight, product of:
                4.2829437 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.01588799 = queryNorm
              1.4389043 = fieldWeight in 2719, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0625 = fieldNorm(doc=2719)
        0.16 = coord(4/25)
    
  2. Abdo, A.H.; Cointet, J.-P.; Bourret, P.; Cambrosio, A,: Domain-topic models with chained dimensions : charting an emergent domain of a major oncology conference (2022) 0.13
    0.1349742 = sum of:
      0.1349742 = product of:
        0.5623925 = sum of:
          0.043038704 = weight(abstract_txt:dimension in 1620) [ClassicSimilarity], result of:
            0.043038704 = score(doc=1620,freq=1.0), product of:
              0.104598165 = queryWeight, product of:
                6.5834737 = idf(docFreq=166, maxDocs=44421)
                0.01588799 = queryNorm
              0.4114671 = fieldWeight in 1620, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5834737 = idf(docFreq=166, maxDocs=44421)
                0.0625 = fieldNorm(doc=1620)
          0.036629103 = weight(abstract_txt:documents in 1620) [ClassicSimilarity], result of:
            0.036629103 = score(doc=1620,freq=3.0), product of:
              0.0820613 = queryWeight, product of:
                1.2526281 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.01588799 = queryNorm
              0.4463627 = fieldWeight in 1620, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0625 = fieldNorm(doc=1620)
          0.042154707 = weight(abstract_txt:models in 1620) [ClassicSimilarity], result of:
            0.042154707 = score(doc=1620,freq=2.0), product of:
              0.10316095 = queryWeight, product of:
                1.404464 = boost
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.01588799 = queryNorm
              0.40863046 = fieldWeight in 1620, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.0625 = fieldNorm(doc=1620)
          0.029938987 = weight(abstract_txt:relevant in 1620) [ClassicSimilarity], result of:
            0.029938987 = score(doc=1620,freq=1.0), product of:
              0.103463225 = queryWeight, product of:
                1.4065201 = boost
                4.6298943 = idf(docFreq=1177, maxDocs=44421)
                0.01588799 = queryNorm
              0.2893684 = fieldWeight in 1620, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6298943 = idf(docFreq=1177, maxDocs=44421)
                0.0625 = fieldNorm(doc=1620)
          0.07623309 = weight(abstract_txt:model in 1620) [ClassicSimilarity], result of:
            0.07623309 = score(doc=1620,freq=4.0), product of:
              0.1531253 = queryWeight, product of:
                2.4198666 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.01588799 = queryNorm
              0.49784777 = fieldWeight in 1620, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.0625 = fieldNorm(doc=1620)
          0.33439785 = weight(abstract_txt:clusters in 1620) [ClassicSimilarity], result of:
            0.33439785 = score(doc=1620,freq=4.0), product of:
              0.410324 = queryWeight, product of:
                3.9612424 = boost
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.01588799 = queryNorm
              0.8149605 = fieldWeight in 1620, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.0625 = fieldNorm(doc=1620)
        0.24 = coord(6/25)
    
  3. Desai, M.; Spink, A.: ¬A algorithm to cluster documents based on relevance (2005) 0.08
    0.08464392 = sum of:
      0.08464392 = product of:
        0.42321962 = sum of:
          0.029449143 = weight(abstract_txt:search in 2035) [ClassicSimilarity], result of:
            0.029449143 = score(doc=2035,freq=4.0), product of:
              0.06446486 = queryWeight, product of:
                1.1102339 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.01588799 = queryNorm
              0.45682475 = fieldWeight in 2035, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.0625 = fieldNorm(doc=2035)
          0.05595188 = weight(abstract_txt:documents in 2035) [ClassicSimilarity], result of:
            0.05595188 = score(doc=2035,freq=7.0), product of:
              0.0820613 = queryWeight, product of:
                1.2526281 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.01588799 = queryNorm
              0.6818303 = fieldWeight in 2035, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0625 = fieldNorm(doc=2035)
          0.08981696 = weight(abstract_txt:relevant in 2035) [ClassicSimilarity], result of:
            0.08981696 = score(doc=2035,freq=9.0), product of:
              0.103463225 = queryWeight, product of:
                1.4065201 = boost
                4.6298943 = idf(docFreq=1177, maxDocs=44421)
                0.01588799 = queryNorm
              0.8681052 = fieldWeight in 2035, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                4.6298943 = idf(docFreq=1177, maxDocs=44421)
                0.0625 = fieldNorm(doc=2035)
          0.08080272 = weight(abstract_txt:relevance in 2035) [ClassicSimilarity], result of:
            0.08080272 = score(doc=2035,freq=5.0), product of:
              0.11728845 = queryWeight, product of:
                1.4975474 = boost
                4.929532 = idf(docFreq=872, maxDocs=44421)
                0.01588799 = queryNorm
              0.68892306 = fieldWeight in 2035, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.929532 = idf(docFreq=872, maxDocs=44421)
                0.0625 = fieldNorm(doc=2035)
          0.16719893 = weight(abstract_txt:clusters in 2035) [ClassicSimilarity], result of:
            0.16719893 = score(doc=2035,freq=1.0), product of:
              0.410324 = queryWeight, product of:
                3.9612424 = boost
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.01588799 = queryNorm
              0.40748024 = fieldWeight in 2035, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.0625 = fieldNorm(doc=2035)
        0.2 = coord(5/25)
    
  4. Losee, R.M.; Church Jr., L.: Are two document clusters better than one? : the cluster performance question for information retrieval (2005) 0.08
    0.081750676 = sum of:
      0.081750676 = product of:
        0.51094174 = sum of:
          0.070333436 = weight(abstract_txt:hypothesis in 4270) [ClassicSimilarity], result of:
            0.070333436 = score(doc=4270,freq=1.0), product of:
              0.11074692 = queryWeight, product of:
                1.0289725 = boost
                6.774214 = idf(docFreq=137, maxDocs=44421)
                0.01588799 = queryNorm
              0.63508254 = fieldWeight in 4270, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.774214 = idf(docFreq=137, maxDocs=44421)
                0.09375 = fieldNorm(doc=4270)
          0.031721734 = weight(abstract_txt:documents in 4270) [ClassicSimilarity], result of:
            0.031721734 = score(doc=4270,freq=1.0), product of:
              0.0820613 = queryWeight, product of:
                1.2526281 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.01588799 = queryNorm
              0.38656145 = fieldWeight in 4270, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.09375 = fieldNorm(doc=4270)
          0.05420411 = weight(abstract_txt:relevance in 4270) [ClassicSimilarity], result of:
            0.05420411 = score(doc=4270,freq=1.0), product of:
              0.11728845 = queryWeight, product of:
                1.4975474 = boost
                4.929532 = idf(docFreq=872, maxDocs=44421)
                0.01588799 = queryNorm
              0.46214363 = fieldWeight in 4270, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.929532 = idf(docFreq=872, maxDocs=44421)
                0.09375 = fieldNorm(doc=4270)
          0.35468248 = weight(abstract_txt:clusters in 4270) [ClassicSimilarity], result of:
            0.35468248 = score(doc=4270,freq=2.0), product of:
              0.410324 = queryWeight, product of:
                3.9612424 = boost
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.01588799 = queryNorm
              0.8643961 = fieldWeight in 4270, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.09375 = fieldNorm(doc=4270)
        0.16 = coord(4/25)
    
  5. Klobas, J.E.: Beyond information quality : fitness for purpose and electronic information resource use (1995) 0.08
    0.0798514 = sum of:
      0.0798514 = product of:
        0.33271417 = sum of:
          0.058423985 = weight(abstract_txt:formed in 2013) [ClassicSimilarity], result of:
            0.058423985 = score(doc=2013,freq=1.0), product of:
              0.11051097 = queryWeight, product of:
                1.0278758 = boost
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.01588799 = queryNorm
              0.5286714 = fieldWeight in 2013, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.078125 = fieldNorm(doc=2013)
          0.061217006 = weight(abstract_txt:believe in 2013) [ClassicSimilarity], result of:
            0.061217006 = score(doc=2013,freq=1.0), product of:
              0.11400555 = queryWeight, product of:
                1.0440011 = boost
                6.8731537 = idf(docFreq=124, maxDocs=44421)
                0.01588799 = queryNorm
              0.53696513 = fieldWeight in 2013, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8731537 = idf(docFreq=124, maxDocs=44421)
                0.078125 = fieldNorm(doc=2013)
          0.08299754 = weight(abstract_txt:indication in 2013) [ClassicSimilarity], result of:
            0.08299754 = score(doc=2013,freq=1.0), product of:
              0.13965444 = queryWeight, product of:
                1.1554877 = boost
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.01588799 = queryNorm
              0.59430647 = fieldWeight in 2013, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.078125 = fieldNorm(doc=2013)
          0.03725985 = weight(abstract_txt:models in 2013) [ClassicSimilarity], result of:
            0.03725985 = score(doc=2013,freq=1.0), product of:
              0.10316095 = queryWeight, product of:
                1.404464 = boost
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.01588799 = queryNorm
              0.36118174 = fieldWeight in 2013, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.078125 = fieldNorm(doc=2013)
          0.045170087 = weight(abstract_txt:relevance in 2013) [ClassicSimilarity], result of:
            0.045170087 = score(doc=2013,freq=1.0), product of:
              0.11728845 = queryWeight, product of:
                1.4975474 = boost
                4.929532 = idf(docFreq=872, maxDocs=44421)
                0.01588799 = queryNorm
              0.38511968 = fieldWeight in 2013, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.929532 = idf(docFreq=872, maxDocs=44421)
                0.078125 = fieldNorm(doc=2013)
          0.04764568 = weight(abstract_txt:model in 2013) [ClassicSimilarity], result of:
            0.04764568 = score(doc=2013,freq=1.0), product of:
              0.1531253 = queryWeight, product of:
                2.4198666 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.01588799 = queryNorm
              0.31115484 = fieldWeight in 2013, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.078125 = fieldNorm(doc=2013)
        0.24 = coord(6/25)