Document (#22812)

Author
French, J.C.
Powell, A.L.
Schulman, E.
Title
Using clustering strategies for creating authority files
Source
Journal of the American Society for Information Science. 51(2000) no.8, pp.774-786
Year
2000
Abstract
As more online databases are integrated into digital libraries, the issue of quality control of the data becomes increasingly important, especially as it relates to the effective retrieval of information. Authority work, the need to discover and reconcile variant forms of strings in bibliographical entries, will become more critical in the future. Spelling variants, misspellings, and transliteration differences will all increase the difficulty of retrieving information. We investigate a number of approximate string matching techniques that have traditionally been used to help with this problem. We then introduce the notion of approximate word matching and show how it can be used to improve detection and categorization of variant forms. We demonstrate the utility of these approaches using data from the Astrophysics Data System and show how we can reduce the human effort involved in the creation of authority files.
Theme
Authority files
Computational linguistics
Retrieval algorithms

Similar documents (author)

  1. French, J.C.; Knight, J.C.; Powell, A.L.: Applying hypertext structures to software documentation (1997) 4.76
    4.762533 = sum of:
      4.762533 = sum of:
        2.124956 = weight(author_txt:powell in 3256) [ClassicSimilarity], result of:
          2.124956 = score(doc=3256,freq=1.0), product of:
            0.6545668 = queryWeight, product of:
              8.656945 = idf(docFreq=20, maxDocs=44421)
              0.075611755 = queryNorm
            3.2463546 = fieldWeight in 3256, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.656945 = idf(docFreq=20, maxDocs=44421)
              0.375 = fieldNorm(doc=3256)
        2.6375773 = weight(author_txt:french in 3256) [ClassicSimilarity], result of:
          2.6375773 = score(doc=3256,freq=1.0), product of:
            0.75600415 = queryWeight, product of:
              1.0746946 = boost
              9.303573 = idf(docFreq=10, maxDocs=44421)
              0.075611755 = queryNorm
            3.4888396 = fieldWeight in 3256, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.303573 = idf(docFreq=10, maxDocs=44421)
              0.375 = fieldNorm(doc=3256)
    
  2. French, J.C.; Powell, A.L.; Gey, F.; Perelman, N.: Exploiting manual indexing to improve collection selection and retrieval effectiveness (2002) 3.97
    3.9687777 = sum of:
      3.9687777 = sum of:
        1.7707965 = weight(author_txt:powell in 4896) [ClassicSimilarity], result of:
          1.7707965 = score(doc=4896,freq=1.0), product of:
            0.6545668 = queryWeight, product of:
              8.656945 = idf(docFreq=20, maxDocs=44421)
              0.075611755 = queryNorm
            2.7052953 = fieldWeight in 4896, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.656945 = idf(docFreq=20, maxDocs=44421)
              0.3125 = fieldNorm(doc=4896)
        2.197981 = weight(author_txt:french in 4896) [ClassicSimilarity], result of:
          2.197981 = score(doc=4896,freq=1.0), product of:
            0.75600415 = queryWeight, product of:
              1.0746946 = boost
              9.303573 = idf(docFreq=10, maxDocs=44421)
              0.075611755 = queryNorm
            2.9073665 = fieldWeight in 4896, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.303573 = idf(docFreq=10, maxDocs=44421)
              0.3125 = fieldNorm(doc=4896)
    
  3. French, J.: Changes in reference services (1995) 2.20
    2.197981 = sum of:
      2.197981 = product of:
        4.395962 = sum of:
          4.395962 = weight(author_txt:french in 3748) [ClassicSimilarity], result of:
            4.395962 = score(doc=3748,freq=1.0), product of:
              0.75600415 = queryWeight, product of:
                1.0746946 = boost
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.075611755 = queryNorm
              5.814733 = fieldWeight in 3748, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.625 = fieldNorm(doc=3748)
        0.5 = coord(1/2)
    
  4. Powell, A.P.: ZYindex: bringing order to electronic chaos (1989) 1.77
    1.7707965 = sum of:
      1.7707965 = product of:
        3.541593 = sum of:
          3.541593 = weight(author_txt:powell in 3232) [ClassicSimilarity], result of:
            3.541593 = score(doc=3232,freq=1.0), product of:
              0.6545668 = queryWeight, product of:
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.075611755 = queryNorm
              5.4105906 = fieldWeight in 3232, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.625 = fieldNorm(doc=3232)
        0.5 = coord(1/2)
    
  5. Powell, J.: Spinning the World-Wide Web : an HTML primer (1995) 1.77
    1.7707965 = sum of:
      1.7707965 = product of:
        3.541593 = sum of:
          3.541593 = weight(author_txt:powell in 6012) [ClassicSimilarity], result of:
            3.541593 = score(doc=6012,freq=1.0), product of:
              0.6545668 = queryWeight, product of:
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.075611755 = queryNorm
              5.4105906 = fieldWeight in 6012, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.625 = fieldNorm(doc=6012)
        0.5 = coord(1/2)
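
The per-term lines in the score trees above follow Lucene's ClassicSimilarity (TF-IDF) arithmetic: idf = 1 + ln(maxDocs / (docFreq + 1)), tf = sqrt(freq), queryWeight = boost * idf * queryNorm, and fieldWeight = tf * idf * fieldNorm. The following is a minimal sketch that recomputes the first result (doc 3256) from the numbers printed in its tree. The function names are illustrative and not part of any Lucene API; queryNorm and fieldNorm are copied from the listing rather than derived.

```python
import math

def classic_idf(doc_freq, max_docs):
    # ClassicSimilarity inverse document frequency
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def classic_tf(freq):
    # ClassicSimilarity term frequency: square root of the raw count
    return math.sqrt(freq)

def term_score(freq, doc_freq, max_docs, field_norm, query_norm, boost=1.0):
    # score = queryWeight * fieldWeight, as shown in each "product of" line
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm
    field_weight = classic_tf(freq) * idf * field_norm
    return query_weight * field_weight

# Values copied from the tree for doc 3256 (hypothetical variable names)
QUERY_NORM = 0.075611755
powell = term_score(1.0, 20, 44421, 0.375, QUERY_NORM)
french = term_score(1.0, 10, 44421, 0.375, QUERY_NORM, boost=1.0746946)
total = powell + french

print(round(powell, 4), round(french, 4), round(total, 4))
```

The results agree with the listed 2.124956, 2.6375773 and 4.762533 up to the single-precision rounding of the original output.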
    

Similar documents (content)

  1. Galvez, C.; Moya-Anegón, F.: Approximate personal name-matching through finite-state graphs (2007) 0.42
    0.41830453 = sum of:
      0.41830453 = product of:
        1.161957 = sum of:
          0.01475845 = weight(abstract_txt:used in 1614) [ClassicSimilarity], result of:
            0.01475845 = score(doc=1614,freq=1.0), product of:
              0.070336945 = queryWeight, product of:
                1.0189575 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.020561282 = queryNorm
              0.20982501 = fieldWeight in 1614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.0625 = fieldNorm(doc=1614)
          0.07217622 = weight(abstract_txt:string in 1614) [ClassicSimilarity], result of:
            0.07217622 = score(doc=1614,freq=1.0), product of:
              0.16084556 = queryWeight, product of:
                1.0895668 = boost
                7.179679 = idf(docFreq=91, maxDocs=44421)
                0.020561282 = queryNorm
              0.44872993 = fieldWeight in 1614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.179679 = idf(docFreq=91, maxDocs=44421)
                0.0625 = fieldNorm(doc=1614)
          0.16433592 = weight(abstract_txt:variants in 1614) [ClassicSimilarity], result of:
            0.16433592 = score(doc=1614,freq=4.0), product of:
              0.17536706 = queryWeight, product of:
                1.1376884 = boost
                7.496775 = idf(docFreq=66, maxDocs=44421)
                0.020561282 = queryNorm
              0.9370969 = fieldWeight in 1614, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.496775 = idf(docFreq=66, maxDocs=44421)
                0.0625 = fieldNorm(doc=1614)
          0.08759835 = weight(abstract_txt:spelling in 1614) [ClassicSimilarity], result of:
            0.08759835 = score(doc=1614,freq=1.0), product of:
              0.1830109 = queryWeight, product of:
                1.1622186 = boost
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.020561282 = queryNorm
              0.47865102 = fieldWeight in 1614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.0625 = fieldNorm(doc=1614)
          0.14514773 = weight(abstract_txt:misspellings in 1614) [ClassicSimilarity], result of:
            0.14514773 = score(doc=1614,freq=1.0), product of:
              0.25626335 = queryWeight, product of:
                1.3752847 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.020561282 = queryNorm
              0.56640065 = fieldWeight in 1614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.0625 = fieldNorm(doc=1614)
          0.12651648 = weight(abstract_txt:forms in 1614) [ClassicSimilarity], result of:
            0.12651648 = score(doc=1614,freq=4.0), product of:
              0.18559562 = queryWeight, product of:
                1.6551913 = boost
                5.453425 = idf(docFreq=516, maxDocs=44421)
                0.020561282 = queryNorm
              0.6816781 = fieldWeight in 1614, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.453425 = idf(docFreq=516, maxDocs=44421)
                0.0625 = fieldNorm(doc=1614)
          0.08602964 = weight(abstract_txt:matching in 1614) [ClassicSimilarity], result of:
            0.08602964 = score(doc=1614,freq=1.0), product of:
              0.22781819 = queryWeight, product of:
                1.8338277 = boost
                6.0419855 = idf(docFreq=286, maxDocs=44421)
                0.020561282 = queryNorm
              0.3776241 = fieldWeight in 1614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0419855 = idf(docFreq=286, maxDocs=44421)
                0.0625 = fieldNorm(doc=1614)
          0.27346855 = weight(abstract_txt:variant in 1614) [ClassicSimilarity], result of:
            0.27346855 = score(doc=1614,freq=3.0), product of:
              0.34149748 = queryWeight, product of:
                2.245216 = boost
                7.3974023 = idf(docFreq=73, maxDocs=44421)
                0.020561282 = queryNorm
              0.8007923 = fieldWeight in 1614, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.3974023 = idf(docFreq=73, maxDocs=44421)
                0.0625 = fieldNorm(doc=1614)
          0.19192573 = weight(abstract_txt:approximate in 1614) [ClassicSimilarity], result of:
            0.19192573 = score(doc=1614,freq=1.0), product of:
              0.38896614 = queryWeight, product of:
                2.3961844 = boost
                7.894805 = idf(docFreq=44, maxDocs=44421)
                0.020561282 = queryNorm
              0.4934253 = fieldWeight in 1614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.894805 = idf(docFreq=44, maxDocs=44421)
                0.0625 = fieldNorm(doc=1614)
        0.36 = coord(9/25)
    
  2. Pereira, D.A.; Ribeiro-Neto, B.; Ziviani, N.; Laender, A.H.F.; Gonçalves, M.A.: A generic Web-based entity resolution framework (2011) 0.32
    0.32016563 = sum of:
      0.32016563 = product of:
        0.8893489 = sum of:
          0.08216796 = weight(abstract_txt:variants in 450) [ClassicSimilarity], result of:
            0.08216796 = score(doc=450,freq=1.0), product of:
              0.17536706 = queryWeight, product of:
                1.1376884 = boost
                7.496775 = idf(docFreq=66, maxDocs=44421)
                0.020561282 = queryNorm
              0.46854845 = fieldWeight in 450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.496775 = idf(docFreq=66, maxDocs=44421)
                0.0625 = fieldNorm(doc=450)
          0.08759835 = weight(abstract_txt:spelling in 450) [ClassicSimilarity], result of:
            0.08759835 = score(doc=450,freq=1.0), product of:
              0.1830109 = queryWeight, product of:
                1.1622186 = boost
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.020561282 = queryNorm
              0.47865102 = fieldWeight in 450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.0625 = fieldNorm(doc=450)
          0.04701935 = weight(abstract_txt:show in 450) [ClassicSimilarity], result of:
            0.04701935 = score(doc=450,freq=2.0), product of:
              0.12087341 = queryWeight, product of:
                1.3357639 = boost
                4.400995 = idf(docFreq=1480, maxDocs=44421)
                0.020561282 = queryNorm
              0.38899666 = fieldWeight in 450, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.400995 = idf(docFreq=1480, maxDocs=44421)
                0.0625 = fieldNorm(doc=450)
          0.14514773 = weight(abstract_txt:misspellings in 450) [ClassicSimilarity], result of:
            0.14514773 = score(doc=450,freq=1.0), product of:
              0.25626335 = queryWeight, product of:
                1.3752847 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.020561282 = queryNorm
              0.56640065 = fieldWeight in 450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.0625 = fieldNorm(doc=450)
          0.030558797 = weight(abstract_txt:data in 450) [ClassicSimilarity], result of:
            0.030558797 = score(doc=450,freq=2.0), product of:
              0.10381679 = queryWeight, product of:
                1.5161556 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.020561282 = queryNorm
              0.29435313 = fieldWeight in 450, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=450)
          0.089460656 = weight(abstract_txt:forms in 450) [ClassicSimilarity], result of:
            0.089460656 = score(doc=450,freq=2.0), product of:
              0.18559562 = queryWeight, product of:
                1.6551913 = boost
                5.453425 = idf(docFreq=516, maxDocs=44421)
                0.020561282 = queryNorm
              0.48201922 = fieldWeight in 450, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.453425 = idf(docFreq=516, maxDocs=44421)
                0.0625 = fieldNorm(doc=450)
          0.07319175 = weight(abstract_txt:files in 450) [ClassicSimilarity], result of:
            0.07319175 = score(doc=450,freq=1.0), product of:
              0.20454918 = queryWeight, product of:
                1.7376536 = boost
                5.7251167 = idf(docFreq=393, maxDocs=44421)
                0.020561282 = queryNorm
              0.3578198 = fieldWeight in 450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7251167 = idf(docFreq=393, maxDocs=44421)
                0.0625 = fieldNorm(doc=450)
          0.15788715 = weight(abstract_txt:variant in 450) [ClassicSimilarity], result of:
            0.15788715 = score(doc=450,freq=1.0), product of:
              0.34149748 = queryWeight, product of:
                2.245216 = boost
                7.3974023 = idf(docFreq=73, maxDocs=44421)
                0.020561282 = queryNorm
              0.46233764 = fieldWeight in 450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3974023 = idf(docFreq=73, maxDocs=44421)
                0.0625 = fieldNorm(doc=450)
          0.17631714 = weight(abstract_txt:authority in 450) [ClassicSimilarity], result of:
            0.17631714 = score(doc=450,freq=4.0), product of:
              0.26507154 = queryWeight, product of:
                2.4226549 = boost
                5.321345 = idf(docFreq=589, maxDocs=44421)
                0.020561282 = queryNorm
              0.6651681 = fieldWeight in 450, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.321345 = idf(docFreq=589, maxDocs=44421)
                0.0625 = fieldNorm(doc=450)
        0.36 = coord(9/25)
    
  3. Järvelin, A.; Keskustalo, H.; Sormunen, E.; Saastamoinen, M.; Kettunen, K.: Information retrieval from historical newspaper collections in highly inflectional languages : a query expansion approach (2016) 0.32
    0.31945556 = sum of:
      0.31945556 = product of:
        0.9982987 = sum of:
          0.01475845 = weight(abstract_txt:used in 4223) [ClassicSimilarity], result of:
            0.01475845 = score(doc=4223,freq=1.0), product of:
              0.070336945 = queryWeight, product of:
                1.0189575 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.020561282 = queryNorm
              0.20982501 = fieldWeight in 4223, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.0625 = fieldNorm(doc=4223)
          0.015279199 = weight(abstract_txt:more in 4223) [ClassicSimilarity], result of:
            0.015279199 = score(doc=4223,freq=1.0), product of:
              0.071981914 = queryWeight, product of:
                1.0308039 = boost
                3.3962307 = idf(docFreq=4044, maxDocs=44421)
                0.020561282 = queryNorm
              0.21226442 = fieldWeight in 4223, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3962307 = idf(docFreq=4044, maxDocs=44421)
                0.0625 = fieldNorm(doc=4223)
          0.016112335 = weight(abstract_txt:using in 4223) [ClassicSimilarity], result of:
            0.016112335 = score(doc=4223,freq=1.0), product of:
              0.07457536 = queryWeight, product of:
                1.049209 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.020561282 = queryNorm
              0.21605442 = fieldWeight in 4223, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0625 = fieldNorm(doc=4223)
          0.14435244 = weight(abstract_txt:string in 4223) [ClassicSimilarity], result of:
            0.14435244 = score(doc=4223,freq=4.0), product of:
              0.16084556 = queryWeight, product of:
                1.0895668 = boost
                7.179679 = idf(docFreq=91, maxDocs=44421)
                0.020561282 = queryNorm
              0.89745986 = fieldWeight in 4223, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.179679 = idf(docFreq=91, maxDocs=44421)
                0.0625 = fieldNorm(doc=4223)
          0.14231908 = weight(abstract_txt:variants in 4223) [ClassicSimilarity], result of:
            0.14231908 = score(doc=4223,freq=3.0), product of:
              0.17536706 = queryWeight, product of:
                1.1376884 = boost
                7.496775 = idf(docFreq=66, maxDocs=44421)
                0.020561282 = queryNorm
              0.8115497 = fieldWeight in 4223, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.496775 = idf(docFreq=66, maxDocs=44421)
                0.0625 = fieldNorm(doc=4223)
          0.10956648 = weight(abstract_txt:forms in 4223) [ClassicSimilarity], result of:
            0.10956648 = score(doc=4223,freq=3.0), product of:
              0.18559562 = queryWeight, product of:
                1.6551913 = boost
                5.453425 = idf(docFreq=516, maxDocs=44421)
                0.020561282 = queryNorm
              0.59035057 = fieldWeight in 4223, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.453425 = idf(docFreq=516, maxDocs=44421)
                0.0625 = fieldNorm(doc=4223)
          0.17205928 = weight(abstract_txt:matching in 4223) [ClassicSimilarity], result of:
            0.17205928 = score(doc=4223,freq=4.0), product of:
              0.22781819 = queryWeight, product of:
                1.8338277 = boost
                6.0419855 = idf(docFreq=286, maxDocs=44421)
                0.020561282 = queryNorm
              0.7552482 = fieldWeight in 4223, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.0419855 = idf(docFreq=286, maxDocs=44421)
                0.0625 = fieldNorm(doc=4223)
          0.38385147 = weight(abstract_txt:approximate in 4223) [ClassicSimilarity], result of:
            0.38385147 = score(doc=4223,freq=4.0), product of:
              0.38896614 = queryWeight, product of:
                2.3961844 = boost
                7.894805 = idf(docFreq=44, maxDocs=44421)
                0.020561282 = queryNorm
              0.9868506 = fieldWeight in 4223, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.894805 = idf(docFreq=44, maxDocs=44421)
                0.0625 = fieldNorm(doc=4223)
        0.32 = coord(8/25)
    
  4. Bellaachia, A.; Amor-Tijani, G.: Proper nouns in English-Arabic cross language information retrieval (2008) 0.28
    0.27582493 = sum of:
      0.27582493 = product of:
        0.8619529 = sum of:
          0.016112335 = weight(abstract_txt:using in 3372) [ClassicSimilarity], result of:
            0.016112335 = score(doc=3372,freq=1.0), product of:
              0.07457536 = queryWeight, product of:
                1.049209 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.020561282 = queryNorm
              0.21605442 = fieldWeight in 3372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0625 = fieldNorm(doc=3372)
          0.10207258 = weight(abstract_txt:string in 3372) [ClassicSimilarity], result of:
            0.10207258 = score(doc=3372,freq=2.0), product of:
              0.16084556 = queryWeight, product of:
                1.0895668 = boost
                7.179679 = idf(docFreq=91, maxDocs=44421)
                0.020561282 = queryNorm
              0.6345999 = fieldWeight in 3372, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.179679 = idf(docFreq=91, maxDocs=44421)
                0.0625 = fieldNorm(doc=3372)
          0.08216796 = weight(abstract_txt:variants in 3372) [ClassicSimilarity], result of:
            0.08216796 = score(doc=3372,freq=1.0), product of:
              0.17536706 = queryWeight, product of:
                1.1376884 = boost
                7.496775 = idf(docFreq=66, maxDocs=44421)
                0.020561282 = queryNorm
              0.46854845 = fieldWeight in 3372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.496775 = idf(docFreq=66, maxDocs=44421)
                0.0625 = fieldNorm(doc=3372)
          0.123882785 = weight(abstract_txt:spelling in 3372) [ClassicSimilarity], result of:
            0.123882785 = score(doc=3372,freq=2.0), product of:
              0.1830109 = queryWeight, product of:
                1.1622186 = boost
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.020561282 = queryNorm
              0.67691475 = fieldWeight in 3372, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.0625 = fieldNorm(doc=3372)
          0.19087948 = weight(abstract_txt:transliteration in 3372) [ClassicSimilarity], result of:
            0.19087948 = score(doc=3372,freq=3.0), product of:
              0.21327768 = queryWeight, product of:
                1.2546484 = boost
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.020561282 = queryNorm
              0.894981 = fieldWeight in 3372, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.0625 = fieldNorm(doc=3372)
          0.0332477 = weight(abstract_txt:show in 3372) [ClassicSimilarity], result of:
            0.0332477 = score(doc=3372,freq=1.0), product of:
              0.12087341 = queryWeight, product of:
                1.3357639 = boost
                4.400995 = idf(docFreq=1480, maxDocs=44421)
                0.020561282 = queryNorm
              0.27506217 = fieldWeight in 3372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.400995 = idf(docFreq=1480, maxDocs=44421)
                0.0625 = fieldNorm(doc=3372)
          0.12166428 = weight(abstract_txt:matching in 3372) [ClassicSimilarity], result of:
            0.12166428 = score(doc=3372,freq=2.0), product of:
              0.22781819 = queryWeight, product of:
                1.8338277 = boost
                6.0419855 = idf(docFreq=286, maxDocs=44421)
                0.020561282 = queryNorm
              0.5340411 = fieldWeight in 3372, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.0419855 = idf(docFreq=286, maxDocs=44421)
                0.0625 = fieldNorm(doc=3372)
          0.19192573 = weight(abstract_txt:approximate in 3372) [ClassicSimilarity], result of:
            0.19192573 = score(doc=3372,freq=1.0), product of:
              0.38896614 = queryWeight, product of:
                2.3961844 = boost
                7.894805 = idf(docFreq=44, maxDocs=44421)
                0.020561282 = queryNorm
              0.4934253 = fieldWeight in 3372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.894805 = idf(docFreq=44, maxDocs=44421)
                0.0625 = fieldNorm(doc=3372)
        0.32 = coord(8/25)
    
  5. Maaten, L. van den: Accelerating t-SNE using Tree-Based Algorithms (2014) 0.25
    0.25123152 = sum of:
      0.25123152 = product of:
        0.8972554 = sum of:
          0.0313074 = weight(abstract_txt:used in 4886) [ClassicSimilarity], result of:
            0.0313074 = score(doc=4886,freq=2.0), product of:
              0.070336945 = queryWeight, product of:
                1.0189575 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.020561282 = queryNorm
              0.44510606 = fieldWeight in 4886, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.09375 = fieldNorm(doc=4886)
          0.024168503 = weight(abstract_txt:using in 4886) [ClassicSimilarity], result of:
            0.024168503 = score(doc=4886,freq=1.0), product of:
              0.07457536 = queryWeight, product of:
                1.049209 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.020561282 = queryNorm
              0.32408163 = fieldWeight in 4886, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.09375 = fieldNorm(doc=4886)
          0.123251945 = weight(abstract_txt:variants in 4886) [ClassicSimilarity], result of:
            0.123251945 = score(doc=4886,freq=1.0), product of:
              0.17536706 = queryWeight, product of:
                1.1376884 = boost
                7.496775 = idf(docFreq=66, maxDocs=44421)
                0.020561282 = queryNorm
              0.7028227 = fieldWeight in 4886, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.496775 = idf(docFreq=66, maxDocs=44421)
                0.09375 = fieldNorm(doc=4886)
          0.049871553 = weight(abstract_txt:show in 4886) [ClassicSimilarity], result of:
            0.049871553 = score(doc=4886,freq=1.0), product of:
              0.12087341 = queryWeight, product of:
                1.3357639 = boost
                4.400995 = idf(docFreq=1480, maxDocs=44421)
                0.020561282 = queryNorm
              0.41259325 = fieldWeight in 4886, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.400995 = idf(docFreq=1480, maxDocs=44421)
                0.09375 = fieldNorm(doc=4886)
          0.045838196 = weight(abstract_txt:data in 4886) [ClassicSimilarity], result of:
            0.045838196 = score(doc=4886,freq=2.0), product of:
              0.10381679 = queryWeight, product of:
                1.5161556 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.020561282 = queryNorm
              0.4415297 = fieldWeight in 4886, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.09375 = fieldNorm(doc=4886)
          0.33492923 = weight(abstract_txt:variant in 4886) [ClassicSimilarity], result of:
            0.33492923 = score(doc=4886,freq=2.0), product of:
              0.34149748 = queryWeight, product of:
                2.245216 = boost
                7.3974023 = idf(docFreq=73, maxDocs=44421)
                0.020561282 = queryNorm
              0.9807663 = fieldWeight in 4886, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.3974023 = idf(docFreq=73, maxDocs=44421)
                0.09375 = fieldNorm(doc=4886)
          0.2878886 = weight(abstract_txt:approximate in 4886) [ClassicSimilarity], result of:
            0.2878886 = score(doc=4886,freq=1.0), product of:
              0.38896614 = queryWeight, product of:
                2.3961844 = boost
                7.894805 = idf(docFreq=44, maxDocs=44421)
                0.020561282 = queryNorm
              0.74013793 = fieldWeight in 4886, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.894805 = idf(docFreq=44, maxDocs=44421)
                0.09375 = fieldNorm(doc=4886)
        0.28 = coord(7/25)
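
Each document total above is the sum of its matching term scores multiplied by the coordination factor coord(overlap/maxOverlap), where overlap is the number of query terms found in the document. A minimal sketch of that last step, using the term scores printed for doc 4886 and doc 3748; the helper names are illustrative, and the maxOverlap values 25 and 2 are taken from the listing as given.

```python
def coord(overlap, max_overlap):
    # Fraction of query terms that matched the document
    return overlap / max_overlap

def combine(term_scores, max_overlap):
    # Sum the per-term scores, then scale by the coordination factor
    return sum(term_scores) * coord(len(term_scores), max_overlap)

# Term scores copied from the tree for doc 4886 (7 of 25 query terms matched)
tsne_terms = [
    0.0313074,    # used
    0.024168503,  # using
    0.123251945,  # variants
    0.049871553,  # show
    0.045838196,  # data
    0.33492923,   # variant
    0.2878886,    # approximate
]
tsne_total = combine(tsne_terms, 25)

# Author query, doc 3748: only "french" matched out of 2 terms
french_only = combine([4.395962], 2)

print(round(tsne_total, 6), round(french_only, 6))
```

This reproduces the listed totals 0.25123152 and 2.197981. When every query term matches, as in the first two author results, the factor is 1 and the listing omits the coord line.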