Document (#30898)

Author
Golub, K.
Title
Automated subject classification of textual Web pages, based on a controlled vocabulary : challenges and recommendations
Source
New review of hypermedia and multimedia. 12(2006) no.1, pp.11-27
Year
2006
Abstract
The primary objective of this study was to identify and address problems of applying a controlled vocabulary in automated subject classification of textual Web pages, in the area of engineering. Web pages have special characteristics such as structural information, but are at the same time rather heterogeneous. The classification approach used comprises string-to-string matching between words in a term list extracted from the Ei (Engineering Information) thesaurus and classification scheme, and words in the text to be classified. Based on a sample of 70 Web pages, a number of problems with the term list are identified. Reasons for those problems are discussed and improvements proposed. Methods for implementing the improvements are also specified, suggesting further research.
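The string-to-string matching mentioned in the abstract can be illustrated with a minimal sketch. This is not the study's implementation: the term list, the class labels, the tokenizer and the scoring (a plain count of matches per class) are all hypothetical stand-ins for a list derived from the Ei thesaurus and classification scheme.

```python
import re
from collections import Counter

# Hypothetical term list: each term maps to a class label. These entries and
# labels are invented for illustration and are not taken from the Ei vocabulary.
TERM_LIST = {
    "heat transfer": "Thermodynamics",
    "thermodynamics": "Thermodynamics",
    "bridges": "Civil engineering",
    "concrete": "Civil engineering",
}


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def classify(text, term_list=TERM_LIST):
    """Count exact word-sequence matches of each term and sum them per class."""
    words = tokenize(text)
    scores = Counter()
    for term, label in term_list.items():
        term_words = tokenize(term)
        n = len(term_words)
        # Slide a window of n words over the text and compare word by word.
        hits = sum(
            1 for i in range(len(words) - n + 1) if words[i:i + n] == term_words
        )
        if hits:
            scores[label] += hits
    # Classes ranked by total number of matched term occurrences.
    return scores.most_common()


page = "Concrete bridges: heat transfer in concrete decks."
print(classify(page))
```

Exact matching of this kind misses inflected forms and synonyms and cannot tell apart different senses of the same word, so any real term list would need further treatment beyond this sketch.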
Content
Contribution to a special issue on "Knowledge organization systems and services"
Theme
Automatic classification
Field
Engineering

Similar documents (author)

  1. Golub, K.: Automated subject classification of textual web documents (2006) 5.28
    5.277107 = sum of:
      5.277107 = weight(author_txt:golub in 600) [ClassicSimilarity], result of:
        5.277107 = fieldWeight in 600, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.443371 = idf(docFreq=25, maxDocs=44421)
          0.625 = fieldNorm(doc=600)
    
  2. Golub, K.: Subject access to information : an interdisciplinary approach (2015) 5.28
    5.277107 = sum of:
      5.277107 = weight(author_txt:golub in 1134) [ClassicSimilarity], result of:
        5.277107 = fieldWeight in 1134, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.443371 = idf(docFreq=25, maxDocs=44421)
          0.625 = fieldNorm(doc=1134)
    
  3. Golub, K.: Automated subject classification of textual documents in the context of Web-based hierarchical browsing (2011) 5.28
    5.277107 = sum of:
      5.277107 = weight(author_txt:golub in 558) [ClassicSimilarity], result of:
        5.277107 = fieldWeight in 558, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.443371 = idf(docFreq=25, maxDocs=44421)
          0.625 = fieldNorm(doc=558)
    
  4. Golub, K.: Subject access in Swedish discovery services (2018) 5.28
    5.277107 = sum of:
      5.277107 = weight(author_txt:golub in 379) [ClassicSimilarity], result of:
        5.277107 = fieldWeight in 379, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.443371 = idf(docFreq=25, maxDocs=44421)
          0.625 = fieldNorm(doc=379)
    
  5. Golub, K.: Automatic subject indexing of text (2019) 5.28
    5.277107 = sum of:
      5.277107 = weight(author_txt:golub in 268) [ClassicSimilarity], result of:
        5.277107 = fieldWeight in 268, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.443371 = idf(docFreq=25, maxDocs=44421)
          0.625 = fieldNorm(doc=268)
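
The author-match scores above can be recomputed by hand. The sketch below assumes the classic TF-IDF definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which reproduce the figures listed here. The function names are illustrative and are not Lucene API calls.

```python
import math


def classic_tf(freq):
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def classic_idf(doc_freq, max_docs):
    """Inverse document frequency in the form that matches the listing."""
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))


def field_weight(freq, doc_freq, max_docs, field_norm):
    """fieldWeight = tf * idf * fieldNorm, as in the explanation tree."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from the author_txt:golub entries above.
idf = classic_idf(doc_freq=25, max_docs=44421)
weight = field_weight(freq=1.0, doc_freq=25, max_docs=44421, field_norm=0.625)
print(idf, weight)
```

All five author matches share the same frequency, idf and fieldNorm, which is why every entry shows the identical score of 5.277107.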
    

Similar documents (content)

  1. Golub, K.; Hamon, T.; Ardö, A.: Automated classification of textual documents based on a controlled vocabulary in engineering (2007) 0.44
    0.44479212 = sum of:
      0.44479212 = product of:
        1.0108912 = sum of:
          0.06577794 = weight(abstract_txt:matching in 2461) [ClassicSimilarity], result of:
            0.06577794 = score(doc=2461,freq=2.0), product of:
              0.12317018 = queryWeight, product of:
                1.0344658 = boost
                6.0419855 = idf(docFreq=286, maxDocs=44421)
                0.019706512 = queryNorm
              0.5340411 = fieldWeight in 2461, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.0419855 = idf(docFreq=286, maxDocs=44421)
                0.0625 = fieldNorm(doc=2461)
          0.04910867 = weight(abstract_txt:extracted in 2461) [ClassicSimilarity], result of:
            0.04910867 = score(doc=2461,freq=1.0), product of:
              0.12771273 = queryWeight, product of:
                1.0533688 = boost
                6.1523914 = idf(docFreq=256, maxDocs=44421)
                0.019706512 = queryNorm
              0.38452446 = fieldWeight in 2461, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1523914 = idf(docFreq=256, maxDocs=44421)
                0.0625 = fieldNorm(doc=2461)
          0.013601817 = weight(abstract_txt:based in 2461) [ClassicSimilarity], result of:
            0.013601817 = score(doc=2461,freq=1.0), product of:
              0.0683707 = queryWeight, product of:
                1.089967 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.019706512 = queryNorm
              0.1989422 = fieldWeight in 2461, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=2461)
          0.025209723 = weight(abstract_txt:subject in 2461) [ClassicSimilarity], result of:
            0.025209723 = score(doc=2461,freq=1.0), product of:
              0.10316145 = queryWeight, product of:
                1.3388659 = boost
                3.9099448 = idf(docFreq=2419, maxDocs=44421)
                0.019706512 = queryNorm
              0.24437155 = fieldWeight in 2461, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9099448 = idf(docFreq=2419, maxDocs=44421)
                0.0625 = fieldNorm(doc=2461)
          0.046488345 = weight(abstract_txt:term in 2461) [ClassicSimilarity], result of:
            0.046488345 = score(doc=2461,freq=1.0), product of:
              0.15513203 = queryWeight, product of:
                1.6418334 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.019706512 = queryNorm
              0.29966956 = fieldWeight in 2461, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.0625 = fieldNorm(doc=2461)
          0.12997206 = weight(abstract_txt:vocabulary in 2461) [ClassicSimilarity], result of:
            0.12997206 = score(doc=2461,freq=4.0), product of:
              0.19394803 = queryWeight, product of:
                1.8357817 = boost
                5.3611083 = idf(docFreq=566, maxDocs=44421)
                0.019706512 = queryNorm
              0.67013854 = fieldWeight in 2461, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.3611083 = idf(docFreq=566, maxDocs=44421)
                0.0625 = fieldNorm(doc=2461)
          0.13665718 = weight(abstract_txt:controlled in 2461) [ClassicSimilarity], result of:
            0.13665718 = score(doc=2461,freq=4.0), product of:
              0.20054278 = queryWeight, product of:
                1.8667315 = boost
                5.4514923 = idf(docFreq=517, maxDocs=44421)
                0.019706512 = queryNorm
              0.68143654 = fieldWeight in 2461, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.4514923 = idf(docFreq=517, maxDocs=44421)
                0.0625 = fieldNorm(doc=2461)
          0.104809955 = weight(abstract_txt:automated in 2461) [ClassicSimilarity], result of:
            0.104809955 = score(doc=2461,freq=2.0), product of:
              0.21170466 = queryWeight, product of:
                1.9179777 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.019706512 = queryNorm
              0.49507627 = fieldWeight in 2461, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.0625 = fieldNorm(doc=2461)
          0.12556833 = weight(abstract_txt:engineering in 2461) [ClassicSimilarity], result of:
            0.12556833 = score(doc=2461,freq=2.0), product of:
              0.23880796 = queryWeight, product of:
                2.037055 = boost
                5.948895 = idf(docFreq=314, maxDocs=44421)
                0.019706512 = queryNorm
              0.525813 = fieldWeight in 2461, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.948895 = idf(docFreq=314, maxDocs=44421)
                0.0625 = fieldNorm(doc=2461)
          0.22074267 = weight(abstract_txt:string in 2461) [ClassicSimilarity], result of:
            0.22074267 = score(doc=2461,freq=2.0), product of:
              0.3478454 = queryWeight, product of:
                2.4585073 = boost
                7.179679 = idf(docFreq=91, maxDocs=44421)
                0.019706512 = queryNorm
              0.6345999 = fieldWeight in 2461, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.179679 = idf(docFreq=91, maxDocs=44421)
                0.0625 = fieldNorm(doc=2461)
          0.09295451 = weight(abstract_txt:classification in 2461) [ClassicSimilarity], result of:
            0.09295451 = score(doc=2461,freq=3.0), product of:
              0.21509086 = queryWeight, product of:
                2.7340367 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.019706512 = queryNorm
              0.43216392 = fieldWeight in 2461, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=2461)
        0.44 = coord(11/25)
    
  2. Golub, K.: Automated subject classification of textual documents in the context of Web-based hierarchical browsing (2011) 0.35
    0.35032207 = sum of:
      0.35032207 = product of:
        0.87580514 = sum of:
          0.06577794 = weight(abstract_txt:matching in 558) [ClassicSimilarity], result of:
            0.06577794 = score(doc=558,freq=2.0), product of:
              0.12317018 = queryWeight, product of:
                1.0344658 = boost
                6.0419855 = idf(docFreq=286, maxDocs=44421)
                0.019706512 = queryNorm
              0.5340411 = fieldWeight in 558, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.0419855 = idf(docFreq=286, maxDocs=44421)
                0.0625 = fieldNorm(doc=558)
          0.048023958 = weight(abstract_txt:classified in 558) [ClassicSimilarity], result of:
            0.048023958 = score(doc=558,freq=1.0), product of:
              0.12582512 = queryWeight, product of:
                1.0455554 = boost
                6.106756 = idf(docFreq=268, maxDocs=44421)
                0.019706512 = queryNorm
              0.38167226 = fieldWeight in 558, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.106756 = idf(docFreq=268, maxDocs=44421)
                0.0625 = fieldNorm(doc=558)
          0.013601817 = weight(abstract_txt:based in 558) [ClassicSimilarity], result of:
            0.013601817 = score(doc=558,freq=1.0), product of:
              0.0683707 = queryWeight, product of:
                1.089967 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.019706512 = queryNorm
              0.1989422 = fieldWeight in 558, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=558)
          0.025209723 = weight(abstract_txt:subject in 558) [ClassicSimilarity], result of:
            0.025209723 = score(doc=558,freq=1.0), product of:
              0.10316145 = queryWeight, product of:
                1.3388659 = boost
                3.9099448 = idf(docFreq=2419, maxDocs=44421)
                0.019706512 = queryNorm
              0.24437155 = fieldWeight in 558, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9099448 = idf(docFreq=2419, maxDocs=44421)
                0.0625 = fieldNorm(doc=558)
          0.1125591 = weight(abstract_txt:vocabulary in 558) [ClassicSimilarity], result of:
            0.1125591 = score(doc=558,freq=3.0), product of:
              0.19394803 = queryWeight, product of:
                1.8357817 = boost
                5.3611083 = idf(docFreq=566, maxDocs=44421)
                0.019706512 = queryNorm
              0.580357 = fieldWeight in 558, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.3611083 = idf(docFreq=566, maxDocs=44421)
                0.0625 = fieldNorm(doc=558)
          0.11834858 = weight(abstract_txt:controlled in 558) [ClassicSimilarity], result of:
            0.11834858 = score(doc=558,freq=3.0), product of:
              0.20054278 = queryWeight, product of:
                1.8667315 = boost
                5.4514923 = idf(docFreq=517, maxDocs=44421)
                0.019706512 = queryNorm
              0.59014136 = fieldWeight in 558, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4514923 = idf(docFreq=517, maxDocs=44421)
                0.0625 = fieldNorm(doc=558)
          0.12836546 = weight(abstract_txt:automated in 558) [ClassicSimilarity], result of:
            0.12836546 = score(doc=558,freq=3.0), product of:
              0.21170466 = queryWeight, product of:
                1.9179777 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.019706512 = queryNorm
              0.60634214 = fieldWeight in 558, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.0625 = fieldNorm(doc=558)
          0.08950858 = weight(abstract_txt:textual in 558) [ClassicSimilarity], result of:
            0.08950858 = score(doc=558,freq=1.0), product of:
              0.24009429 = queryWeight, product of:
                2.0425339 = boost
                5.9648952 = idf(docFreq=309, maxDocs=44421)
                0.019706512 = queryNorm
              0.37280595 = fieldWeight in 558, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9648952 = idf(docFreq=309, maxDocs=44421)
                0.0625 = fieldNorm(doc=558)
          0.22074267 = weight(abstract_txt:string in 558) [ClassicSimilarity], result of:
            0.22074267 = score(doc=558,freq=2.0), product of:
              0.3478454 = queryWeight, product of:
                2.4585073 = boost
                7.179679 = idf(docFreq=91, maxDocs=44421)
                0.019706512 = queryNorm
              0.6345999 = fieldWeight in 558, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.179679 = idf(docFreq=91, maxDocs=44421)
                0.0625 = fieldNorm(doc=558)
          0.05366731 = weight(abstract_txt:classification in 558) [ClassicSimilarity], result of:
            0.05366731 = score(doc=558,freq=1.0), product of:
              0.21509086 = queryWeight, product of:
                2.7340367 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.019706512 = queryNorm
              0.24950996 = fieldWeight in 558, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=558)
        0.4 = coord(10/25)
    
  3. Golub, K.; Lykke, M.: Automated classification of web pages in hierarchical browsing (2009) 0.28
    0.27585027 = sum of:
      0.27585027 = product of:
        0.6896256 = sum of:
          0.016831389 = weight(abstract_txt:based in 601) [ClassicSimilarity], result of:
            0.016831389 = score(doc=601,freq=2.0), product of:
              0.0683707 = queryWeight, product of:
                1.089967 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.019706512 = queryNorm
              0.24617839 = fieldWeight in 601, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0546875 = fieldNorm(doc=601)
          0.022058507 = weight(abstract_txt:subject in 601) [ClassicSimilarity], result of:
            0.022058507 = score(doc=601,freq=1.0), product of:
              0.10316145 = queryWeight, product of:
                1.3388659 = boost
                3.9099448 = idf(docFreq=2419, maxDocs=44421)
                0.019706512 = queryNorm
              0.2138251 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9099448 = idf(docFreq=2419, maxDocs=44421)
                0.0546875 = fieldNorm(doc=601)
          0.040677305 = weight(abstract_txt:term in 601) [ClassicSimilarity], result of:
            0.040677305 = score(doc=601,freq=1.0), product of:
              0.15513203 = queryWeight, product of:
                1.6418334 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.019706512 = queryNorm
              0.26221088 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.0546875 = fieldNorm(doc=601)
          0.056695025 = weight(abstract_txt:words in 601) [ClassicSimilarity], result of:
            0.056695025 = score(doc=601,freq=1.0), product of:
              0.19356641 = queryWeight, product of:
                1.8339747 = boost
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.019706512 = queryNorm
              0.29289702 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.0546875 = fieldNorm(doc=601)
          0.059787516 = weight(abstract_txt:controlled in 601) [ClassicSimilarity], result of:
            0.059787516 = score(doc=601,freq=1.0), product of:
              0.20054278 = queryWeight, product of:
                1.8667315 = boost
                5.4514923 = idf(docFreq=517, maxDocs=44421)
                0.019706512 = queryNorm
              0.2981285 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4514923 = idf(docFreq=517, maxDocs=44421)
                0.0546875 = fieldNorm(doc=601)
          0.09170871 = weight(abstract_txt:automated in 601) [ClassicSimilarity], result of:
            0.09170871 = score(doc=601,freq=2.0), product of:
              0.21170466 = queryWeight, product of:
                1.9179777 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.019706512 = queryNorm
              0.43319175 = fieldWeight in 601, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.0546875 = fieldNorm(doc=601)
          0.07769144 = weight(abstract_txt:engineering in 601) [ClassicSimilarity], result of:
            0.07769144 = score(doc=601,freq=1.0), product of:
              0.23880796 = queryWeight, product of:
                2.037055 = boost
                5.948895 = idf(docFreq=314, maxDocs=44421)
                0.019706512 = queryNorm
              0.3253302 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.948895 = idf(docFreq=314, maxDocs=44421)
                0.0546875 = fieldNorm(doc=601)
          0.124393694 = weight(abstract_txt:improvements in 601) [ClassicSimilarity], result of:
            0.124393694 = score(doc=601,freq=2.0), product of:
              0.25941133 = queryWeight, product of:
                2.1231117 = boost
                6.2002096 = idf(docFreq=244, maxDocs=44421)
                0.019706512 = queryNorm
              0.47952297 = fieldWeight in 601, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2002096 = idf(docFreq=244, maxDocs=44421)
                0.0546875 = fieldNorm(doc=601)
          0.044036984 = weight(abstract_txt:problems in 601) [ClassicSimilarity], result of:
            0.044036984 = score(doc=601,freq=1.0), product of:
              0.18723002 = queryWeight, product of:
                2.2090814 = boost
                4.300847 = idf(docFreq=1636, maxDocs=44421)
                0.019706512 = queryNorm
              0.23520258 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.300847 = idf(docFreq=1636, maxDocs=44421)
                0.0546875 = fieldNorm(doc=601)
          0.15574504 = weight(abstract_txt:classification in 601) [ClassicSimilarity], result of:
            0.15574504 = score(doc=601,freq=11.0), product of:
              0.21509086 = queryWeight, product of:
                2.7340367 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.019706512 = queryNorm
              0.72408956 = fieldWeight in 601, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0546875 = fieldNorm(doc=601)
        0.4 = coord(10/25)
    
  4. Dumais, S.T.: Latent semantic analysis (2003) 0.24
    0.24032229 = sum of:
      0.24032229 = product of:
        0.54618704 = sum of:
          0.023256015 = weight(abstract_txt:matching in 3462) [ClassicSimilarity], result of:
            0.023256015 = score(doc=3462,freq=1.0), product of:
              0.12317018 = queryWeight, product of:
                1.0344658 = boost
                6.0419855 = idf(docFreq=286, maxDocs=44421)
                0.019706512 = queryNorm
              0.18881205 = fieldWeight in 3462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0419855 = idf(docFreq=286, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.009617937 = weight(abstract_txt:based in 3462) [ClassicSimilarity], result of:
            0.009617937 = score(doc=3462,freq=2.0), product of:
              0.0683707 = queryWeight, product of:
                1.089967 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.019706512 = queryNorm
              0.14067337 = fieldWeight in 3462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.036757786 = weight(abstract_txt:specified in 3462) [ClassicSimilarity], result of:
            0.036757786 = score(doc=3462,freq=1.0), product of:
              0.16712764 = queryWeight, product of:
                1.2050012 = boost
                7.0380287 = idf(docFreq=105, maxDocs=44421)
                0.019706512 = queryNorm
              0.2199384 = fieldWeight in 3462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0380287 = idf(docFreq=105, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.02183226 = weight(abstract_txt:subject in 3462) [ClassicSimilarity], result of:
            0.02183226 = score(doc=3462,freq=3.0), product of:
              0.10316145 = queryWeight, product of:
                1.3388659 = boost
                3.9099448 = idf(docFreq=2419, maxDocs=44421)
                0.019706512 = queryNorm
              0.21163197 = fieldWeight in 3462, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9099448 = idf(docFreq=2419, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.023244172 = weight(abstract_txt:term in 3462) [ClassicSimilarity], result of:
            0.023244172 = score(doc=3462,freq=1.0), product of:
              0.15513203 = queryWeight, product of:
                1.6418334 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.019706512 = queryNorm
              0.14983478 = fieldWeight in 3462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.11222704 = weight(abstract_txt:words in 3462) [ClassicSimilarity], result of:
            0.11222704 = score(doc=3462,freq=12.0), product of:
              0.19356641 = queryWeight, product of:
                1.8339747 = boost
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.019706512 = queryNorm
              0.5797857 = fieldWeight in 3462, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.06498603 = weight(abstract_txt:vocabulary in 3462) [ClassicSimilarity], result of:
            0.06498603 = score(doc=3462,freq=4.0), product of:
              0.19394803 = queryWeight, product of:
                1.8357817 = boost
                5.3611083 = idf(docFreq=566, maxDocs=44421)
                0.019706512 = queryNorm
              0.33506927 = fieldWeight in 3462, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.3611083 = idf(docFreq=566, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.03301627 = weight(abstract_txt:list in 3462) [ClassicSimilarity], result of:
            0.03301627 = score(doc=3462,freq=1.0), product of:
              0.19602467 = queryWeight, product of:
                1.8455836 = boost
                5.389733 = idf(docFreq=550, maxDocs=44421)
                0.019706512 = queryNorm
              0.16842915 = fieldWeight in 3462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.389733 = idf(docFreq=550, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.096631214 = weight(abstract_txt:controlled in 3462) [ClassicSimilarity], result of:
            0.096631214 = score(doc=3462,freq=8.0), product of:
              0.20054278 = queryWeight, product of:
                1.8667315 = boost
                5.4514923 = idf(docFreq=517, maxDocs=44421)
                0.019706512 = queryNorm
              0.4818484 = fieldWeight in 3462, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.4514923 = idf(docFreq=517, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.05032798 = weight(abstract_txt:problems in 3462) [ClassicSimilarity], result of:
            0.05032798 = score(doc=3462,freq=4.0), product of:
              0.18723002 = queryWeight, product of:
                2.2090814 = boost
                4.300847 = idf(docFreq=1636, maxDocs=44421)
                0.019706512 = queryNorm
              0.26880294 = fieldWeight in 3462, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.300847 = idf(docFreq=1636, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.07429038 = weight(abstract_txt:pages in 3462) [ClassicSimilarity], result of:
            0.07429038 = score(doc=3462,freq=1.0), product of:
              0.4240891 = queryWeight, product of:
                3.8390336 = boost
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.019706512 = queryNorm
              0.17517635 = fieldWeight in 3462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
        0.44 = coord(11/25)
    
  5. Wang, J.: Automatic thesaurus development : term extraction from title metadata (2006) 0.22
    0.220478 = sum of:
      0.220478 = product of:
        0.61243886 = sum of:
          0.04316259 = weight(abstract_txt:applying in 63) [ClassicSimilarity], result of:
            0.04316259 = score(doc=63,freq=1.0), product of:
              0.11718366 = queryWeight, product of:
                1.0090133 = boost
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.019706512 = queryNorm
              0.36833283 = fieldWeight in 63, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.0625 = fieldNorm(doc=63)
          0.04910867 = weight(abstract_txt:extracted in 63) [ClassicSimilarity], result of:
            0.04910867 = score(doc=63,freq=1.0), product of:
              0.12771273 = queryWeight, product of:
                1.0533688 = boost
                6.1523914 = idf(docFreq=256, maxDocs=44421)
                0.019706512 = queryNorm
              0.38452446 = fieldWeight in 63, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1523914 = idf(docFreq=256, maxDocs=44421)
                0.0625 = fieldNorm(doc=63)
          0.013601817 = weight(abstract_txt:based in 63) [ClassicSimilarity], result of:
            0.013601817 = score(doc=63,freq=1.0), product of:
              0.0683707 = queryWeight, product of:
                1.089967 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.019706512 = queryNorm
              0.1989422 = fieldWeight in 63, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=63)
          0.03565193 = weight(abstract_txt:subject in 63) [ClassicSimilarity], result of:
            0.03565193 = score(doc=63,freq=2.0), product of:
              0.10316145 = queryWeight, product of:
                1.3388659 = boost
                3.9099448 = idf(docFreq=2419, maxDocs=44421)
                0.019706512 = queryNorm
              0.34559354 = fieldWeight in 63, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9099448 = idf(docFreq=2419, maxDocs=44421)
                0.0625 = fieldNorm(doc=63)
          0.11222704 = weight(abstract_txt:words in 63) [ClassicSimilarity], result of:
            0.11222704 = score(doc=63,freq=3.0), product of:
              0.19356641 = queryWeight, product of:
                1.8339747 = boost
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.019706512 = queryNorm
              0.5797857 = fieldWeight in 63, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.0625 = fieldNorm(doc=63)
          0.1125591 = weight(abstract_txt:vocabulary in 63) [ClassicSimilarity], result of:
            0.1125591 = score(doc=63,freq=3.0), product of:
              0.19394803 = queryWeight, product of:
                1.8357817 = boost
                5.3611083 = idf(docFreq=566, maxDocs=44421)
                0.019706512 = queryNorm
              0.580357 = fieldWeight in 63, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.3611083 = idf(docFreq=566, maxDocs=44421)
                0.0625 = fieldNorm(doc=63)
          0.11834858 = weight(abstract_txt:controlled in 63) [ClassicSimilarity], result of:
            0.11834858 = score(doc=63,freq=3.0), product of:
              0.20054278 = queryWeight, product of:
                1.8667315 = boost
                5.4514923 = idf(docFreq=517, maxDocs=44421)
                0.019706512 = queryNorm
              0.59014136 = fieldWeight in 63, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4514923 = idf(docFreq=517, maxDocs=44421)
                0.0625 = fieldNorm(doc=63)
          0.07411183 = weight(abstract_txt:automated in 63) [ClassicSimilarity], result of:
            0.07411183 = score(doc=63,freq=1.0), product of:
              0.21170466 = queryWeight, product of:
                1.9179777 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.019706512 = queryNorm
              0.3500718 = fieldWeight in 63, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.0625 = fieldNorm(doc=63)
          0.05366731 = weight(abstract_txt:classification in 63) [ClassicSimilarity], result of:
            0.05366731 = score(doc=63,freq=1.0), product of:
              0.21509086 = queryWeight, product of:
                2.7340367 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.019706512 = queryNorm
              0.24950996 = fieldWeight in 63, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=63)
        0.36 = coord(9/25)
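
The content-similarity scores combine a query-side weight, a field-side weight and a coordination factor. The sketch below follows the structure of the explanation trees above: each matching term contributes queryWeight * fieldWeight, and the sum is scaled by the fraction of query terms that matched. The function names are illustrative, not Lucene API calls, and the results agree with the listing only up to floating-point rounding.

```python
import math


def term_score(freq, doc_freq, max_docs, field_norm, boost, query_norm):
    """Score of one matching term: queryWeight * fieldWeight."""
    idf = 1.0 + math.log(max_docs / (doc_freq + 1.0))
    query_weight = boost * idf * query_norm             # query side
    field_weight = math.sqrt(freq) * idf * field_norm   # document side
    return query_weight * field_weight


def coord_score(sum_of_term_scores, matched_terms, total_terms):
    """Scale the summed term scores by the share of query terms matched."""
    return sum_of_term_scores * (matched_terms / total_terms)


# "matching" in doc 2461, with values copied from the first content result.
matching = term_score(
    freq=2.0, doc_freq=286, max_docs=44421,
    field_norm=0.0625, boost=1.0344658, query_norm=0.019706512,
)

# Final score of the first content result: 11 of 25 query terms matched.
final = coord_score(1.0108912, matched_terms=11, total_terms=25)
print(matching, final)
```

The coordination factor explains why result 4 (Dumais), which matches 11 terms, can rank below results that match only 10: its summed term scores are lower because its fieldNorm of 0.03125 is half that of the shorter abstracts.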