Document (#34625)

Author
Heidorn, P.B.
Wei, Q.
Title
Automatic metadata extraction from museum specimen labels
Source
Metadata for semantic and social applications : proceedings of the International Conference on Dublin Core and Metadata Applications, Berlin, 22 - 26 September 2008, DC 2008: Berlin, Germany / ed. by Jane Greenberg and Wolfgang Klas
Imprint
Göttingen : Univ.-Verl.
Year
2008
Pages
S.57-68
Abstract
This paper describes the information properties of museum specimen labels and machine learning tools to automatically extract Darwin Core (DwC) and other metadata from these labels processed through Optical Character Recognition (OCR). The DwC is a metadata profile describing the core set of access points for search and retrieval of natural history collections and observation databases. Using the HERBIS Learning System (HLS) we extract 74 independent elements from these labels. The automated text extraction tools are provided as a web service so that users can reference digital images of specimens and receive back an extended Darwin Core XML representation of the content of the label. This automated extraction task is made more difficult by the high variability of museum label formats, OCR errors and the open class nature of some elements. In this paper we introduce our overall system architecture, and variability robust solutions including, the application of Hidden Markov and Naïve Bayes machine learning models, data cleaning, use of field element identifiers, and specialist learning models. The techniques developed here could be adapted to any metadata extraction situation with noisy text and weakly ordered elements.
Content
Vgl. unter: http://dcpapers.dublincore.org/ojs/pubs/article/view/919/915.
Theme
Metadaten
Area
Museen

Similar documents (author)

  1. Heidorn, P.B.: ¬The identification of index terms in natural language object descriptions (1999) 5.94
    5.9401517 = sum of:
      5.9401517 = weight(author_txt:heidorn in 681) [ClassicSimilarity], result of:
        5.9401517 = fieldWeight in 681, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.625 = fieldNorm(doc=681)
    
  2. Heidorn, P.B.: Image retrieval as linguistic and nonlinguistic visual model matching (1999) 5.94
    5.9401517 = sum of:
      5.9401517 = weight(author_txt:heidorn in 966) [ClassicSimilarity], result of:
        5.9401517 = fieldWeight in 966, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.625 = fieldNorm(doc=966)
    
  3. Cui, H.; Heidorn, P.B.: ¬The reusability of induced knowledge for the automatic semantic markup of taxonomic descriptions (2007) 4.75
    4.7521214 = sum of:
      4.7521214 = weight(author_txt:heidorn in 1084) [ClassicSimilarity], result of:
        4.7521214 = fieldWeight in 1084, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.5 = fieldNorm(doc=1084)
    
  4. Jensen, K.; Heidorn, G.E.; Richardson, S.D.: Natural language processing : the PLNLP approach (19??) 3.56
    3.5640912 = sum of:
      3.5640912 = weight(author_txt:heidorn in 5363) [ClassicSimilarity], result of:
        3.5640912 = fieldWeight in 5363, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.375 = fieldNorm(doc=5363)
    
  5. Koshman, S.; Heidorn, B.; Kim, H.: ACM SIGIR '93 provides information retrieval roundup (1993) 3.56
    3.5640912 = sum of:
      3.5640912 = weight(author_txt:heidorn in 6692) [ClassicSimilarity], result of:
        3.5640912 = fieldWeight in 6692, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.375 = fieldNorm(doc=6692)
    

Similar documents (content)

  1. Billal, B.; Fonseca, A.; Sadat, F.; Lounis, H.: Semi-supervised learning and social media text analysis towards multi-labeling categorization (2017) 0.21
    0.20898387 = sum of:
      0.20898387 = product of:
        0.6530746 = sum of:
          0.028157761 = weight(abstract_txt:text in 95) [ClassicSimilarity], result of:
            0.028157761 = score(doc=95,freq=4.0), product of:
              0.06370945 = queryWeight, product of:
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.015766224 = queryNorm
              0.44197148 = fieldWeight in 95, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.05893034 = weight(abstract_txt:noisy in 95) [ClassicSimilarity], result of:
            0.05893034 = score(doc=95,freq=1.0), product of:
              0.13133317 = queryWeight, product of:
                1.0152436 = boost
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.015766224 = queryNorm
              0.44870874 = fieldWeight in 95, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.0067247213 = weight(abstract_txt:from in 95) [ClassicSimilarity], result of:
            0.0067247213 = score(doc=95,freq=1.0), product of:
              0.044562723 = queryWeight, product of:
                1.0243056 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.015766224 = queryNorm
              0.15090463 = fieldWeight in 95, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.031318564 = weight(abstract_txt:machine in 95) [ClassicSimilarity], result of:
            0.031318564 = score(doc=95,freq=1.0), product of:
              0.10856579 = queryWeight, product of:
                1.3054029 = boost
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.015766224 = queryNorm
              0.28847542 = fieldWeight in 95, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.1616538 = weight(abstract_txt:label in 95) [ClassicSimilarity], result of:
            0.1616538 = score(doc=95,freq=4.0), product of:
              0.20426583 = queryWeight, product of:
                1.7905891 = boost
                7.2355595 = idf(docFreq=86, maxDocs=44421)
                0.015766224 = queryNorm
              0.79138935 = fieldWeight in 95, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.2355595 = idf(docFreq=86, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.10175559 = weight(abstract_txt:learning in 95) [ClassicSimilarity], result of:
            0.10175559 = score(doc=95,freq=5.0), product of:
              0.17547584 = queryWeight, product of:
                2.3470466 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.015766224 = queryNorm
              0.57988375 = fieldWeight in 95, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.10131624 = weight(abstract_txt:extraction in 95) [ClassicSimilarity], result of:
            0.10131624 = score(doc=95,freq=1.0), product of:
              0.2991951 = queryWeight, product of:
                3.0647166 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.015766224 = queryNorm
              0.33862934 = fieldWeight in 95, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.1632176 = weight(abstract_txt:labels in 95) [ClassicSimilarity], result of:
            0.1632176 = score(doc=95,freq=1.0), product of:
              0.41116214 = queryWeight, product of:
                3.592689 = boost
                7.2588162 = idf(docFreq=84, maxDocs=44421)
                0.015766224 = queryNorm
              0.39696652 = fieldWeight in 95, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2588162 = idf(docFreq=84, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
        0.32 = coord(8/25)
    
  2. Ru, C.; Tang, J.; Li, S.; Xie, S.; Wang, T.: Using semantic similarity to reduce wrong labels in distant supervision for relation extraction (2018) 0.16
    0.15838487 = sum of:
      0.15838487 = product of:
        0.79192436 = sum of:
          0.010868791 = weight(abstract_txt:from in 55) [ClassicSimilarity], result of:
            0.010868791 = score(doc=55,freq=2.0), product of:
              0.044562723 = queryWeight, product of:
                1.0243056 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.015766224 = queryNorm
              0.2438987 = fieldWeight in 55, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0625 = fieldNorm(doc=55)
          0.024095654 = weight(abstract_txt:models in 55) [ClassicSimilarity], result of:
            0.024095654 = score(doc=55,freq=1.0), product of:
              0.083391726 = queryWeight, product of:
                1.1440883 = boost
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.015766224 = queryNorm
              0.28894538 = fieldWeight in 55, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.0625 = fieldNorm(doc=55)
          0.108276315 = weight(abstract_txt:core in 55) [ClassicSimilarity], result of:
            0.108276315 = score(doc=55,freq=4.0), product of:
              0.16375574 = queryWeight, product of:
                1.96355 = boost
                5.2896495 = idf(docFreq=608, maxDocs=44421)
                0.015766224 = queryNorm
              0.6612062 = fieldWeight in 55, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.2896495 = idf(docFreq=608, maxDocs=44421)
                0.0625 = fieldNorm(doc=55)
          0.23157997 = weight(abstract_txt:extraction in 55) [ClassicSimilarity], result of:
            0.23157997 = score(doc=55,freq=4.0), product of:
              0.2991951 = queryWeight, product of:
                3.0647166 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.015766224 = queryNorm
              0.7740099 = fieldWeight in 55, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.0625 = fieldNorm(doc=55)
          0.41710362 = weight(abstract_txt:labels in 55) [ClassicSimilarity], result of:
            0.41710362 = score(doc=55,freq=5.0), product of:
              0.41116214 = queryWeight, product of:
                3.592689 = boost
                7.2588162 = idf(docFreq=84, maxDocs=44421)
                0.015766224 = queryNorm
              1.0144504 = fieldWeight in 55, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.2588162 = idf(docFreq=84, maxDocs=44421)
                0.0625 = fieldNorm(doc=55)
        0.2 = coord(5/25)
    
  3. Cui, H.: Competency evaluation of plant character ontologies against domain literature (2010) 0.13
    0.13187842 = sum of:
      0.13187842 = product of:
        0.47099435 = sum of:
          0.027868954 = weight(abstract_txt:text in 453) [ClassicSimilarity], result of:
            0.027868954 = score(doc=453,freq=3.0), product of:
              0.06370945 = queryWeight, product of:
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.015766224 = queryNorm
              0.4374383 = fieldWeight in 453, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=453)
          0.015370792 = weight(abstract_txt:from in 453) [ClassicSimilarity], result of:
            0.015370792 = score(doc=453,freq=4.0), product of:
              0.044562723 = queryWeight, product of:
                1.0243056 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.015766224 = queryNorm
              0.34492487 = fieldWeight in 453, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0625 = fieldNorm(doc=453)
          0.021470016 = weight(abstract_txt:tools in 453) [ClassicSimilarity], result of:
            0.021470016 = score(doc=453,freq=1.0), product of:
              0.07721803 = queryWeight, product of:
                1.1009243 = boost
                4.448705 = idf(docFreq=1411, maxDocs=44421)
                0.015766224 = queryNorm
              0.27804407 = fieldWeight in 453, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.448705 = idf(docFreq=1411, maxDocs=44421)
                0.0625 = fieldNorm(doc=453)
          0.10123489 = weight(abstract_txt:specimens in 453) [ClassicSimilarity], result of:
            0.10123489 = score(doc=453,freq=1.0), product of:
              0.1723352 = queryWeight, product of:
                1.1629741 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.015766224 = queryNorm
              0.5874302 = fieldWeight in 453, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0625 = fieldNorm(doc=453)
          0.03579264 = weight(abstract_txt:machine in 453) [ClassicSimilarity], result of:
            0.03579264 = score(doc=453,freq=1.0), product of:
              0.10856579 = queryWeight, product of:
                1.3054029 = boost
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.015766224 = queryNorm
              0.3296862 = fieldWeight in 453, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.0625 = fieldNorm(doc=453)
          0.042851184 = weight(abstract_txt:automated in 453) [ClassicSimilarity], result of:
            0.042851184 = score(doc=453,freq=1.0), product of:
              0.12240685 = queryWeight, product of:
                1.3861203 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.015766224 = queryNorm
              0.3500718 = fieldWeight in 453, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.0625 = fieldNorm(doc=453)
          0.22640589 = weight(abstract_txt:specimen in 453) [ClassicSimilarity], result of:
            0.22640589 = score(doc=453,freq=1.0), product of:
              0.37132624 = queryWeight, product of:
                2.4142146 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.015766224 = queryNorm
              0.6097223 = fieldWeight in 453, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.0625 = fieldNorm(doc=453)
        0.28 = coord(7/25)
    
  4. Laparra, E.; Binford-Walsh, A.; Emerson, K.; Miller, M.L.; López-Hoffman, L.; Currim, F.; Bethard, S.: Addressing structural hurdles for metadata extraction from environmental impact statements (2023) 0.13
    0.12644574 = sum of:
      0.12644574 = product of:
        0.52685726 = sum of:
          0.013311495 = weight(abstract_txt:from in 2044) [ClassicSimilarity], result of:
            0.013311495 = score(doc=2044,freq=3.0), product of:
              0.044562723 = queryWeight, product of:
                1.0243056 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.015766224 = queryNorm
              0.29871368 = fieldWeight in 2044, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0625 = fieldNorm(doc=2044)
          0.050618444 = weight(abstract_txt:machine in 2044) [ClassicSimilarity], result of:
            0.050618444 = score(doc=2044,freq=2.0), product of:
              0.10856579 = queryWeight, product of:
                1.3054029 = boost
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.015766224 = queryNorm
              0.4662467 = fieldWeight in 2044, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.0625 = fieldNorm(doc=2044)
          0.098945044 = weight(abstract_txt:extract in 2044) [ClassicSimilarity], result of:
            0.098945044 = score(doc=2044,freq=2.0), product of:
              0.16972658 = queryWeight, product of:
                1.6321986 = boost
                6.595522 = idf(docFreq=164, maxDocs=44421)
                0.015766224 = queryNorm
              0.5829673 = fieldWeight in 2044, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.595522 = idf(docFreq=164, maxDocs=44421)
                0.0625 = fieldNorm(doc=2044)
          0.07354958 = weight(abstract_txt:learning in 2044) [ClassicSimilarity], result of:
            0.07354958 = score(doc=2044,freq=2.0), product of:
              0.17547584 = queryWeight, product of:
                2.3470466 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.015766224 = queryNorm
              0.41914365 = fieldWeight in 2044, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0625 = fieldNorm(doc=2044)
          0.1266809 = weight(abstract_txt:metadata in 2044) [ClassicSimilarity], result of:
            0.1266809 = score(doc=2044,freq=5.0), product of:
              0.1857767 = queryWeight, product of:
                2.414953 = boost
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.015766224 = queryNorm
              0.6818987 = fieldWeight in 2044, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.0625 = fieldNorm(doc=2044)
          0.16375177 = weight(abstract_txt:extraction in 2044) [ClassicSimilarity], result of:
            0.16375177 = score(doc=2044,freq=2.0), product of:
              0.2991951 = queryWeight, product of:
                3.0647166 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.015766224 = queryNorm
              0.5473076 = fieldWeight in 2044, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.0625 = fieldNorm(doc=2044)
        0.24 = coord(6/25)
    
  5. Hooland, S. van; Verborgh, R.: Linked data for Lilibraries, archives and museums : how to clean, link, and publish your metadata (2014) 0.12
    0.120523036 = sum of:
      0.120523036 = product of:
        0.4304394 = sum of:
          0.008151593 = weight(abstract_txt:from in 153) [ClassicSimilarity], result of:
            0.008151593 = score(doc=153,freq=2.0), product of:
              0.044562723 = queryWeight, product of:
                1.0243056 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.015766224 = queryNorm
              0.18292403 = fieldWeight in 153, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.046875 = fieldNorm(doc=153)
          0.10275841 = weight(abstract_txt:cleaning in 153) [ClassicSimilarity], result of:
            0.10275841 = score(doc=153,freq=3.0), product of:
              0.14620116 = queryWeight, product of:
                1.0711702 = boost
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.015766224 = queryNorm
              0.7028563 = fieldWeight in 153, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.046875 = fieldNorm(doc=153)
          0.02277239 = weight(abstract_txt:tools in 153) [ClassicSimilarity], result of:
            0.02277239 = score(doc=153,freq=2.0), product of:
              0.07721803 = queryWeight, product of:
                1.1009243 = boost
                4.448705 = idf(docFreq=1411, maxDocs=44421)
                0.015766224 = queryNorm
              0.29491025 = fieldWeight in 153, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.448705 = idf(docFreq=1411, maxDocs=44421)
                0.046875 = fieldNorm(doc=153)
          0.01807174 = weight(abstract_txt:models in 153) [ClassicSimilarity], result of:
            0.01807174 = score(doc=153,freq=1.0), product of:
              0.083391726 = queryWeight, product of:
                1.1440883 = boost
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.015766224 = queryNorm
              0.21670903 = fieldWeight in 153, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.046875 = fieldNorm(doc=153)
          0.03213839 = weight(abstract_txt:automated in 153) [ClassicSimilarity], result of:
            0.03213839 = score(doc=153,freq=1.0), product of:
              0.12240685 = queryWeight, product of:
                1.3861203 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.015766224 = queryNorm
              0.26255384 = fieldWeight in 153, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.046875 = fieldNorm(doc=153)
          0.07135585 = weight(abstract_txt:museum in 153) [ClassicSimilarity], result of:
            0.07135585 = score(doc=153,freq=1.0), product of:
              0.23847331 = queryWeight, product of:
                2.3695376 = boost
                6.3833475 = idf(docFreq=203, maxDocs=44421)
                0.015766224 = queryNorm
              0.29921943 = fieldWeight in 153, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3833475 = idf(docFreq=203, maxDocs=44421)
                0.046875 = fieldNorm(doc=153)
          0.17519103 = weight(abstract_txt:metadata in 153) [ClassicSimilarity], result of:
            0.17519103 = score(doc=153,freq=17.0), product of:
              0.1857767 = queryWeight, product of:
                2.414953 = boost
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.015766224 = queryNorm
              0.9430194 = fieldWeight in 153, product of:
                4.1231055 = tf(freq=17.0), with freq of:
                  17.0 = termFreq=17.0
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.046875 = fieldNorm(doc=153)
        0.28 = coord(7/25)