Document (#15821)

Author
Riloff, E.
Title
¬An empirical study of automated dictionary construction for information extraction in three domains
Source
Artificial intelligence. 85(1996) nos.1/2, S.101-134
Year
1996
Abstract
AutoSlog is a system that addresses the knowledge engineering bottleneck for information extraction. AutoSlog automatically creates domain specific dictionaries for information extraction, given an appropriate training corpus. Describes experiments with AutoSlog in terrorism, joint ventures and microelectronics domains. Compares the performance of AutoSlog across the 3 domains, discusses the lessons learned and presents results from 2 experiments which demonstrate that novice users can generate effective dictionaries using AutoSlog
Theme
Automatisches Indexieren
Computerlinguistik
Object
AutoSlog

Similar documents (content)

  1. El idrissi esserhrouchni, O. et al.; Frikh, B.; Ouhbi, B.: OntologyLine : a new framework for learning non-taxonomic relations of domain ontology (2016) 0.13
    0.13356368 = sum of:
      0.13356368 = product of:
        0.66781837 = sum of:
          0.043576606 = weight(abstract_txt:construction in 4379) [ClassicSimilarity], result of:
            0.043576606 = score(doc=4379,freq=1.0), product of:
              0.10231704 = queryWeight, product of:
                1.0480251 = boost
                5.4514923 = idf(docFreq=517, maxDocs=44421)
                0.01790857 = queryNorm
              0.42589784 = fieldWeight in 4379, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4514923 = idf(docFreq=517, maxDocs=44421)
                0.078125 = fieldNorm(doc=4379)
          0.011420417 = weight(abstract_txt:information in 4379) [ClassicSimilarity], result of:
            0.011420417 = score(doc=4379,freq=1.0), product of:
              0.060432993 = queryWeight, product of:
                1.395068 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.01790857 = queryNorm
              0.18897653 = fieldWeight in 4379, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.078125 = fieldNorm(doc=4379)
          0.22332428 = weight(abstract_txt:bottleneck in 4379) [ClassicSimilarity], result of:
            0.22332428 = score(doc=4379,freq=1.0), product of:
              0.3041373 = queryWeight, product of:
                1.8068935 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.01790857 = queryNorm
              0.73428774 = fieldWeight in 4379, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.078125 = fieldNorm(doc=4379)
          0.19792242 = weight(abstract_txt:domains in 4379) [ClassicSimilarity], result of:
            0.19792242 = score(doc=4379,freq=2.0), product of:
              0.32122263 = queryWeight, product of:
                3.216336 = boost
                5.576784 = idf(docFreq=456, maxDocs=44421)
                0.01790857 = queryNorm
              0.6161534 = fieldWeight in 4379, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.576784 = idf(docFreq=456, maxDocs=44421)
                0.078125 = fieldNorm(doc=4379)
          0.19157462 = weight(abstract_txt:extraction in 4379) [ClassicSimilarity], result of:
            0.19157462 = score(doc=4379,freq=1.0), product of:
              0.3960148 = queryWeight, product of:
                3.571199 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.01790857 = queryNorm
              0.48375618 = fieldWeight in 4379, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.078125 = fieldNorm(doc=4379)
        0.2 = coord(5/25)
    
  2. Conde, A.; Larrañaga, M.; Arruarte, A.; Elorriaga, J.A.; Roth, D.: litewi: a combined term extraction and entity linking method for eliciting educational ontologies from textbooks (2016) 0.12
    0.12497792 = sum of:
      0.12497792 = product of:
        0.52074134 = sum of:
          0.03245464 = weight(abstract_txt:appropriate in 3645) [ClassicSimilarity], result of:
            0.03245464 = score(doc=3645,freq=1.0), product of:
              0.097552165 = queryWeight, product of:
                1.023331 = boost
                5.3230414 = idf(docFreq=588, maxDocs=44421)
                0.01790857 = queryNorm
              0.3326901 = fieldWeight in 3645, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3230414 = idf(docFreq=588, maxDocs=44421)
                0.0625 = fieldNorm(doc=3645)
          0.049301304 = weight(abstract_txt:construction in 3645) [ClassicSimilarity], result of:
            0.049301304 = score(doc=3645,freq=2.0), product of:
              0.10231704 = queryWeight, product of:
                1.0480251 = boost
                5.4514923 = idf(docFreq=517, maxDocs=44421)
                0.01790857 = queryNorm
              0.4818484 = fieldWeight in 3645, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4514923 = idf(docFreq=517, maxDocs=44421)
                0.0625 = fieldNorm(doc=3645)
          0.048649237 = weight(abstract_txt:corpus in 3645) [ClassicSimilarity], result of:
            0.048649237 = score(doc=3645,freq=1.0), product of:
              0.12777221 = queryWeight, product of:
                1.1711591 = boost
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.01790857 = queryNorm
              0.38074973 = fieldWeight in 3645, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.0625 = fieldNorm(doc=3645)
          0.012920727 = weight(abstract_txt:information in 3645) [ClassicSimilarity], result of:
            0.012920727 = score(doc=3645,freq=2.0), product of:
              0.060432993 = queryWeight, product of:
                1.395068 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.01790857 = queryNorm
              0.21380253 = fieldWeight in 3645, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0625 = fieldNorm(doc=3645)
          0.11196183 = weight(abstract_txt:domains in 3645) [ClassicSimilarity], result of:
            0.11196183 = score(doc=3645,freq=1.0), product of:
              0.32122263 = queryWeight, product of:
                3.216336 = boost
                5.576784 = idf(docFreq=456, maxDocs=44421)
                0.01790857 = queryNorm
              0.348549 = fieldWeight in 3645, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.576784 = idf(docFreq=456, maxDocs=44421)
                0.0625 = fieldNorm(doc=3645)
          0.26545358 = weight(abstract_txt:extraction in 3645) [ClassicSimilarity], result of:
            0.26545358 = score(doc=3645,freq=3.0), product of:
              0.3960148 = queryWeight, product of:
                3.571199 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.01790857 = queryNorm
              0.6703122 = fieldWeight in 3645, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.0625 = fieldNorm(doc=3645)
        0.24 = coord(6/25)
    
  3. Cortez, E.; Silva, A.S. da; Gonçalves, M.A.; Mesquita, F.; Moura, E.S. de: ¬A flexible approach for extracting metadata from bibliographic citations (2009) 0.12
    0.1160928 = sum of:
      0.1160928 = product of:
        0.48372 = sum of:
          0.02915545 = weight(abstract_txt:demonstrate in 3848) [ClassicSimilarity], result of:
            0.02915545 = score(doc=3848,freq=1.0), product of:
              0.099279635 = queryWeight, product of:
                1.032352 = boost
                5.3699656 = idf(docFreq=561, maxDocs=44421)
                0.01790857 = queryNorm
              0.29367 = fieldWeight in 3848, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3699656 = idf(docFreq=561, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3848)
          0.044820834 = weight(abstract_txt:automatically in 3848) [ClassicSimilarity], result of:
            0.044820834 = score(doc=3848,freq=2.0), product of:
              0.10495996 = queryWeight, product of:
                1.0614744 = boost
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.01790857 = queryNorm
              0.42702794 = fieldWeight in 3848, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3848)
          0.05314689 = weight(abstract_txt:learned in 3848) [ClassicSimilarity], result of:
            0.05314689 = score(doc=3848,freq=1.0), product of:
              0.14814907 = queryWeight, product of:
                1.2610931 = boost
                6.559804 = idf(docFreq=170, maxDocs=44421)
                0.01790857 = queryNorm
              0.3587393 = fieldWeight in 3848, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.559804 = idf(docFreq=170, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3848)
          0.007994292 = weight(abstract_txt:information in 3848) [ClassicSimilarity], result of:
            0.007994292 = score(doc=3848,freq=1.0), product of:
              0.060432993 = queryWeight, product of:
                1.395068 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.01790857 = queryNorm
              0.13228357 = fieldWeight in 3848, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3848)
          0.08039809 = weight(abstract_txt:experiments in 3848) [ClassicSimilarity], result of:
            0.08039809 = score(doc=3848,freq=2.0), product of:
              0.19522893 = queryWeight, product of:
                2.0473156 = boost
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.01790857 = queryNorm
              0.41181442 = fieldWeight in 3848, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3848)
          0.26820445 = weight(abstract_txt:extraction in 3848) [ClassicSimilarity], result of:
            0.26820445 = score(doc=3848,freq=4.0), product of:
              0.3960148 = queryWeight, product of:
                3.571199 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.01790857 = queryNorm
              0.6772587 = fieldWeight in 3848, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3848)
        0.24 = coord(6/25)
    
  4. Yang, C.C.; Li, K.W.: Automatic construction of English/Chinese parallel corpora (2003) 0.12
    0.11507123 = sum of:
      0.11507123 = product of:
        0.47946346 = sum of:
          0.031693116 = weight(abstract_txt:automatically in 2683) [ClassicSimilarity], result of:
            0.031693116 = score(doc=2683,freq=1.0), product of:
              0.10495996 = queryWeight, product of:
                1.0614744 = boost
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.01790857 = queryNorm
              0.30195436 = fieldWeight in 2683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.07373008 = weight(abstract_txt:corpus in 2683) [ClassicSimilarity], result of:
            0.07373008 = score(doc=2683,freq=3.0), product of:
              0.12777221 = queryWeight, product of:
                1.1711591 = boost
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.01790857 = queryNorm
              0.5770432 = fieldWeight in 2683, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.055091962 = weight(abstract_txt:dictionary in 2683) [ClassicSimilarity], result of:
            0.055091962 = score(doc=2683,freq=1.0), product of:
              0.15174201 = queryWeight, product of:
                1.2762938 = boost
                6.6388726 = idf(docFreq=157, maxDocs=44421)
                0.01790857 = queryNorm
              0.36306334 = fieldWeight in 2683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6388726 = idf(docFreq=157, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.013846519 = weight(abstract_txt:information in 2683) [ClassicSimilarity], result of:
            0.013846519 = score(doc=2683,freq=3.0), product of:
              0.060432993 = queryWeight, product of:
                1.395068 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.01790857 = queryNorm
              0.22912185 = fieldWeight in 2683, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.056850035 = weight(abstract_txt:experiments in 2683) [ClassicSimilarity], result of:
            0.056850035 = score(doc=2683,freq=1.0), product of:
              0.19522893 = queryWeight, product of:
                2.0473156 = boost
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.01790857 = queryNorm
              0.29119676 = fieldWeight in 2683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.24825174 = weight(abstract_txt:dictionaries in 2683) [ClassicSimilarity], result of:
            0.24825174 = score(doc=2683,freq=3.0), product of:
              0.3616414 = queryWeight, product of:
                2.7864535 = boost
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.01790857 = queryNorm
              0.6864583 = fieldWeight in 2683, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
        0.24 = coord(6/25)
    
  5. Li, J.; Zhang, Z.; Li, X.; Chen, H.: Kernel-based learning for biomedical relation extraction (2008) 0.11
    0.109858595 = sum of:
      0.109858595 = product of:
        0.549293 = sum of:
          0.06402976 = weight(abstract_txt:automatically in 2611) [ClassicSimilarity], result of:
            0.06402976 = score(doc=2611,freq=2.0), product of:
              0.10495996 = queryWeight, product of:
                1.0614744 = boost
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.01790857 = queryNorm
              0.6100399 = fieldWeight in 2611, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.078125 = fieldNorm(doc=2611)
          0.060811542 = weight(abstract_txt:corpus in 2611) [ClassicSimilarity], result of:
            0.060811542 = score(doc=2611,freq=1.0), product of:
              0.12777221 = queryWeight, product of:
                1.1711591 = boost
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.01790857 = queryNorm
              0.47593716 = fieldWeight in 2611, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.078125 = fieldNorm(doc=2611)
          0.011420417 = weight(abstract_txt:information in 2611) [ClassicSimilarity], result of:
            0.011420417 = score(doc=2611,freq=1.0), product of:
              0.060432993 = queryWeight, product of:
                1.395068 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.01790857 = queryNorm
              0.18897653 = fieldWeight in 2611, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.078125 = fieldNorm(doc=2611)
          0.08121434 = weight(abstract_txt:experiments in 2611) [ClassicSimilarity], result of:
            0.08121434 = score(doc=2611,freq=1.0), product of:
              0.19522893 = queryWeight, product of:
                2.0473156 = boost
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.01790857 = queryNorm
              0.4159954 = fieldWeight in 2611, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.078125 = fieldNorm(doc=2611)
          0.33181694 = weight(abstract_txt:extraction in 2611) [ClassicSimilarity], result of:
            0.33181694 = score(doc=2611,freq=3.0), product of:
              0.3960148 = queryWeight, product of:
                3.571199 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.01790857 = queryNorm
              0.83789027 = fieldWeight in 2611, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.078125 = fieldNorm(doc=2611)
        0.2 = coord(5/25)