Document (#17138)

Author
Spitz, A.L.
Wilcox, L.D.
Title
Classification techniques applied to the recognition of office documents
Source
Wissensorganisation im Wandel: Dezimalklassifikation - Thesaurusfragen - Warenklassifikation. Proc. 11. Jahrestagung der Gesellschaft für Klassifikation, Aachen, 29.6.-1.7.1987. Hrsg.: H.-J. Hermes u. J. Hölzl
Imprint
Frankfurt : Indeks
Year
1988
Pages
S.115-122
Series
Studien zur Klassifikation; Bd.18
Abstract
In the process of developing a document recognition network service, techniques were developed for the segmentation and classification of text, line drawing graphics and pictures
Theme
Dokumentenmanagement

Similar documents (content)

  1. Peng, F.; Huang, X.: Machine learning for Asian language text classification (2007) 0.29
    0.2885009 = sum of:
      0.2885009 = product of:
        0.91358626 = sum of:
          0.03800149 = weight(abstract_txt:were in 1831) [ClassicSimilarity], result of:
            0.03800149 = score(doc=1831,freq=4.0), product of:
              0.08289384 = queryWeight, product of:
                3.6674848 = idf(docFreq=3083, maxDocs=44421)
                0.022602368 = queryNorm
              0.4584356 = fieldWeight in 1831, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6674848 = idf(docFreq=3083, maxDocs=44421)
                0.0625 = fieldNorm(doc=1831)
          0.0568302 = weight(abstract_txt:text in 1831) [ClassicSimilarity], result of:
            0.0568302 = score(doc=1831,freq=5.0), product of:
              0.10063244 = queryWeight, product of:
                1.101813 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.022602368 = queryNorm
              0.56473047 = fieldWeight in 1831, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=1831)
          0.060119145 = weight(abstract_txt:applied in 1831) [ClassicSimilarity], result of:
            0.060119145 = score(doc=1831,freq=2.0), product of:
              0.14179918 = queryWeight, product of:
                1.3079036 = boost
                4.7967167 = idf(docFreq=996, maxDocs=44421)
                0.022602368 = queryNorm
              0.42397386 = fieldWeight in 1831, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7967167 = idf(docFreq=996, maxDocs=44421)
                0.0625 = fieldNorm(doc=1831)
          0.510176 = weight(abstract_txt:segmentation in 1831) [ClassicSimilarity], result of:
            0.510176 = score(doc=1831,freq=7.0), product of:
              0.38855803 = queryWeight, product of:
                2.1650445 = boost
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.022602368 = queryNorm
              1.3129983 = fieldWeight in 1831, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.0625 = fieldNorm(doc=1831)
          0.14704171 = weight(abstract_txt:classification in 1831) [ClassicSimilarity], result of:
            0.14704171 = score(doc=1831,freq=9.0), product of:
              0.19644065 = queryWeight, product of:
                2.1770558 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.022602368 = queryNorm
              0.7485299 = fieldWeight in 1831, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=1831)
          0.10141778 = weight(abstract_txt:techniques in 1831) [ClassicSimilarity], result of:
            0.10141778 = score(doc=1831,freq=2.0), product of:
              0.25317332 = queryWeight, product of:
                2.4715126 = boost
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.022602368 = queryNorm
              0.40058637 = fieldWeight in 1831, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.0625 = fieldNorm(doc=1831)
        0.31578946 = coord(6/19)
    
  2. Tseng, Y.-H.; Lin, C.-J.; Lin, Y.-I.: Text mining techniques for patent analysis (2007) 0.26
    0.2572089 = sum of:
      0.2572089 = product of:
        0.6108712 = sum of:
          0.035942573 = weight(abstract_txt:text in 1935) [ClassicSimilarity], result of:
            0.035942573 = score(doc=1935,freq=2.0), product of:
              0.10063244 = queryWeight, product of:
                1.101813 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.022602368 = queryNorm
              0.3571669 = fieldWeight in 1935, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=1935)
          0.03615754 = weight(abstract_txt:process in 1935) [ClassicSimilarity], result of:
            0.03615754 = score(doc=1935,freq=2.0), product of:
              0.10103328 = queryWeight, product of:
                1.1040051 = boost
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.022602368 = queryNorm
              0.35787752 = fieldWeight in 1935, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.0625 = fieldNorm(doc=1935)
          0.027002713 = weight(abstract_txt:documents in 1935) [ClassicSimilarity], result of:
            0.027002713 = score(doc=1935,freq=1.0), product of:
              0.10478042 = queryWeight, product of:
                1.1242915 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.022602368 = queryNorm
              0.25770763 = fieldWeight in 1935, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0625 = fieldNorm(doc=1935)
          0.030499816 = weight(abstract_txt:document in 1935) [ClassicSimilarity], result of:
            0.030499816 = score(doc=1935,freq=1.0), product of:
              0.1136423 = queryWeight, product of:
                1.1708705 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.022602368 = queryNorm
              0.26838437 = fieldWeight in 1935, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0625 = fieldNorm(doc=1935)
          0.060119145 = weight(abstract_txt:applied in 1935) [ClassicSimilarity], result of:
            0.060119145 = score(doc=1935,freq=2.0), product of:
              0.14179918 = queryWeight, product of:
                1.3079036 = boost
                4.7967167 = idf(docFreq=996, maxDocs=44421)
                0.022602368 = queryNorm
              0.42397386 = fieldWeight in 1935, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7967167 = idf(docFreq=996, maxDocs=44421)
                0.0625 = fieldNorm(doc=1935)
          0.19282842 = weight(abstract_txt:segmentation in 1935) [ClassicSimilarity], result of:
            0.19282842 = score(doc=1935,freq=1.0), product of:
              0.38855803 = queryWeight, product of:
                2.1650445 = boost
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.022602368 = queryNorm
              0.49626672 = fieldWeight in 1935, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.0625 = fieldNorm(doc=1935)
          0.08489456 = weight(abstract_txt:classification in 1935) [ClassicSimilarity], result of:
            0.08489456 = score(doc=1935,freq=3.0), product of:
              0.19644065 = queryWeight, product of:
                2.1770558 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.022602368 = queryNorm
              0.43216392 = fieldWeight in 1935, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=1935)
          0.1434264 = weight(abstract_txt:techniques in 1935) [ClassicSimilarity], result of:
            0.1434264 = score(doc=1935,freq=4.0), product of:
              0.25317332 = queryWeight, product of:
                2.4715126 = boost
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.022602368 = queryNorm
              0.5665147 = fieldWeight in 1935, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.0625 = fieldNorm(doc=1935)
        0.42105263 = coord(8/19)
    
  3. Steinmetz, R.: Data compression in multimedia computing : principles and techniques (1994) 0.22
    0.21701373 = sum of:
      0.21701373 = product of:
        0.5890373 = sum of:
          0.025415238 = weight(abstract_txt:text in 8181) [ClassicSimilarity], result of:
            0.025415238 = score(doc=8181,freq=1.0), product of:
              0.10063244 = queryWeight, product of:
                1.101813 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.022602368 = queryNorm
              0.25255513 = fieldWeight in 8181, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=8181)
          0.025567241 = weight(abstract_txt:process in 8181) [ClassicSimilarity], result of:
            0.025567241 = score(doc=8181,freq=1.0), product of:
              0.10103328 = queryWeight, product of:
                1.1040051 = boost
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.022602368 = queryNorm
              0.25305763 = fieldWeight in 8181, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.0625 = fieldNorm(doc=8181)
          0.02845538 = weight(abstract_txt:developed in 8181) [ClassicSimilarity], result of:
            0.02845538 = score(doc=8181,freq=1.0), product of:
              0.10850543 = queryWeight, product of:
                1.1441016 = boost
                4.1959753 = idf(docFreq=1817, maxDocs=44421)
                0.022602368 = queryNorm
              0.26224846 = fieldWeight in 8181, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1959753 = idf(docFreq=1817, maxDocs=44421)
                0.0625 = fieldNorm(doc=8181)
          0.03808119 = weight(abstract_txt:network in 8181) [ClassicSimilarity], result of:
            0.03808119 = score(doc=8181,freq=1.0), product of:
              0.1317697 = queryWeight, product of:
                1.2608013 = boost
                4.6239696 = idf(docFreq=1184, maxDocs=44421)
                0.022602368 = queryNorm
              0.2889981 = fieldWeight in 8181, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6239696 = idf(docFreq=1184, maxDocs=44421)
                0.0625 = fieldNorm(doc=8181)
          0.18692116 = weight(abstract_txt:graphics in 8181) [ClassicSimilarity], result of:
            0.18692116 = score(doc=8181,freq=2.0), product of:
              0.30206764 = queryWeight, product of:
                1.9089342 = boost
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.022602368 = queryNorm
              0.61880565 = fieldWeight in 8181, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.0625 = fieldNorm(doc=8181)
          0.16038622 = weight(abstract_txt:pictures in 8181) [ClassicSimilarity], result of:
            0.16038622 = score(doc=8181,freq=1.0), product of:
              0.3436528 = queryWeight, product of:
                2.0360987 = boost
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.022602368 = queryNorm
              0.46671006 = fieldWeight in 8181, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.0625 = fieldNorm(doc=8181)
          0.1242109 = weight(abstract_txt:techniques in 8181) [ClassicSimilarity], result of:
            0.1242109 = score(doc=8181,freq=3.0), product of:
              0.25317332 = queryWeight, product of:
                2.4715126 = boost
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.022602368 = queryNorm
              0.49061608 = fieldWeight in 8181, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.0625 = fieldNorm(doc=8181)
        0.36842105 = coord(7/19)
    
  4. Multilingual information management : current levels and future abilities. A report Commissioned by the US National Science Foundation and also delivered to the European Commission's Language Engineering Office and the US Defense Advanced Research Projects Agency, April 1999 (1999) 0.21
    0.21069658 = sum of:
      0.21069658 = product of:
        0.5718907 = sum of:
          0.01662565 = weight(abstract_txt:were in 68) [ClassicSimilarity], result of:
            0.01662565 = score(doc=68,freq=1.0), product of:
              0.08289384 = queryWeight, product of:
                3.6674848 = idf(docFreq=3083, maxDocs=44421)
                0.022602368 = queryNorm
              0.20056558 = fieldWeight in 68, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6674848 = idf(docFreq=3083, maxDocs=44421)
                0.0546875 = fieldNorm(doc=68)
          0.031449754 = weight(abstract_txt:text in 68) [ClassicSimilarity], result of:
            0.031449754 = score(doc=68,freq=2.0), product of:
              0.10063244 = queryWeight, product of:
                1.101813 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.022602368 = queryNorm
              0.31252104 = fieldWeight in 68, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0546875 = fieldNorm(doc=68)
          0.022371337 = weight(abstract_txt:process in 68) [ClassicSimilarity], result of:
            0.022371337 = score(doc=68,freq=1.0), product of:
              0.10103328 = queryWeight, product of:
                1.1040051 = boost
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.022602368 = queryNorm
              0.22142543 = fieldWeight in 68, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.0546875 = fieldNorm(doc=68)
          0.03521174 = weight(abstract_txt:developed in 68) [ClassicSimilarity], result of:
            0.03521174 = score(doc=68,freq=2.0), product of:
              0.10850543 = queryWeight, product of:
                1.1441016 = boost
                4.1959753 = idf(docFreq=1817, maxDocs=44421)
                0.022602368 = queryNorm
              0.3245159 = fieldWeight in 68, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1959753 = idf(docFreq=1817, maxDocs=44421)
                0.0546875 = fieldNorm(doc=68)
          0.045666758 = weight(abstract_txt:developing in 68) [ClassicSimilarity], result of:
            0.045666758 = score(doc=68,freq=1.0), product of:
              0.16258106 = queryWeight, product of:
                1.4004701 = boost
                5.136203 = idf(docFreq=709, maxDocs=44421)
                0.022602368 = queryNorm
              0.28088608 = fieldWeight in 68, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.136203 = idf(docFreq=709, maxDocs=44421)
                0.0546875 = fieldNorm(doc=68)
          0.15370315 = weight(abstract_txt:techniques in 68) [ClassicSimilarity], result of:
            0.15370315 = score(doc=68,freq=6.0), product of:
              0.25317332 = queryWeight, product of:
                2.4715126 = boost
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.022602368 = queryNorm
              0.60710645 = fieldWeight in 68, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.0546875 = fieldNorm(doc=68)
          0.26686236 = weight(abstract_txt:recognition in 68) [ClassicSimilarity], result of:
            0.26686236 = score(doc=68,freq=3.0), product of:
              0.46078426 = queryWeight, product of:
                3.3342848 = boost
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.022602368 = queryNorm
              0.5791482 = fieldWeight in 68, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.0546875 = fieldNorm(doc=68)
        0.36842105 = coord(7/19)
    
  5. Saeed, K.; Dardzinska, A.: Natural language processing : word recognition without segmentation (2001) 0.19
    0.18916358 = sum of:
      0.18916358 = product of:
        0.898527 = sum of:
          0.038350865 = weight(abstract_txt:process in 776) [ClassicSimilarity], result of:
            0.038350865 = score(doc=776,freq=1.0), product of:
              0.10103328 = queryWeight, product of:
                1.1040051 = boost
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.022602368 = queryNorm
              0.37958646 = fieldWeight in 776, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.09375 = fieldNorm(doc=776)
          0.042683072 = weight(abstract_txt:developed in 776) [ClassicSimilarity], result of:
            0.042683072 = score(doc=776,freq=1.0), product of:
              0.10850543 = queryWeight, product of:
                1.1441016 = boost
                4.1959753 = idf(docFreq=1817, maxDocs=44421)
                0.022602368 = queryNorm
              0.39337268 = fieldWeight in 776, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1959753 = idf(docFreq=1817, maxDocs=44421)
                0.09375 = fieldNorm(doc=776)
          0.28924263 = weight(abstract_txt:segmentation in 776) [ClassicSimilarity], result of:
            0.28924263 = score(doc=776,freq=1.0), product of:
              0.38855803 = queryWeight, product of:
                2.1650445 = boost
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.022602368 = queryNorm
              0.7444001 = fieldWeight in 776, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.09375 = fieldNorm(doc=776)
          0.52825046 = weight(abstract_txt:recognition in 776) [ClassicSimilarity], result of:
            0.52825046 = score(doc=776,freq=4.0), product of:
              0.46078426 = queryWeight, product of:
                3.3342848 = boost
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.022602368 = queryNorm
              1.1464161 = fieldWeight in 776, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.09375 = fieldNorm(doc=776)
        0.21052632 = coord(4/19)