Document (#37402)

Author
Glaesener, L.
Title
Automatisches Indexieren einer informationswissenschaftlichen Datenbank mit Mehrwortgruppen
Imprint
Köln : Fachhochschule / Fakultät für Informations- und Kommunikationswissenschaften
Year
2012
Pages
III, 34, VII pp.
Abstract
A report on the results and the process analysis of an automatic indexing run with multi-word groups (Mehrwortgruppen). This bachelor's thesis describes to what extent the content of information science texts can and should be indexed with information science terminology, and shows that a large share of the subject content in these scholarly texts occurs in multi-word groups. The results were obtained by automatically indexing an information science database with multi-word groups using the program Lingo.
Content
Bachelor's thesis in the Library Science degree program (Studiengang Bibliothekswesen) of the Fakultät für Informations- und Kommunikationswissenschaften at the Fachhochschule Köln.
Theme
Automatisches Indexieren

Similar documents (content)

  1. Bredack, J.: Terminologieextraktion von Mehrwortgruppen in kunsthistorischen Fachtexten (2013) 0.36
    0.36424428 = sum of:
      0.36424428 = product of:
        1.3008724 = sum of:
          0.02322174 = weight(abstract_txt:inhalt in 2054) [ClassicSimilarity], result of:
            0.02322174 = score(doc=2054,freq=1.0), product of:
              0.08784943 = queryWeight, product of:
                1.0863348 = boost
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.011950319 = queryNorm
              0.2643357 = fieldWeight in 2054, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2054)
          0.027990129 = weight(abstract_txt:texten in 2054) [ClassicSimilarity], result of:
            0.027990129 = score(doc=2054,freq=1.0), product of:
              0.09949755 = queryWeight, product of:
                1.1561134 = boost
                7.201658 = idf(docFreq=89, maxDocs=44421)
                0.011950319 = queryNorm
              0.28131476 = fieldWeight in 2054, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.201658 = idf(docFreq=89, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2054)
          0.016227225 = weight(abstract_txt:durch in 2054) [ClassicSimilarity], result of:
            0.016227225 = score(doc=2054,freq=2.0), product of:
              0.06917863 = queryWeight, product of:
                1.3633119 = boost
                4.246169 = idf(docFreq=1728, maxDocs=44421)
                0.011950319 = queryNorm
              0.2345699 = fieldWeight in 2054, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.246169 = idf(docFreq=1728, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2054)
          0.1173395 = weight(abstract_txt:lingo in 2054) [ClassicSimilarity], result of:
            0.1173395 = score(doc=2054,freq=4.0), product of:
              0.1629616 = queryWeight, product of:
                1.4795746 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.011950319 = queryNorm
              0.72004384 = fieldWeight in 2054, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2054)
          0.03489817 = weight(abstract_txt:ergebnisse in 2054) [ClassicSimilarity], result of:
            0.03489817 = score(doc=2054,freq=2.0), product of:
              0.11525972 = queryWeight, product of:
                1.759738 = boost
                5.4808774 = idf(docFreq=502, maxDocs=44421)
                0.011950319 = queryNorm
              0.30277854 = fieldWeight in 2054, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4808774 = idf(docFreq=502, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2054)
          0.029423442 = weight(abstract_txt:einer in 2054) [ClassicSimilarity], result of:
            0.029423442 = score(doc=2054,freq=5.0), product of:
              0.08676046 = queryWeight, product of:
                1.8698888 = boost
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.011950319 = queryNorm
              0.33913422 = fieldWeight in 2054, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2054)
          1.0517722 = weight(abstract_txt:mehrwortgruppen in 2054) [ClassicSimilarity], result of:
            1.0517722 = score(doc=2054,freq=13.0), product of:
              0.75357956 = queryWeight, product of:
                6.3633933 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.011950319 = queryNorm
              1.3957016 = fieldWeight in 2054, product of:
                3.6055512 = tf(freq=13.0), with freq of:
                  13.0 = termFreq=13.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2054)
        0.28 = coord(7/25)
    
  2. Bredack, J.; Lepsky, K.: Automatische Extraktion von Fachterminologie aus Volltexten (2014) 0.31
    0.30729085 = sum of:
      0.30729085 = product of:
        1.5364542 = sum of:
          0.069602534 = weight(abstract_txt:automatische in 872) [ClassicSimilarity], result of:
            0.069602534 = score(doc=872,freq=1.0), product of:
              0.091929264 = queryWeight, product of:
                1.1112739 = boost
                6.922344 = idf(docFreq=118, maxDocs=44421)
                0.011950319 = queryNorm
              0.7571314 = fieldWeight in 872, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.922344 = idf(docFreq=118, maxDocs=44421)
                0.109375 = fieldNorm(doc=872)
          0.07837237 = weight(abstract_txt:texten in 872) [ClassicSimilarity], result of:
            0.07837237 = score(doc=872,freq=1.0), product of:
              0.09949755 = queryWeight, product of:
                1.1561134 = boost
                7.201658 = idf(docFreq=89, maxDocs=44421)
                0.011950319 = queryNorm
              0.78768134 = fieldWeight in 872, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.201658 = idf(docFreq=89, maxDocs=44421)
                0.109375 = fieldNorm(doc=872)
          0.1642753 = weight(abstract_txt:lingo in 872) [ClassicSimilarity], result of:
            0.1642753 = score(doc=872,freq=1.0), product of:
              0.1629616 = queryWeight, product of:
                1.4795746 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.011950319 = queryNorm
              1.0080614 = fieldWeight in 872, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.109375 = fieldNorm(doc=872)
          0.06909486 = weight(abstract_txt:ergebnisse in 872) [ClassicSimilarity], result of:
            0.06909486 = score(doc=872,freq=1.0), product of:
              0.11525972 = queryWeight, product of:
                1.759738 = boost
                5.4808774 = idf(docFreq=502, maxDocs=44421)
                0.011950319 = queryNorm
              0.599471 = fieldWeight in 872, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4808774 = idf(docFreq=502, maxDocs=44421)
                0.109375 = fieldNorm(doc=872)
          1.1551092 = weight(abstract_txt:mehrwortgruppen in 872) [ClassicSimilarity], result of:
            1.1551092 = score(doc=872,freq=2.0), product of:
              0.75357956 = queryWeight, product of:
                6.3633933 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.011950319 = queryNorm
              1.5328298 = fieldWeight in 872, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.109375 = fieldNorm(doc=872)
        0.2 = coord(5/25)
    
  3. Lepsky, K.: Automatisches Indexieren (2023) 0.27
    0.26528704 = sum of:
      0.26528704 = product of:
        1.3264352 = sum of:
          0.082875006 = weight(abstract_txt:automatischen in 1782) [ClassicSimilarity], result of:
            0.082875006 = score(doc=1782,freq=2.0), product of:
              0.09083935 = queryWeight, product of:
                1.1046666 = boost
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.011950319 = queryNorm
              0.91232497 = fieldWeight in 1782, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.09375 = fieldNorm(doc=1782)
          0.059659313 = weight(abstract_txt:automatische in 1782) [ClassicSimilarity], result of:
            0.059659313 = score(doc=1782,freq=1.0), product of:
              0.091929264 = queryWeight, product of:
                1.1112739 = boost
                6.922344 = idf(docFreq=118, maxDocs=44421)
                0.011950319 = queryNorm
              0.64896977 = fieldWeight in 1782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.922344 = idf(docFreq=118, maxDocs=44421)
                0.09375 = fieldNorm(doc=1782)
          0.08192152 = weight(abstract_txt:ermittelt in 1782) [ClassicSimilarity], result of:
            0.08192152 = score(doc=1782,freq=1.0), product of:
              0.11357087 = queryWeight, product of:
                1.2351727 = boost
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.011950319 = queryNorm
              0.7213251 = fieldWeight in 1782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.09375 = fieldNorm(doc=1782)
          0.8535288 = weight(title_txt:automatisches in 1782) [ClassicSimilarity], result of:
            0.8535288 = score(doc=1782,freq=1.0), product of:
              0.1529471 = queryWeight, product of:
                1.4333916 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.011950319 = queryNorm
              5.5805492 = fieldWeight in 1782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.625 = fieldNorm(doc=1782)
          0.24845064 = weight(abstract_txt:indexierung in 1782) [ClassicSimilarity], result of:
            0.24845064 = score(doc=1782,freq=5.0), product of:
              0.17532682 = queryWeight, product of:
                2.1703682 = boost
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.011950319 = queryNorm
              1.4170715 = fieldWeight in 1782, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.09375 = fieldNorm(doc=1782)
        0.2 = coord(5/25)
    
  4. Oberhauser, O.: Automatisches Klassifizieren : Entwicklungsstand - Methodik - Anwendungsbereiche (2005) 0.24
    0.2359578 = sum of:
      0.2359578 = product of:
        0.84270644 = sum of:
          0.06836839 = weight(abstract_txt:automatischen in 163) [ClassicSimilarity], result of:
            0.06836839 = score(doc=163,freq=4.0), product of:
              0.09083935 = queryWeight, product of:
                1.1046666 = boost
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.011950319 = queryNorm
              0.7526297 = fieldWeight in 163, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.0546875 = fieldNorm(doc=163)
          0.034801267 = weight(abstract_txt:automatische in 163) [ClassicSimilarity], result of:
            0.034801267 = score(doc=163,freq=1.0), product of:
              0.091929264 = queryWeight, product of:
                1.1112739 = boost
                6.922344 = idf(docFreq=118, maxDocs=44421)
                0.011950319 = queryNorm
              0.3785657 = fieldWeight in 163, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.922344 = idf(docFreq=118, maxDocs=44421)
                0.0546875 = fieldNorm(doc=163)
          0.039186183 = weight(abstract_txt:texten in 163) [ClassicSimilarity], result of:
            0.039186183 = score(doc=163,freq=1.0), product of:
              0.09949755 = queryWeight, product of:
                1.1561134 = boost
                7.201658 = idf(docFreq=89, maxDocs=44421)
                0.011950319 = queryNorm
              0.39384067 = fieldWeight in 163, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.201658 = idf(docFreq=89, maxDocs=44421)
                0.0546875 = fieldNorm(doc=163)
          0.04997234 = weight(abstract_txt:mithilfe in 163) [ClassicSimilarity], result of:
            0.04997234 = score(doc=163,freq=1.0), product of:
              0.11700656 = queryWeight, product of:
                1.2537165 = boost
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.011950319 = queryNorm
              0.42709008 = fieldWeight in 163, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.0546875 = fieldNorm(doc=163)
          0.016064133 = weight(abstract_txt:durch in 163) [ClassicSimilarity], result of:
            0.016064133 = score(doc=163,freq=1.0), product of:
              0.06917863 = queryWeight, product of:
                1.3633119 = boost
                4.246169 = idf(docFreq=1728, maxDocs=44421)
                0.011950319 = queryNorm
              0.23221236 = fieldWeight in 163, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.246169 = idf(docFreq=1728, maxDocs=44421)
                0.0546875 = fieldNorm(doc=163)
          0.59747016 = weight(title_txt:automatisches in 163) [ClassicSimilarity], result of:
            0.59747016 = score(doc=163,freq=1.0), product of:
              0.1529471 = queryWeight, product of:
                1.4333916 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.011950319 = queryNorm
              3.9063845 = fieldWeight in 163, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.4375 = fieldNorm(doc=163)
          0.036843978 = weight(abstract_txt:einer in 163) [ClassicSimilarity], result of:
            0.036843978 = score(doc=163,freq=4.0), product of:
              0.08676046 = queryWeight, product of:
                1.8698888 = boost
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.011950319 = queryNorm
              0.42466322 = fieldWeight in 163, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.0546875 = fieldNorm(doc=163)
        0.28 = coord(7/25)
    
  5. Grün, S.: Bildung von Komposita-Indextermen auf der Basis einer algorithmischen Mehrwortgruppenanalyse mit Lingo (2015) 0.22
    0.22416393 = sum of:
      0.22416393 = product of:
        1.4010246 = sum of:
          0.07710028 = weight(abstract_txt:großteil in 2335) [ClassicSimilarity], result of:
            0.07710028 = score(doc=2335,freq=1.0), product of:
              0.123166636 = queryWeight, product of:
                1.2862955 = boost
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.011950319 = queryNorm
              0.6259835 = fieldWeight in 2335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.078125 = fieldNorm(doc=2335)
          0.039748427 = weight(abstract_txt:durch in 2335) [ClassicSimilarity], result of:
            0.039748427 = score(doc=2335,freq=3.0), product of:
              0.06917863 = queryWeight, product of:
                1.3633119 = boost
                4.246169 = idf(docFreq=1728, maxDocs=44421)
                0.011950319 = queryNorm
              0.5745766 = fieldWeight in 2335, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.246169 = idf(docFreq=1728, maxDocs=44421)
                0.078125 = fieldNorm(doc=2335)
          0.1173395 = weight(abstract_txt:lingo in 2335) [ClassicSimilarity], result of:
            0.1173395 = score(doc=2335,freq=1.0), product of:
              0.1629616 = queryWeight, product of:
                1.4795746 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.011950319 = queryNorm
              0.72004384 = fieldWeight in 2335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.078125 = fieldNorm(doc=2335)
          1.1668364 = weight(abstract_txt:mehrwortgruppen in 2335) [ClassicSimilarity], result of:
            1.1668364 = score(doc=2335,freq=4.0), product of:
              0.75357956 = queryWeight, product of:
                6.3633933 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.011950319 = queryNorm
              1.5483918 = fieldWeight in 2335, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.078125 = fieldNorm(doc=2335)
        0.16 = coord(4/25)
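
The arithmetic in the score explanations above can be retraced with a short Python sketch. It is an illustration under stated assumptions, not the search engine's own code: the idf formula `1 + ln(maxDocs / (docFreq + 1))` and `tf = sqrt(freq)` are inferred from the listed values, which they reproduce, and the names `idf`, `term_score`, `doc_2054` and `hits` are hypothetical helpers introduced here. All numeric inputs are copied from the listing.

```python
import math

QUERY_NORM = 0.011950319  # queryNorm, identical for every term in the listing
MAX_DOCS = 44421          # maxDocs as reported in the idf(...) lines
QUERY_TERMS = 25          # denominator of every coord(n/25) line


def idf(doc_freq, max_docs=MAX_DOCS):
    # Assumed formula; it matches the listed idf values to the shown precision.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost, doc_freq, freq, field_norm):
    # score = queryWeight * fieldWeight, following the "product of" lines.
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    field_weight = math.sqrt(freq) * idf(doc_freq) * field_norm  # tf = sqrt(freq)
    return query_weight * field_weight


# (boost, docFreq, termFreq, fieldNorm) for the seven terms matched in doc 2054
doc_2054 = {
    "inhalt": (1.0863348, 138, 1.0, 0.0390625),
    "texten": (1.1561134, 89, 1.0, 0.0390625),
    "durch": (1.3633119, 1728, 2.0, 0.0390625),
    "lingo": (1.4795746, 11, 4.0, 0.0390625),
    "ergebnisse": (1.759738, 502, 2.0, 0.0390625),
    "einer": (1.8698888, 2486, 5.0, 0.0390625),
    "mehrwortgruppen": (6.3633933, 5, 13.0, 0.0390625),
}

# Final score = coord * sum of term scores, with coord = matched terms / 25.
coord = len(doc_2054) / QUERY_TERMS
total = coord * sum(term_score(*args) for args in doc_2054.values())

# (sum of term scores, number of matched terms) as listed for all five hits
hits = {
    2054: (1.3008724, 7),
    872: (1.5364542, 5),
    1782: (1.3264352, 5),
    163: (0.84270644, 7),
    2335: (1.4010246, 4),
}
final = {doc: s * n / QUERY_TERMS for doc, (s, n) in hits.items()}
ranking = sorted(final, key=final.get, reverse=True)

print(f"doc 2054 recomputed: {total:.5f}")
print("ranking by coord * sum:", ranking)
```

The recomputed value for doc 2054 should agree with the listed 0.36424428 up to rounding of the printed inputs. The `hits` table also shows why doc 872 ranks below doc 2054 despite its larger term-score sum: it matches only 5 of the 25 query terms, so its coord factor is 0.2 rather than 0.28.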