Document (#37906)

Author
Kempf, A.O.
Title
Automatische Inhaltserschließung in der Fachinformation
Source
Information - Wissenschaft und Praxis. 64(2013) H.2/3, S.96-106
Year
2013
Abstract
Der Artikel basiert auf einer Masterarbeit mit dem Titel "Automatische Indexierung in der sozialwissenschaftlichen Fachinformation. Eine Evaluationsstudie zur maschinellen Erschließung für die Datenbank SOLIS" (Kempf 2012), die im Rahmen des Aufbaustudiengangs Bibliotheks- und Informationswissenschaft an der Humboldt- Universität zu Berlin am Lehrstuhl Information Retrieval verfasst wurde. Auf der Grundlage des Schalenmodells zur Inhaltserschließung in der Fachinformation stellt der Artikel Evaluationsergebnisse eines automatischen Erschließungsverfahrens für den Einsatz in der sozialwissenschaftlichen Fachinformation vor. Ausgehend von dem von Krause beschriebenen Anwendungsszenario, wonach SOLIS-Datenbestände (Sozialwissenschaftliches Literaturinformationssystem) von geringerer Relevanz automatisch erschlossen werden sollten, wurden auf dieser Dokumentgrundlage zwei Testreihen mit der Indexierungssoftware MindServer der Firma Recommind durchgeführt. Neben den Auswirkungen allgemeiner Systemeinstellungen in der ersten Testreihe wurde in der zweiten Testreihe die Indexierungsleistung der Software für die Rand- und die Kernbereiche der Literaturdatenbank miteinander verglichen. Für letztere Testreihe wurden für beide Bereiche der Datenbank spezifische Versionen der Indexierungssoftware aufgebaut, die anhand von Dokumentkorpora aus den entsprechenden Bereichen trainiert wurden. Die Ergebnisse der Evaluation, die auf der Grundlage intellektuell generierter Vergleichsdaten erfolgt, weisen auf Unterschiede in der Indexierungsleistung zwischen Rand- und Kernbereichen hin, die einerseits gegen den Einsatz automatischer Indexierungsverfahren in den Randbereichen sprechen. Andererseits deutet sich an, dass sich die Indexierungsresultate durch den Aufbau fachteilgebietsspezifischer Trainingsmengen verbessern lassen.
Content
Vgl.: http://www.degruyter.com/view/j/iwp.2013.64.issue-2-3/iwp-2013-0011/iwp-2013-0011.xml?format=INT.
Theme
Automatisches Indexieren
Field
Sozialwissenschaften
Object
SOLIS

Similar documents (author)

  1. Kempf, G.: Klassifikationsprobleme der Rechtswissenschaft (1972) 5.23
    5.230789 = sum of:
      5.230789 = weight(author_txt:kempf in 4742) [ClassicSimilarity], result of:
        5.230789 = fieldWeight in 4742, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.369263 = idf(docFreq=27, maxDocs=44421)
          0.625 = fieldNorm(doc=4742)
    
  2. Kempf, A.: Thematischer Zugang zu Fachinformationen im Internet (1994) 5.23
    5.230789 = sum of:
      5.230789 = weight(author_txt:kempf in 589) [ClassicSimilarity], result of:
        5.230789 = fieldWeight in 589, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.369263 = idf(docFreq=27, maxDocs=44421)
          0.625 = fieldNorm(doc=589)
    
  3. Kempf, A.: Forstliche Klassifikation und Meta-Information zum Wald im Internet (1995) 5.23
    5.230789 = sum of:
      5.230789 = weight(author_txt:kempf in 3272) [ClassicSimilarity], result of:
        5.230789 = fieldWeight in 3272, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.369263 = idf(docFreq=27, maxDocs=44421)
          0.625 = fieldNorm(doc=3272)
    
  4. Kempf, A.: Advocating global forest issues on the Internet (1996) 5.23
    5.230789 = sum of:
      5.230789 = weight(author_txt:kempf in 93) [ClassicSimilarity], result of:
        5.230789 = fieldWeight in 93, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.369263 = idf(docFreq=27, maxDocs=44421)
          0.625 = fieldNorm(doc=93)
    
  5. Kempf, K.: Dalla Germania un esempio avanzato di sistema integrato (1997) 5.23
    5.230789 = sum of:
      5.230789 = weight(author_txt:kempf in 846) [ClassicSimilarity], result of:
        5.230789 = fieldWeight in 846, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.369263 = idf(docFreq=27, maxDocs=44421)
          0.625 = fieldNorm(doc=846)
    

Similar documents (content)

  1. Kempf, A.O.: Automatische Indexierung in der sozialwissenschaftlichen Fachinformation : eine Evaluationsstudie zur maschinellen Erschließung für die Datenbank SOLIS (2012) 0.50
    0.49607664 = sum of:
      0.49607664 = product of:
        1.7717023 = sum of:
          0.10875849 = weight(abstract_txt:literaturdatenbank in 1903) [ClassicSimilarity], result of:
            0.10875849 = score(doc=1903,freq=1.0), product of:
              0.1559108 = queryWeight, product of:
                1.0799999 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.016167972 = queryNorm
              0.69756866 = fieldWeight in 1903, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.078125 = fieldNorm(doc=1903)
          0.11371127 = weight(abstract_txt:indexierungsverfahren in 1903) [ClassicSimilarity], result of:
            0.11371127 = score(doc=1903,freq=1.0), product of:
              0.16060896 = queryWeight, product of:
                1.0961514 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.016167972 = queryNorm
              0.7080008 = fieldWeight in 1903, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.078125 = fieldNorm(doc=1903)
          0.07156794 = weight(abstract_txt:datenbank in 1903) [ClassicSimilarity], result of:
            0.07156794 = score(doc=1903,freq=1.0), product of:
              0.14861287 = queryWeight, product of:
                1.4911758 = boost
                6.1641335 = idf(docFreq=253, maxDocs=44421)
                0.016167972 = queryNorm
              0.48157293 = fieldWeight in 1903, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1641335 = idf(docFreq=253, maxDocs=44421)
                0.078125 = fieldNorm(doc=1903)
          0.14334315 = weight(abstract_txt:automatische in 1903) [ClassicSimilarity], result of:
            0.14334315 = score(doc=1903,freq=2.0), product of:
              0.1874212 = queryWeight, product of:
                1.6745957 = boost
                6.922344 = idf(docFreq=118, maxDocs=44421)
                0.016167972 = queryNorm
              0.7648182 = fieldWeight in 1903, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.922344 = idf(docFreq=118, maxDocs=44421)
                0.078125 = fieldNorm(doc=1903)
          0.21311639 = weight(abstract_txt:sozialwissenschaftlichen in 1903) [ClassicSimilarity], result of:
            0.21311639 = score(doc=1903,freq=1.0), product of:
              0.30760166 = queryWeight, product of:
                2.1453342 = boost
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.016167972 = queryNorm
              0.6928324 = fieldWeight in 1903, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.078125 = fieldNorm(doc=1903)
          0.32957906 = weight(abstract_txt:solis in 1903) [ClassicSimilarity], result of:
            0.32957906 = score(doc=1903,freq=2.0), product of:
              0.32649297 = queryWeight, product of:
                2.2102304 = boost
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.016167972 = queryNorm
              1.0094522 = fieldWeight in 1903, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.078125 = fieldNorm(doc=1903)
          0.79162604 = weight(title_txt:fachinformation in 1903) [ClassicSimilarity], result of:
            0.79162604 = score(doc=1903,freq=1.0), product of:
              0.42805624 = queryWeight, product of:
                3.5790358 = boost
                7.3974023 = idf(docFreq=73, maxDocs=44421)
                0.016167972 = queryNorm
              1.8493506 = fieldWeight in 1903, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3974023 = idf(docFreq=73, maxDocs=44421)
                0.25 = fieldNorm(doc=1903)
        0.28 = coord(7/25)
    
  2. Seeger, T.: Entwicklung der Fachinformation und -kommunikation (2004) 0.18
    0.1778163 = sum of:
      0.1778163 = product of:
        1.4818026 = sum of:
          0.06989387 = weight(abstract_txt:beschriebenen in 3907) [ClassicSimilarity], result of:
            0.06989387 = score(doc=3907,freq=1.0), product of:
              0.13473079 = queryWeight, product of:
                1.0039661 = boost
                8.30027 = idf(docFreq=29, maxDocs=44421)
                0.016167972 = queryNorm
              0.5187669 = fieldWeight in 3907, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.30027 = idf(docFreq=29, maxDocs=44421)
                0.0625 = fieldNorm(doc=3907)
          0.026563186 = weight(abstract_txt:wurde in 3907) [ClassicSimilarity], result of:
            0.026563186 = score(doc=3907,freq=1.0), product of:
              0.08906441 = queryWeight, product of:
                1.1543905 = boost
                4.7719507 = idf(docFreq=1021, maxDocs=44421)
                0.016167972 = queryNorm
              0.29824692 = fieldWeight in 3907, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7719507 = idf(docFreq=1021, maxDocs=44421)
                0.0625 = fieldNorm(doc=3907)
          1.3853456 = weight(title_txt:fachinformation in 3907) [ClassicSimilarity], result of:
            1.3853456 = score(doc=3907,freq=1.0), product of:
              0.42805624 = queryWeight, product of:
                3.5790358 = boost
                7.3974023 = idf(docFreq=73, maxDocs=44421)
                0.016167972 = queryNorm
              3.2363634 = fieldWeight in 3907, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3974023 = idf(docFreq=73, maxDocs=44421)
                0.4375 = fieldNorm(doc=3907)
        0.12 = coord(3/25)
    
  3. Capurro, R.: Hermeneutik der Fachinformation (1986) 0.13
    0.1277227 = sum of:
      0.1277227 = product of:
        1.5965337 = sum of:
          0.013281593 = weight(abstract_txt:wurde in 4613) [ClassicSimilarity], result of:
            0.013281593 = score(doc=4613,freq=1.0), product of:
              0.08906441 = queryWeight, product of:
                1.1543905 = boost
                4.7719507 = idf(docFreq=1021, maxDocs=44421)
                0.016167972 = queryNorm
              0.14912346 = fieldWeight in 4613, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7719507 = idf(docFreq=1021, maxDocs=44421)
                0.03125 = fieldNorm(doc=4613)
          1.5832521 = weight(title_txt:fachinformation in 4613) [ClassicSimilarity], result of:
            1.5832521 = score(doc=4613,freq=1.0), product of:
              0.42805624 = queryWeight, product of:
                3.5790358 = boost
                7.3974023 = idf(docFreq=73, maxDocs=44421)
                0.016167972 = queryNorm
              3.6987011 = fieldWeight in 4613, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3974023 = idf(docFreq=73, maxDocs=44421)
                0.5 = fieldNorm(doc=4613)
        0.08 = coord(2/25)
    
  4. Groß, T.: Automatische Indexierung von Dokumenten in einer wissenschaftlichen Bibliothek : Implementierung und Evaluierung am Beispiel der Deutschen Zentralbibliothek für Wirtschaftswissenschaften (2011) 0.11
    0.107746415 = sum of:
      0.107746415 = product of:
        0.53873205 = sum of:
          0.09912116 = weight(abstract_txt:letztere in 2083) [ClassicSimilarity], result of:
            0.09912116 = score(doc=2083,freq=1.0), product of:
              0.14655873 = queryWeight, product of:
                1.0471079 = boost
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.016167972 = queryNorm
              0.67632383 = fieldWeight in 2083, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.078125 = fieldNorm(doc=2083)
          0.11371127 = weight(abstract_txt:indexierungsverfahren in 2083) [ClassicSimilarity], result of:
            0.11371127 = score(doc=2083,freq=1.0), product of:
              0.16060896 = queryWeight, product of:
                1.0961514 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.016167972 = queryNorm
              0.7080008 = fieldWeight in 2083, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.078125 = fieldNorm(doc=2083)
          0.14868082 = weight(abstract_txt:recommind in 2083) [ClassicSimilarity], result of:
            0.14868082 = score(doc=2083,freq=1.0), product of:
              0.19204548 = queryWeight, product of:
                1.198637 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.016167972 = queryNorm
              0.7741959 = fieldWeight in 2083, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.078125 = fieldNorm(doc=2083)
          0.06464062 = weight(abstract_txt:grundlage in 2083) [ClassicSimilarity], result of:
            0.06464062 = score(doc=2083,freq=1.0), product of:
              0.13886125 = queryWeight, product of:
                1.4414221 = boost
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.016167972 = queryNorm
              0.46550506 = fieldWeight in 2083, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.078125 = fieldNorm(doc=2083)
          0.11257816 = weight(abstract_txt:inhaltserschließung in 2083) [ClassicSimilarity], result of:
            0.11257816 = score(doc=2083,freq=1.0), product of:
              0.20100808 = queryWeight, product of:
                1.7342328 = boost
                7.168868 = idf(docFreq=92, maxDocs=44421)
                0.016167972 = queryNorm
              0.56006783 = fieldWeight in 2083, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.168868 = idf(docFreq=92, maxDocs=44421)
                0.078125 = fieldNorm(doc=2083)
        0.2 = coord(5/25)
    
  5. Herb, U.: Wege zur psychologischen Fachinformation : Eine Bilanz aus der Virtuellen Fachbibliothek Psychologie (2002) 0.11
    0.10694843 = sum of:
      0.10694843 = product of:
        0.8912369 = sum of:
          0.026563186 = weight(abstract_txt:wurde in 2177) [ClassicSimilarity], result of:
            0.026563186 = score(doc=2177,freq=1.0), product of:
              0.08906441 = queryWeight, product of:
                1.1543905 = boost
                4.7719507 = idf(docFreq=1021, maxDocs=44421)
                0.016167972 = queryNorm
              0.29824692 = fieldWeight in 2177, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7719507 = idf(docFreq=1021, maxDocs=44421)
                0.0625 = fieldNorm(doc=2177)
          0.073047705 = weight(abstract_txt:wurden in 2177) [ClassicSimilarity], result of:
            0.073047705 = score(doc=2177,freq=2.0), product of:
              0.15883355 = queryWeight, product of:
                1.8880668 = boost
                5.2031856 = idf(docFreq=663, maxDocs=44421)
                0.016167972 = queryNorm
              0.45990098 = fieldWeight in 2177, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2031856 = idf(docFreq=663, maxDocs=44421)
                0.0625 = fieldNorm(doc=2177)
          0.79162604 = weight(title_txt:fachinformation in 2177) [ClassicSimilarity], result of:
            0.79162604 = score(doc=2177,freq=1.0), product of:
              0.42805624 = queryWeight, product of:
                3.5790358 = boost
                7.3974023 = idf(docFreq=73, maxDocs=44421)
                0.016167972 = queryNorm
              1.8493506 = fieldWeight in 2177, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3974023 = idf(docFreq=73, maxDocs=44421)
                0.25 = fieldNorm(doc=2177)
        0.12 = coord(3/25)