Document (#42780)

Author
Tüür-Fröhlich, T.
Title
Blackbox SSCI : Datenerfassung und Datenverarbeitung bei der kommerziellen Indexierung von Zitaten
Source
Information - Wissenschaft und Praxis. 70(2019) H.5/6, S.241-248
Year
2019
Abstract
Zahlreiche Autoren, Autorinnen und kritische Initiativen (z. B. DORA) kritisieren den zu hohen und schädlichen Einfluss quantitativer Daten, welche akademische Instanzen für Evaluationszwecke heranziehen. Wegen des großen Einflusses der globalen Zitatdatenbanken von Thomson Reuters (bzw. Clarivate Analytics) auf die Bewertung der wissenschaftlichen Leistungen von Forscherinnen und Forschern habe ich extensive qualitative und quantitative Fallstudien zur Datenqualität des Social Sciences Citation Index (SSCI) durchgeführt, d. h. die Originaleinträge mit den SSCI-Datensätzen verglichen. Diese Fallstudien zeigten schwerste - nie in der Literatur erwähnte - Fehler, Verstümmelungen, Phantomautoren, Phantomwerke (Fehlerrate in der Fallstudie zu Beebe 2010, Harvard Law Review: 99 Prozent). Über die verwendeten Datenerfassungs- und Indexierungsverfahren von TR bzw. Clarivate Analytics ist nur wenig bekannt. Ein Ergebnis meiner Untersuchungen: Bei der Indexierung von Verweisen in Fußnoten (wie in den Rechtswissenschaften, gerade auch der USA, vorgeschrieben) scheinen die verwendeten Textanalyse-Anwendungen und -Algorithmen völlig überfordert. Eine Qualitätskontrolle scheint nicht stattzufinden. Damit steht der Anspruch des SSCI als einer multidisziplinären Datenbank zur Debatte. Korrekte Zitate in den Fußnoten des Originals können zu Phantom-Autoren, Phantom-Werken und Phantom-Referenzen degenerieren. Das bedeutet: Sämtliche Zeitschriften und Disziplinen, deren Zeitschriften und Büchern dieses oder ähnliche Zitierverfahren verwenden (Oxford-Style), laufen Gefahr, aufgrund starker Zitatverluste falsch, d. h. unterbewertet, zu werden. Wie viele UBOs (Unidentifiable Bibliographic Objects) sich in den Datenbanken SCI, SSCI und AHCI befinden, wäre nur mit sehr aufwändigen Prozeduren zu klären. Unabhängig davon handelt es sich, wie bei fast allen in meinen Untersuchungen gefundenen fatalen Fehlern, eindeutig um endogene Fehler in den Datenbanken, die nicht, wie oft behauptet, angeblich falsch zitierenden Autorinnen und Autoren zugeschrieben werden können, sondern erst im Laufe der Dateneingabe und -verarbeitung entstehen.
Content
Vgl.: https://doi.org/10.1515/iwp-2019-2038.
Theme
Informetrie
Object
Social Sciences Citation Index

Similar documents (author)

  1. Fröhlich, G.: Demokratisierung durch Datenbanken und Computernetze? : Impulsfassung (1995) 5.71
    5.7103243 = sum of:
      5.7103243 = weight(author_txt:fröhlich in 2768) [ClassicSimilarity], result of:
        5.7103243 = fieldWeight in 2768, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.1365185 = idf(docFreq=12, maxDocs=44421)
          0.625 = fieldNorm(doc=2768)
    
  2. Fröhlich, G.: Optimale Informationsvorenthaltung als Strategem wissenschaftlicher Kommunikation (1999) 5.71
    5.7103243 = sum of:
      5.7103243 = weight(author_txt:fröhlich in 5113) [ClassicSimilarity], result of:
        5.7103243 = fieldWeight in 5113, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.1365185 = idf(docFreq=12, maxDocs=44421)
          0.625 = fieldNorm(doc=5113)
    
  3. Fröhlich, G.: ¬Das Messen des leicht Meßbaren : Output-Indikatoren, Impact-Maße: Artefakte der Szeintometrie? (1999) 5.71
    5.7103243 = sum of:
      5.7103243 = weight(author_txt:fröhlich in 5379) [ClassicSimilarity], result of:
        5.7103243 = fieldWeight in 5379, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.1365185 = idf(docFreq=12, maxDocs=44421)
          0.625 = fieldNorm(doc=5379)
    
  4. Fröhlich, T.: Vorüberlegungen zu einem Gesamtkatalog der Bibliotheken des Deutschen Archäologischen Instituts (DAI) (2001) 5.71
    5.7103243 = sum of:
      5.7103243 = weight(author_txt:fröhlich in 5980) [ClassicSimilarity], result of:
        5.7103243 = fieldWeight in 5980, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.1365185 = idf(docFreq=12, maxDocs=44421)
          0.625 = fieldNorm(doc=5980)
    
  5. Fröhlich, G.: Plagiate und unethische Autorenschaften (2006) 5.71
    5.7103243 = sum of:
      5.7103243 = weight(author_txt:fröhlich in 1748) [ClassicSimilarity], result of:
        5.7103243 = fieldWeight in 1748, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.1365185 = idf(docFreq=12, maxDocs=44421)
          0.625 = fieldNorm(doc=1748)
    

Similar documents (content)

  1. Tüür-Fröhlich, T.: ¬Eine "autoritative" Datenbank auf dem Prüfstand : der Social Sciences Citation Index (SSCI) und seine Datenqualität (2018) 0.72
    0.72467434 = sum of:
      0.72467434 = product of:
        1.8116858 = sum of:
          0.10884743 = weight(abstract_txt:rechtswissenschaften in 591) [ClassicSimilarity], result of:
            0.10884743 = score(doc=591,freq=1.0), product of:
              0.14059417 = queryWeight, product of:
                1.0426614 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.013607024 = queryNorm
              0.7741959 = fieldWeight in 591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.078125 = fieldNorm(doc=591)
          0.061428133 = weight(abstract_txt:datenbanken in 591) [ClassicSimilarity], result of:
            0.061428133 = score(doc=591,freq=2.0), product of:
              0.09601374 = queryWeight, product of:
                1.2185444 = boost
                5.790671 = idf(docFreq=368, maxDocs=44421)
                0.013607024 = queryNorm
              0.6397848 = fieldWeight in 591, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.790671 = idf(docFreq=368, maxDocs=44421)
                0.078125 = fieldNorm(doc=591)
          0.08433783 = weight(abstract_txt:untersuchungen in 591) [ClassicSimilarity], result of:
            0.08433783 = score(doc=591,freq=1.0), product of:
              0.1494331 = queryWeight, product of:
                1.5201907 = boost
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.013607024 = queryNorm
              0.5643852 = fieldWeight in 591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.078125 = fieldNorm(doc=591)
          0.11846764 = weight(abstract_txt:falsch in 591) [ClassicSimilarity], result of:
            0.11846764 = score(doc=591,freq=1.0), product of:
              0.18742679 = queryWeight, product of:
                1.7025132 = boost
                8.090549 = idf(docFreq=36, maxDocs=44421)
                0.013607024 = queryNorm
              0.6320742 = fieldWeight in 591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.090549 = idf(docFreq=36, maxDocs=44421)
                0.078125 = fieldNorm(doc=591)
          0.2164377 = weight(abstract_txt:fehler in 591) [ClassicSimilarity], result of:
            0.2164377 = score(doc=591,freq=3.0), product of:
              0.19421378 = queryWeight, product of:
                1.7330643 = boost
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.013607024 = queryNorm
              1.1144302 = fieldWeight in 591, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.078125 = fieldNorm(doc=591)
          0.12496036 = weight(abstract_txt:autorinnen in 591) [ClassicSimilarity], result of:
            0.12496036 = score(doc=591,freq=1.0), product of:
              0.19421378 = queryWeight, product of:
                1.7330643 = boost
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.013607024 = queryNorm
              0.6434166 = fieldWeight in 591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.078125 = fieldNorm(doc=591)
          0.14280368 = weight(abstract_txt:fallstudien in 591) [ClassicSimilarity], result of:
            0.14280368 = score(doc=591,freq=1.0), product of:
              0.21228768 = queryWeight, product of:
                1.8119118 = boost
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.013607024 = queryNorm
              0.67268944 = fieldWeight in 591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.078125 = fieldNorm(doc=591)
          0.07966478 = weight(abstract_txt:autoren in 591) [ClassicSimilarity], result of:
            0.07966478 = score(doc=591,freq=1.0), product of:
              0.16467962 = queryWeight, product of:
                1.9545205 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.013607024 = queryNorm
              0.48375618 = fieldWeight in 591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.078125 = fieldNorm(doc=591)
          0.19205174 = weight(abstract_txt:fußnoten in 591) [ClassicSimilarity], result of:
            0.19205174 = score(doc=591,freq=1.0), product of:
              0.25864893 = queryWeight, product of:
                2.0 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.013607024 = queryNorm
              0.74251896 = fieldWeight in 591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.078125 = fieldNorm(doc=591)
          0.6826865 = weight(abstract_txt:ssci in 591) [ClassicSimilarity], result of:
            0.6826865 = score(doc=591,freq=4.0), product of:
              0.5150777 = queryWeight, product of:
                4.4625287 = boost
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.013607024 = queryNorm
              1.3254049 = fieldWeight in 591, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.078125 = fieldNorm(doc=591)
        0.4 = coord(10/25)
    
  2. Dick, S.: Wenn der Arm zum Phantom wird : Trennlinie zwischen Körper und Umwelt muß vom Gehirn ständig neu kartiert werden, Signale mit Duftmarken (1996) 0.08
    0.08140606 = sum of:
      0.08140606 = product of:
        1.0175757 = sum of:
          0.26988107 = weight(abstract_txt:untersuchungen in 3528) [ClassicSimilarity], result of:
            0.26988107 = score(doc=3528,freq=1.0), product of:
              0.1494331 = queryWeight, product of:
                1.5201907 = boost
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.013607024 = queryNorm
              1.8060327 = fieldWeight in 3528, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.25 = fieldNorm(doc=3528)
          0.74769473 = weight(title_txt:phantom in 3528) [ClassicSimilarity], result of:
            0.74769473 = score(doc=3528,freq=1.0), product of:
              0.40876245 = queryWeight, product of:
                3.0793269 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.013607024 = queryNorm
              1.8291669 = fieldWeight in 3528, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.1875 = fieldNorm(doc=3528)
        0.08 = coord(2/25)
    
  3. Albers, A.; Krüger, M.: RAK-Online - ein Phantom? (1991) 0.08
    0.07975411 = sum of:
      0.07975411 = product of:
        1.9938527 = sum of:
          1.9938527 = weight(title_txt:phantom in 5954) [ClassicSimilarity], result of:
            1.9938527 = score(doc=5954,freq=1.0), product of:
              0.40876245 = queryWeight, product of:
                3.0793269 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.013607024 = queryNorm
              4.8777785 = fieldWeight in 5954, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.5 = fieldNorm(doc=5954)
        0.04 = coord(1/25)
    
  4. Klimt, A. (Bearb.): Kürschners Deutscher Literatur-Kalender 2002/2003 (2002) 0.05
    0.052384038 = sum of:
      0.052384038 = product of:
        0.6548005 = sum of:
          0.39987317 = weight(abstract_txt:autorinnen in 4910) [ClassicSimilarity], result of:
            0.39987317 = score(doc=4910,freq=1.0), product of:
              0.19421378 = queryWeight, product of:
                1.7330643 = boost
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.013607024 = queryNorm
              2.058933 = fieldWeight in 4910, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.25 = fieldNorm(doc=4910)
          0.2549273 = weight(abstract_txt:autoren in 4910) [ClassicSimilarity], result of:
            0.2549273 = score(doc=4910,freq=1.0), product of:
              0.16467962 = queryWeight, product of:
                1.9545205 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.013607024 = queryNorm
              1.5480198 = fieldWeight in 4910, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.25 = fieldNorm(doc=4910)
        0.08 = coord(2/25)
    
  5. Tüür-Fröhlich, T.: ¬The non-trivial effects of trivial errors in scientific communication and evaluation (2016) 0.05
    0.048558343 = sum of:
      0.048558343 = product of:
        0.6069793 = sum of:
          0.07269256 = weight(abstract_txt:ahci in 4137) [ClassicSimilarity], result of:
            0.07269256 = score(doc=4137,freq=1.0), product of:
              0.13625416 = queryWeight, product of:
                1.0264423 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.013607024 = queryNorm
              0.53350705 = fieldWeight in 4137, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4137)
          0.53428674 = weight(abstract_txt:ssci in 4137) [ClassicSimilarity], result of:
            0.53428674 = score(doc=4137,freq=5.0), product of:
              0.5150777 = queryWeight, product of:
                4.4625287 = boost
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.013607024 = queryNorm
              1.0372934 = fieldWeight in 4137, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4137)
        0.08 = coord(2/25)