Document (#40553)

Author
Heldens, S.
Sclocco, A.
Dreuning, H.
Nieuwpoort, V. van
Werkhoven, B. van
Hijma, P.
Maassen, J.
Title
litstudy: a Python package for literature reviews
Source
SoftwareX 20(7825):101207 [DOI: 10.1016/j.softx.2022.101207]
Year
2022
Abstract
Researchers are often faced with exploring new research domains. Broad questions about the research domain, such as who are the influential authors or what are important topics, are difficult to answer due to the overwhelming number of relevant publications. Therefore, we present litstudy: a Python package that enables answering such questions using simple scripts or Jupyter notebooks. The package enables selecting scientific publications and studying their metadata using visualizations, bibliographic network analysis, and natural language processing. The software was previously used in a publication on the landscape of Exascale computing, and we envision great potential for reuse.
Content
Vgl.: https://www.researchgate.net/publication/364121745_litstudy_A_Python_package_for_literature_reviews.
Theme
Informetrie
Object
Python

Similar documents (author)

  1. Maassen, B.: Inhaltserschließung als zentrale Dienstleistung der Deutschen Bibliothek (1980) 6.10
    6.0972233 = sum of:
      6.0972233 = weight(author_txt:maassen in 1440) [ClassicSimilarity], result of:
        6.0972233 = fieldWeight in 1440, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.625 = fieldNorm(doc=1440)
    
  2. Maassen, B.: ¬The PRECIS project of the Deutsche Bibliothek (1984) 6.10
    6.0972233 = sum of:
      6.0972233 = weight(author_txt:maassen in 1777) [ClassicSimilarity], result of:
        6.0972233 = fieldWeight in 1777, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.625 = fieldNorm(doc=1777)
    
  3. Maassen, D.: Im Frauenzimmerzimmer : Computernetze sind nicht so schlecht wie das, was Männer mit ihnen anstellen, immer mehr Frauen experimentieren im Datenraum (1995) 6.10
    6.0972233 = sum of:
      6.0972233 = weight(author_txt:maassen in 2173) [ClassicSimilarity], result of:
        6.0972233 = fieldWeight in 2173, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.625 = fieldNorm(doc=2173)
    
  4. Kelm, B.; Maassen, B.: Zentrale Dienstleistungen der Deutschen Bibliothek im Bereich der Sacherschließung (1980) 4.88
    4.8777785 = sum of:
      4.8777785 = weight(author_txt:maassen in 1746) [ClassicSimilarity], result of:
        4.8777785 = fieldWeight in 1746, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.5 = fieldNorm(doc=1746)
    
  5. Kelm, B.; Maassen, B.: Weiterentwicklung der Sacherschließungsarbeit an der Deutschen Bibliothek (1982) 4.88
    4.8777785 = sum of:
      4.8777785 = weight(author_txt:maassen in 13) [ClassicSimilarity], result of:
        4.8777785 = fieldWeight in 13, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.5 = fieldNorm(doc=13)
    

Similar documents (content)

  1. Giorgetti, D.; Sebastiani, F.: Automating survey coding by multiclass text categorization techniques (2003) 0.11
    0.1059777 = sum of:
      0.1059777 = product of:
        0.44157377 = sum of:
          0.039057158 = weight(abstract_txt:answer in 172) [ClassicSimilarity], result of:
            0.039057158 = score(doc=172,freq=1.0), product of:
              0.105711535 = queryWeight, product of:
                5.9115076 = idf(docFreq=326, maxDocs=44421)
                0.01788233 = queryNorm
              0.36946923 = fieldWeight in 172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9115076 = idf(docFreq=326, maxDocs=44421)
                0.0625 = fieldNorm(doc=172)
          0.04369815 = weight(abstract_txt:previously in 172) [ClassicSimilarity], result of:
            0.04369815 = score(doc=172,freq=1.0), product of:
              0.11392805 = queryWeight, product of:
                1.0381358 = boost
                6.136947 = idf(docFreq=260, maxDocs=44421)
                0.01788233 = queryNorm
              0.3835592 = fieldWeight in 172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.136947 = idf(docFreq=260, maxDocs=44421)
                0.0625 = fieldNorm(doc=172)
          0.011926821 = weight(abstract_txt:research in 172) [ClassicSimilarity], result of:
            0.011926821 = score(doc=172,freq=1.0), product of:
              0.06039696 = queryWeight, product of:
                1.0689597 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.01788233 = queryNorm
              0.19747387 = fieldWeight in 172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.0625 = fieldNorm(doc=172)
          0.015620154 = weight(abstract_txt:using in 172) [ClassicSimilarity], result of:
            0.015620154 = score(doc=172,freq=1.0), product of:
              0.07229731 = queryWeight, product of:
                1.1695395 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.01788233 = queryNorm
              0.21605442 = fieldWeight in 172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0625 = fieldNorm(doc=172)
          0.06224282 = weight(abstract_txt:questions in 172) [ClassicSimilarity], result of:
            0.06224282 = score(doc=172,freq=2.0), product of:
              0.14422752 = queryWeight, product of:
                1.6518776 = boost
                4.8825436 = idf(docFreq=914, maxDocs=44421)
                0.01788233 = queryNorm
              0.43155995 = fieldWeight in 172, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.8825436 = idf(docFreq=914, maxDocs=44421)
                0.0625 = fieldNorm(doc=172)
          0.26902866 = weight(abstract_txt:package in 172) [ClassicSimilarity], result of:
            0.26902866 = score(doc=172,freq=2.0), product of:
              0.438078 = queryWeight, product of:
                3.5259418 = boost
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.01788233 = queryNorm
              0.61411136 = fieldWeight in 172, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.0625 = fieldNorm(doc=172)
        0.24 = coord(6/25)
    
  2. Baliková, M.: Looking for the best way of subject access (2008) 0.10
    0.10107559 = sum of:
      0.10107559 = product of:
        0.4211483 = sum of:
          0.048821446 = weight(abstract_txt:answer in 3187) [ClassicSimilarity], result of:
            0.048821446 = score(doc=3187,freq=1.0), product of:
              0.105711535 = queryWeight, product of:
                5.9115076 = idf(docFreq=326, maxDocs=44421)
                0.01788233 = queryNorm
              0.46183652 = fieldWeight in 3187, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9115076 = idf(docFreq=326, maxDocs=44421)
                0.078125 = fieldNorm(doc=3187)
          0.014908526 = weight(abstract_txt:research in 3187) [ClassicSimilarity], result of:
            0.014908526 = score(doc=3187,freq=1.0), product of:
              0.06039696 = queryWeight, product of:
                1.0689597 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.01788233 = queryNorm
              0.24684234 = fieldWeight in 3187, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.078125 = fieldNorm(doc=3187)
          0.09779438 = weight(abstract_txt:answering in 3187) [ClassicSimilarity], result of:
            0.09779438 = score(doc=3187,freq=2.0), product of:
              0.1333259 = queryWeight, product of:
                1.1230422 = boost
                6.6388726 = idf(docFreq=157, maxDocs=44421)
                0.01788233 = queryNorm
              0.7334987 = fieldWeight in 3187, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.6388726 = idf(docFreq=157, maxDocs=44421)
                0.078125 = fieldNorm(doc=3187)
          0.027612792 = weight(abstract_txt:using in 3187) [ClassicSimilarity], result of:
            0.027612792 = score(doc=3187,freq=2.0), product of:
              0.07229731 = queryWeight, product of:
                1.1695395 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.01788233 = queryNorm
              0.38193387 = fieldWeight in 3187, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.078125 = fieldNorm(doc=3187)
          0.07780352 = weight(abstract_txt:questions in 3187) [ClassicSimilarity], result of:
            0.07780352 = score(doc=3187,freq=2.0), product of:
              0.14422752 = queryWeight, product of:
                1.6518776 = boost
                4.8825436 = idf(docFreq=914, maxDocs=44421)
                0.01788233 = queryNorm
              0.53944993 = fieldWeight in 3187, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.8825436 = idf(docFreq=914, maxDocs=44421)
                0.078125 = fieldNorm(doc=3187)
          0.15420765 = weight(abstract_txt:enables in 3187) [ClassicSimilarity], result of:
            0.15420765 = score(doc=3187,freq=2.0), product of:
              0.2275722 = queryWeight, product of:
                2.0749776 = boost
                6.133123 = idf(docFreq=261, maxDocs=44421)
                0.01788233 = queryNorm
              0.67762077 = fieldWeight in 3187, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.133123 = idf(docFreq=261, maxDocs=44421)
                0.078125 = fieldNorm(doc=3187)
        0.24 = coord(6/25)
    
  3. Erlinger, C.: Spatial planning and its need for national and regional bibliographies of grey literature (2019) 0.09
    0.085833974 = sum of:
      0.085833974 = product of:
        0.53646237 = sum of:
          0.017890232 = weight(abstract_txt:research in 274) [ClassicSimilarity], result of:
            0.017890232 = score(doc=274,freq=1.0), product of:
              0.06039696 = queryWeight, product of:
                1.0689597 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.01788233 = queryNorm
              0.2962108 = fieldWeight in 274, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.09375 = fieldNorm(doc=274)
          0.02343023 = weight(abstract_txt:using in 274) [ClassicSimilarity], result of:
            0.02343023 = score(doc=274,freq=1.0), product of:
              0.07229731 = queryWeight, product of:
                1.1695395 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.01788233 = queryNorm
              0.32408163 = fieldWeight in 274, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.09375 = fieldNorm(doc=274)
          0.0825683 = weight(abstract_txt:publications in 274) [ClassicSimilarity], result of:
            0.0825683 = score(doc=274,freq=1.0), product of:
              0.16742231 = queryWeight, product of:
                1.7797561 = boost
                5.260521 = idf(docFreq=626, maxDocs=44421)
                0.01788233 = queryNorm
              0.49317384 = fieldWeight in 274, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.260521 = idf(docFreq=626, maxDocs=44421)
                0.09375 = fieldNorm(doc=274)
          0.4125736 = weight(abstract_txt:python in 274) [ClassicSimilarity], result of:
            0.4125736 = score(doc=274,freq=1.0), product of:
              0.48933402 = queryWeight, product of:
                3.0426817 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.01788233 = queryNorm
              0.8431329 = fieldWeight in 274, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.09375 = fieldNorm(doc=274)
        0.16 = coord(4/25)
    
  4. Candela, G.; Chambers, S.; Sherratt, T.: ¬An approach to assess the quality of Jupyter projects published by GLAM institutions (2023) 0.09
    0.085092105 = sum of:
      0.085092105 = product of:
        0.53182566 = sum of:
          0.07231517 = weight(abstract_txt:reuse in 2193) [ClassicSimilarity], result of:
            0.07231517 = score(doc=2193,freq=1.0), product of:
              0.13736251 = queryWeight, product of:
                1.1399162 = boost
                6.738623 = idf(docFreq=142, maxDocs=44421)
                0.01788233 = queryNorm
              0.5264549 = fieldWeight in 2193, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.738623 = idf(docFreq=142, maxDocs=44421)
                0.078125 = fieldNorm(doc=2193)
          0.018923825 = weight(abstract_txt:such in 2193) [ClassicSimilarity], result of:
            0.018923825 = score(doc=2193,freq=1.0), product of:
              0.0708051 = queryWeight, product of:
                1.1574069 = boost
                3.42101 = idf(docFreq=3945, maxDocs=44421)
                0.01788233 = queryNorm
              0.2672664 = fieldWeight in 2193, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.42101 = idf(docFreq=3945, maxDocs=44421)
                0.078125 = fieldNorm(doc=2193)
          0.019525193 = weight(abstract_txt:using in 2193) [ClassicSimilarity], result of:
            0.019525193 = score(doc=2193,freq=1.0), product of:
              0.07229731 = queryWeight, product of:
                1.1695395 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.01788233 = queryNorm
              0.27006802 = fieldWeight in 2193, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.078125 = fieldNorm(doc=2193)
          0.42106146 = weight(abstract_txt:notebooks in 2193) [ClassicSimilarity], result of:
            0.42106146 = score(doc=2193,freq=4.0), product of:
              0.28006506 = queryWeight, product of:
                1.6276772 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.01788233 = queryNorm
              1.5034416 = fieldWeight in 2193, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.078125 = fieldNorm(doc=2193)
        0.16 = coord(4/25)
    
  5. Hussain, K.H.; Rajeev, J.S.: ¬The changing language technology and CDS/ ISIS : UNICODE and the emergence of OTF (2006) 0.08
    0.08222374 = sum of:
      0.08222374 = product of:
        0.41111872 = sum of:
          0.039178886 = weight(abstract_txt:great in 2496) [ClassicSimilarity], result of:
            0.039178886 = score(doc=2496,freq=1.0), product of:
              0.105931066 = queryWeight, product of:
                1.0010378 = boost
                5.9176426 = idf(docFreq=324, maxDocs=44421)
                0.01788233 = queryNorm
              0.36985266 = fieldWeight in 2496, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9176426 = idf(docFreq=324, maxDocs=44421)
                0.0625 = fieldNorm(doc=2496)
          0.041134167 = weight(abstract_txt:computing in 2496) [ClassicSimilarity], result of:
            0.041134167 = score(doc=2496,freq=1.0), product of:
              0.10942681 = queryWeight, product of:
                1.017421 = boost
                6.014492 = idf(docFreq=294, maxDocs=44421)
                0.01788233 = queryNorm
              0.37590575 = fieldWeight in 2496, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.014492 = idf(docFreq=294, maxDocs=44421)
                0.0625 = fieldNorm(doc=2496)
          0.022090234 = weight(abstract_txt:using in 2496) [ClassicSimilarity], result of:
            0.022090234 = score(doc=2496,freq=2.0), product of:
              0.07229731 = queryWeight, product of:
                1.1695395 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.01788233 = queryNorm
              0.3055471 = fieldWeight in 2496, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0625 = fieldNorm(doc=2496)
          0.118483424 = weight(abstract_txt:scripts in 2496) [ClassicSimilarity], result of:
            0.118483424 = score(doc=2496,freq=2.0), product of:
              0.17582624 = queryWeight, product of:
                1.2896761 = boost
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.01788233 = queryNorm
              0.67386657 = fieldWeight in 2496, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.0625 = fieldNorm(doc=2496)
          0.19023201 = weight(abstract_txt:package in 2496) [ClassicSimilarity], result of:
            0.19023201 = score(doc=2496,freq=1.0), product of:
              0.438078 = queryWeight, product of:
                3.5259418 = boost
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.01788233 = queryNorm
              0.43424234 = fieldWeight in 2496, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.0625 = fieldNorm(doc=2496)
        0.2 = coord(5/25)