Document (#43473)

Author
Andrushchenko, M.
Sandberg, K.
Turunen, R.
Marjanen, J.
Hatavara, M.
Kurunmäki, J.
Nummenmaa, T.
Hyvärinen, M.
Teräs, K.
Peltonen, J.
Nummenmaa, J.
Title
Using parsed and annotated corpora to analyze parliamentarians' talk in Finland
Source
Journal of the Association for Information Science and Technology. 73(2022) no.2, S.288-302
Year
2022
Series
JASIST special issue on digital humanities (DH): C. Methodological innovations, challenges, and new interest in DH
Abstract
We present a search system for grammatically analyzed corpora of Finnish parliamentary records and interviews with former parliamentarians, annotated with metadata of talk structure and involved parliamentarians, and discuss their use through carefully chosen digital humanities case studies. We first introduce the construction, contents, and principles of use of the corpora. Then we discuss the application of the search system and the corpora to study how politicians talk about power, how ideological terms are used in political speech, and how to identify narratives in the data. All case studies stem from questions in the humanities and the social sciences, but rely on the grammatically parsed corpora in both identifying and quantifying passages of interest. Finally, the paper discusses the role of natural language processing methods for questions in the (digital) humanities. It makes the claim that a digital humanities inquiry of parliamentary speech and interviews with politicians cannot only rely on computational humanities modeling, but needs to accommodate a range of perspectives starting with simple searches, quantitative exploration, and ending with modeling. Furthermore, the digital humanities need a more thorough discussion about how the utilization of tools from information science and technologies alter the research questions posed in the humanities.
Content
Vgl.: https://asistdl.onlinelibrary.wiley.com/doi/10.1002/asi.24500.
Theme
Computerlinguistik
Location
FIN

Similar documents (author)

  1. Sandberg-Fox, A.: Principal changes in the ISBD(ER) (1999) 4.95
    4.954854 = sum of:
      4.954854 = weight(author_txt:sandberg in 810) [ClassicSimilarity], result of:
        4.954854 = fieldWeight in 810, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.909708 = idf(docFreq=5, maxDocs=44421)
          0.5 = fieldNorm(doc=810)
    
  2. Sandberg-Fox, A.M.: ¬The microcomputer revolution (2001) 4.95
    4.954854 = sum of:
      4.954854 = weight(author_txt:sandberg in 409) [ClassicSimilarity], result of:
        4.954854 = fieldWeight in 409, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.909708 = idf(docFreq=5, maxDocs=44421)
          0.5 = fieldNorm(doc=409)
    
  3. Sandberg, J.; Jin, Q.: How should catalogers provide authority control for journal article authors? : Name identifiers in the linked data world (2016) 4.95
    4.954854 = sum of:
      4.954854 = weight(author_txt:sandberg in 138) [ClassicSimilarity], result of:
        4.954854 = fieldWeight in 138, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.909708 = idf(docFreq=5, maxDocs=44421)
          0.5 = fieldNorm(doc=138)
    
  4. Sandberg-Fox, A.; Byrum, J.D.: From ISBD(CF) to ISBD(ER) : process, policy, and provisions (1998) 4.34
    4.3354974 = sum of:
      4.3354974 = weight(author_txt:sandberg in 3583) [ClassicSimilarity], result of:
        4.3354974 = fieldWeight in 3583, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.909708 = idf(docFreq=5, maxDocs=44421)
          0.4375 = fieldNorm(doc=3583)
    

Similar documents (content)

  1. Clement, T.E.; Carter, D.: Connecting theory and practice in digital humanities information work (2017) 0.21
    0.20953177 = sum of:
      0.20953177 = product of:
        0.74832773 = sum of:
          0.018420083 = weight(abstract_txt:about in 4638) [ClassicSimilarity], result of:
            0.018420083 = score(doc=4638,freq=2.0), product of:
              0.05322087 = queryWeight, product of:
                3.9157467 = idf(docFreq=2405, maxDocs=44421)
                0.0135915 = queryNorm
              0.34610638 = fieldWeight in 4638, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9157467 = idf(docFreq=2405, maxDocs=44421)
                0.0625 = fieldNorm(doc=4638)
          0.05687682 = weight(abstract_txt:narratives in 4638) [ClassicSimilarity], result of:
            0.05687682 = score(doc=4638,freq=1.0), product of:
              0.112852484 = queryWeight, product of:
                1.0296736 = boost
                8.063882 = idf(docFreq=37, maxDocs=44421)
                0.0135915 = queryNorm
              0.5039926 = fieldWeight in 4638, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.063882 = idf(docFreq=37, maxDocs=44421)
                0.0625 = fieldNorm(doc=4638)
          0.0378039 = weight(abstract_txt:interviews in 4638) [ClassicSimilarity], result of:
            0.0378039 = score(doc=4638,freq=1.0), product of:
              0.10829007 = queryWeight, product of:
                1.4264394 = boost
                5.5855756 = idf(docFreq=452, maxDocs=44421)
                0.0135915 = queryNorm
              0.34909847 = fieldWeight in 4638, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5855756 = idf(docFreq=452, maxDocs=44421)
                0.0625 = fieldNorm(doc=4638)
          0.014609662 = weight(abstract_txt:with in 4638) [ClassicSimilarity], result of:
            0.014609662 = score(doc=4638,freq=3.0), product of:
              0.054066792 = queryWeight, product of:
                1.593655 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.0135915 = queryNorm
              0.27021506 = fieldWeight in 4638, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.0625 = fieldNorm(doc=4638)
          0.078622825 = weight(abstract_txt:digital in 4638) [ClassicSimilarity], result of:
            0.078622825 = score(doc=4638,freq=5.0), product of:
              0.13000199 = queryWeight, product of:
                2.2102888 = boost
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.0135915 = queryNorm
              0.6047817 = fieldWeight in 4638, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.0625 = fieldNorm(doc=4638)
          0.18683107 = weight(abstract_txt:corpora in 4638) [ClassicSimilarity], result of:
            0.18683107 = score(doc=4638,freq=1.0), product of:
              0.42642596 = queryWeight, product of:
                4.4755955 = boost
                7.01012 = idf(docFreq=108, maxDocs=44421)
                0.0135915 = queryNorm
              0.4381325 = fieldWeight in 4638, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.01012 = idf(docFreq=108, maxDocs=44421)
                0.0625 = fieldNorm(doc=4638)
          0.35516343 = weight(abstract_txt:humanities in 4638) [ClassicSimilarity], result of:
            0.35516343 = score(doc=4638,freq=5.0), product of:
              0.4281038 = queryWeight, product of:
                5.306004 = boost
                5.9362764 = idf(docFreq=318, maxDocs=44421)
                0.0135915 = queryNorm
              0.8296199 = fieldWeight in 4638, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.9362764 = idf(docFreq=318, maxDocs=44421)
                0.0625 = fieldNorm(doc=4638)
        0.28 = coord(7/25)
    
  2. Poole, A.H.: ¬"A greatly unexplored area" : digital curation and innovation in digital humanities (2017) 0.13
    0.12896474 = sum of:
      0.12896474 = product of:
        0.5373531 = sum of:
          0.023912264 = weight(abstract_txt:case in 4696) [ClassicSimilarity], result of:
            0.023912264 = score(doc=4696,freq=1.0), product of:
              0.079795435 = queryWeight, product of:
                1.2244697 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.0135915 = queryNorm
              0.29966956 = fieldWeight in 4696, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.0625 = fieldNorm(doc=4696)
          0.0378039 = weight(abstract_txt:interviews in 4696) [ClassicSimilarity], result of:
            0.0378039 = score(doc=4696,freq=1.0), product of:
              0.10829007 = queryWeight, product of:
                1.4264394 = boost
                5.5855756 = idf(docFreq=452, maxDocs=44421)
                0.0135915 = queryNorm
              0.34909847 = fieldWeight in 4696, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5855756 = idf(docFreq=452, maxDocs=44421)
                0.0625 = fieldNorm(doc=4696)
          0.014609662 = weight(abstract_txt:with in 4696) [ClassicSimilarity], result of:
            0.014609662 = score(doc=4696,freq=3.0), product of:
              0.054066792 = queryWeight, product of:
                1.593655 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.0135915 = queryNorm
              0.27021506 = fieldWeight in 4696, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.0625 = fieldNorm(doc=4696)
          0.037875857 = weight(abstract_txt:questions in 4696) [ClassicSimilarity], result of:
            0.037875857 = score(doc=4696,freq=1.0), product of:
              0.12411845 = queryWeight, product of:
                1.8703496 = boost
                4.8825436 = idf(docFreq=914, maxDocs=44421)
                0.0135915 = queryNorm
              0.30515897 = fieldWeight in 4696, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8825436 = idf(docFreq=914, maxDocs=44421)
                0.0625 = fieldNorm(doc=4696)
          0.10548359 = weight(abstract_txt:digital in 4696) [ClassicSimilarity], result of:
            0.10548359 = score(doc=4696,freq=9.0), product of:
              0.13000199 = queryWeight, product of:
                2.2102888 = boost
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.0135915 = queryNorm
              0.8113998 = fieldWeight in 4696, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.0625 = fieldNorm(doc=4696)
          0.3176678 = weight(abstract_txt:humanities in 4696) [ClassicSimilarity], result of:
            0.3176678 = score(doc=4696,freq=4.0), product of:
              0.4281038 = queryWeight, product of:
                5.306004 = boost
                5.9362764 = idf(docFreq=318, maxDocs=44421)
                0.0135915 = queryNorm
              0.74203455 = fieldWeight in 4696, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.9362764 = idf(docFreq=318, maxDocs=44421)
                0.0625 = fieldNorm(doc=4696)
        0.24 = coord(6/25)
    
  3. Thelwall, M.; Delgado, M.M.: Arts and humanities research evaluation : no metrics please, just data (2015) 0.12
    0.11923826 = sum of:
      0.11923826 = product of:
        0.5961913 = sum of:
          0.023025105 = weight(abstract_txt:about in 3313) [ClassicSimilarity], result of:
            0.023025105 = score(doc=3313,freq=2.0), product of:
              0.05322087 = queryWeight, product of:
                3.9157467 = idf(docFreq=2405, maxDocs=44421)
                0.0135915 = queryNorm
              0.43263298 = fieldWeight in 3313, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9157467 = idf(docFreq=2405, maxDocs=44421)
                0.078125 = fieldNorm(doc=3313)
          0.029890329 = weight(abstract_txt:case in 3313) [ClassicSimilarity], result of:
            0.029890329 = score(doc=3313,freq=1.0), product of:
              0.079795435 = queryWeight, product of:
                1.2244697 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.0135915 = queryNorm
              0.37458694 = fieldWeight in 3313, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.078125 = fieldNorm(doc=3313)
          0.02108723 = weight(abstract_txt:with in 3313) [ClassicSimilarity], result of:
            0.02108723 = score(doc=3313,freq=4.0), product of:
              0.054066792 = queryWeight, product of:
                1.593655 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.0135915 = queryNorm
              0.39002183 = fieldWeight in 3313, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.078125 = fieldNorm(doc=3313)
          0.07823431 = weight(abstract_txt:rely in 3313) [ClassicSimilarity], result of:
            0.07823431 = score(doc=3313,freq=1.0), product of:
              0.15154992 = queryWeight, product of:
                1.6874732 = boost
                6.6077175 = idf(docFreq=162, maxDocs=44421)
                0.0135915 = queryNorm
              0.51622796 = fieldWeight in 3313, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6077175 = idf(docFreq=162, maxDocs=44421)
                0.078125 = fieldNorm(doc=3313)
          0.4439543 = weight(abstract_txt:humanities in 3313) [ClassicSimilarity], result of:
            0.4439543 = score(doc=3313,freq=5.0), product of:
              0.4281038 = queryWeight, product of:
                5.306004 = boost
                5.9362764 = idf(docFreq=318, maxDocs=44421)
                0.0135915 = queryNorm
              1.0370249 = fieldWeight in 3313, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.9362764 = idf(docFreq=318, maxDocs=44421)
                0.078125 = fieldNorm(doc=3313)
        0.2 = coord(5/25)
    
  4. Poole, A.H.; Garwood, D.A.: Digging into data management in public-funded, international research in digital humanities (2020) 0.11
    0.112196386 = sum of:
      0.112196386 = product of:
        0.46748495 = sum of:
          0.029890329 = weight(abstract_txt:case in 509) [ClassicSimilarity], result of:
            0.029890329 = score(doc=509,freq=1.0), product of:
              0.079795435 = queryWeight, product of:
                1.2244697 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.0135915 = queryNorm
              0.37458694 = fieldWeight in 509, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.078125 = fieldNorm(doc=509)
          0.04725487 = weight(abstract_txt:interviews in 509) [ClassicSimilarity], result of:
            0.04725487 = score(doc=509,freq=1.0), product of:
              0.10829007 = queryWeight, product of:
                1.4264394 = boost
                5.5855756 = idf(docFreq=452, maxDocs=44421)
                0.0135915 = queryNorm
              0.43637308 = fieldWeight in 509, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5855756 = idf(docFreq=452, maxDocs=44421)
                0.078125 = fieldNorm(doc=509)
          0.018262077 = weight(abstract_txt:with in 509) [ClassicSimilarity], result of:
            0.018262077 = score(doc=509,freq=3.0), product of:
              0.054066792 = queryWeight, product of:
                1.593655 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.0135915 = queryNorm
              0.33776882 = fieldWeight in 509, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.078125 = fieldNorm(doc=509)
          0.047344822 = weight(abstract_txt:questions in 509) [ClassicSimilarity], result of:
            0.047344822 = score(doc=509,freq=1.0), product of:
              0.12411845 = queryWeight, product of:
                1.8703496 = boost
                4.8825436 = idf(docFreq=914, maxDocs=44421)
                0.0135915 = queryNorm
              0.38144872 = fieldWeight in 509, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8825436 = idf(docFreq=914, maxDocs=44421)
                0.078125 = fieldNorm(doc=509)
          0.043951493 = weight(abstract_txt:digital in 509) [ClassicSimilarity], result of:
            0.043951493 = score(doc=509,freq=1.0), product of:
              0.13000199 = queryWeight, product of:
                2.2102888 = boost
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.0135915 = queryNorm
              0.33808324 = fieldWeight in 509, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.078125 = fieldNorm(doc=509)
          0.28078136 = weight(abstract_txt:humanities in 509) [ClassicSimilarity], result of:
            0.28078136 = score(doc=509,freq=2.0), product of:
              0.4281038 = queryWeight, product of:
                5.306004 = boost
                5.9362764 = idf(docFreq=318, maxDocs=44421)
                0.0135915 = queryNorm
              0.6558721 = fieldWeight in 509, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9362764 = idf(docFreq=318, maxDocs=44421)
                0.078125 = fieldNorm(doc=509)
        0.24 = coord(6/25)
    
  5. Deegan, M.: Networking and the discipline (1995) 0.09
    0.08988058 = sum of:
      0.08988058 = product of:
        0.74900484 = sum of:
          0.056095663 = weight(abstract_txt:studies in 6655) [ClassicSimilarity], result of:
            0.056095663 = score(doc=6655,freq=5.0), product of:
              0.062873304 = queryWeight, product of:
                1.0869064 = boost
                4.25605 = idf(docFreq=1711, maxDocs=44421)
                0.0135915 = queryNorm
              0.8922016 = fieldWeight in 6655, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.25605 = idf(docFreq=1711, maxDocs=44421)
                0.09375 = fieldNorm(doc=6655)
          0.28024662 = weight(abstract_txt:corpora in 6655) [ClassicSimilarity], result of:
            0.28024662 = score(doc=6655,freq=1.0), product of:
              0.42642596 = queryWeight, product of:
                4.4755955 = boost
                7.01012 = idf(docFreq=108, maxDocs=44421)
                0.0135915 = queryNorm
              0.6571987 = fieldWeight in 6655, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.01012 = idf(docFreq=108, maxDocs=44421)
                0.09375 = fieldNorm(doc=6655)
          0.41266257 = weight(abstract_txt:humanities in 6655) [ClassicSimilarity], result of:
            0.41266257 = score(doc=6655,freq=3.0), product of:
              0.4281038 = queryWeight, product of:
                5.306004 = boost
                5.9362764 = idf(docFreq=318, maxDocs=44421)
                0.0135915 = queryNorm
              0.9639311 = fieldWeight in 6655, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.9362764 = idf(docFreq=318, maxDocs=44421)
                0.09375 = fieldNorm(doc=6655)
        0.12 = coord(3/25)