Document (#33019)

Author
Gey, F.C.
Kando, N.
Peters, C.
Title
Cross-Language Information Retrieval : the way ahead
Source
Information processing and management. 41(2005) no.3, pp. 415-432
Year
2005
Abstract
This introductory paper not only covers the research content of the articles in this special issue of IP&M but also attempts to characterize the state of the art in the Cross-Language Information Retrieval (CLIR) domain. We present our view of some major directions for CLIR research in the future. In particular, we find that insufficient attention has been given to the Web as a resource for multilingual research, and to languages which are spoken by hundreds of millions of people in the world but have been mainly neglected by the CLIR research community. In addition, we find that most CLIR evaluation has focussed narrowly on the news genre to the exclusion of other important genres such as scientific and technical literature. The paper concludes by describing an ambitious 5-year research plan proposed by James Mayfield and Paul McNamee.
Theme
Multilingual problems

Similar documents (author)

  1. Kando, N.: Information concepts reexamined (1994) 2.60
    2.601759 = sum of:
      2.601759 = product of:
        5.203518 = sum of:
          5.203518 = weight(author_txt:kando in 2194) [ClassicSimilarity], result of:
            5.203518 = score(doc=2194,freq=1.0), product of:
              0.8534242 = queryWeight, product of:
                1.2795969 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.068365924 = queryNorm
              6.0972233 = fieldWeight in 2194, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.625 = fieldNorm(doc=2194)
        0.5 = coord(1/2)
    
  2. Hu, X.; Kando, N.: Task complexity and difficulty in music information retrieval (2017) 2.08
    2.081407 = sum of:
      2.081407 = product of:
        4.162814 = sum of:
          4.162814 = weight(author_txt:kando in 4690) [ClassicSimilarity], result of:
            4.162814 = score(doc=4690,freq=1.0), product of:
              0.8534242 = queryWeight, product of:
                1.2795969 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.068365924 = queryNorm
              4.8777785 = fieldWeight in 4690, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.5 = fieldNorm(doc=4690)
        0.5 = coord(1/2)
    
  3. Fujii, A.; Iwayama, M.; Kando, N.: Introduction to the special issue on patent processing (2007) 1.56
    1.5610553 = sum of:
      1.5610553 = product of:
        3.1221106 = sum of:
          3.1221106 = weight(author_txt:kando in 1929) [ClassicSimilarity], result of:
            3.1221106 = score(doc=1929,freq=1.0), product of:
              0.8534242 = queryWeight, product of:
                1.2795969 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.068365924 = queryNorm
              3.6583338 = fieldWeight in 1929, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.375 = fieldNorm(doc=1929)
        0.5 = coord(1/2)
    
  4. Kuriyama, K.; Kando, N.; Nozue, T.; Eguchi, K.: Pooling for a large-scale test collection : an analysis of the search results from the First NTCIR Workshop (2002) 1.30
    1.3008795 = sum of:
      1.3008795 = product of:
        2.601759 = sum of:
          2.601759 = weight(author_txt:kando in 4830) [ClassicSimilarity], result of:
            2.601759 = score(doc=4830,freq=1.0), product of:
              0.8534242 = queryWeight, product of:
                1.2795969 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.068365924 = queryNorm
              3.0486116 = fieldWeight in 4830, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.3125 = fieldNorm(doc=4830)
        0.5 = coord(1/2)
    
  5. Rodrigo, A.; Peñas, A.; Miyao, Y.; Kando, N.: Do systems pass university entrance exams? (2018) 1.30
    1.3008795 = sum of:
      1.3008795 = product of:
        2.601759 = sum of:
          2.601759 = weight(author_txt:kando in 54) [ClassicSimilarity], result of:
            2.601759 = score(doc=54,freq=1.0), product of:
              0.8534242 = queryWeight, product of:
                1.2795969 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.068365924 = queryNorm
              3.0486116 = fieldWeight in 54, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.3125 = fieldNorm(doc=54)
        0.5 = coord(1/2)
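
The author-match trees above each score a single matching term. As a minimal sketch, the arithmetic for document 2194 can be reproduced from the printed values, assuming tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which is consistent with the idf figures shown here. The function names are illustrative and are not part of Lucene.

```python
import math


def idf(doc_freq: int, max_docs: int) -> float:
    # Assumed formula; it agrees with idf(docFreq=6, maxDocs=44421) = 9.755557 above.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost, query_norm, freq, field_norm, doc_freq, max_docs):
    term_idf = idf(doc_freq, max_docs)
    # queryWeight = boost * idf * queryNorm
    query_weight = boost * term_idf * query_norm
    # fieldWeight = tf * idf * fieldNorm, with tf = sqrt(freq)
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


# Values copied from the explain tree for doc 2194 (author_txt:kando).
raw_score = term_score(
    boost=1.2795969,
    query_norm=0.068365924,
    freq=1.0,
    field_norm=0.625,
    doc_freq=6,
    max_docs=44421,
)
# coord(1/2): one of two query clauses matched.
final_score = raw_score * 0.5
print(raw_score, final_score)
```

Up to floating-point rounding, the two printed numbers should agree with the 5.203518 and 2.601759 lines of the first tree. Only fieldNorm differs across the five author matches (0.625, 0.5, 0.375, 0.3125), which is what produces the descending scores.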
    

Similar documents (content)

  1. Kishida, K.: Technical issues of cross-language information retrieval : a review (2005) 0.28
    0.27852133 = sum of:
      0.27852133 = product of:
        1.1605055 = sum of:
          0.02353283 = weight(abstract_txt:paper in 2019) [ClassicSimilarity], result of:
            0.02353283 = score(doc=2019,freq=1.0), product of:
              0.062155265 = queryWeight, product of:
                1.0600579 = boost
                3.4616103 = idf(docFreq=3788, maxDocs=44421)
                0.016938314 = queryNorm
              0.37861362 = fieldWeight in 2019, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4616103 = idf(docFreq=3788, maxDocs=44421)
                0.109375 = fieldNorm(doc=2019)
          0.023837812 = weight(abstract_txt:retrieval in 2019) [ClassicSimilarity], result of:
            0.023837812 = score(doc=2019,freq=1.0), product of:
              0.06269113 = queryWeight, product of:
                1.0646176 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.016938314 = queryNorm
              0.3802422 = fieldWeight in 2019, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.109375 = fieldNorm(doc=2019)
          0.08233535 = weight(abstract_txt:language in 2019) [ClassicSimilarity], result of:
            0.08233535 = score(doc=2019,freq=4.0), product of:
              0.09024006 = queryWeight, product of:
                1.2772924 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.016938314 = queryNorm
              0.9124035 = fieldWeight in 2019, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.109375 = fieldNorm(doc=2019)
          0.09969444 = weight(abstract_txt:cross in 2019) [ClassicSimilarity], result of:
            0.09969444 = score(doc=2019,freq=1.0), product of:
              0.16273306 = queryWeight, product of:
                1.7152543 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.016938314 = queryNorm
              0.6126256 = fieldWeight in 2019, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.109375 = fieldNorm(doc=2019)
          0.04473717 = weight(abstract_txt:research in 2019) [ClassicSimilarity], result of:
            0.04473717 = score(doc=2019,freq=1.0), product of:
              0.1294556 = queryWeight, product of:
                2.4189174 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.016938314 = queryNorm
              0.34557927 = fieldWeight in 2019, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.109375 = fieldNorm(doc=2019)
          0.8863679 = weight(abstract_txt:clir in 2019) [ClassicSimilarity], result of:
            0.8863679 = score(doc=2019,freq=2.0), product of:
              0.69840044 = queryWeight, product of:
                5.0252523 = boost
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.016938314 = queryNorm
              1.26914 = fieldWeight in 2019, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.109375 = fieldNorm(doc=2019)
        0.24 = coord(6/25)
    
  2. Oard, D.W.: Multilingual information access (2009) 0.26
    0.26255265 = sum of:
      0.26255265 = product of:
        1.0939693 = sum of:
          0.023837812 = weight(abstract_txt:retrieval in 837) [ClassicSimilarity], result of:
            0.023837812 = score(doc=837,freq=1.0), product of:
              0.06269113 = queryWeight, product of:
                1.0646176 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.016938314 = queryNorm
              0.3802422 = fieldWeight in 837, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.109375 = fieldNorm(doc=837)
          0.07130451 = weight(abstract_txt:language in 837) [ClassicSimilarity], result of:
            0.07130451 = score(doc=837,freq=3.0), product of:
              0.09024006 = queryWeight, product of:
                1.2772924 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.016938314 = queryNorm
              0.79016465 = fieldWeight in 837, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.109375 = fieldNorm(doc=837)
          0.20634027 = weight(abstract_txt:narrowly in 837) [ClassicSimilarity], result of:
            0.20634027 = score(doc=837,freq=1.0), product of:
              0.20976892 = queryWeight, product of:
                1.3770388 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.016938314 = queryNorm
              0.9836551 = fieldWeight in 837, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.109375 = fieldNorm(doc=837)
          0.06603558 = weight(abstract_txt:find in 837) [ClassicSimilarity], result of:
            0.06603558 = score(doc=837,freq=1.0), product of:
              0.12365562 = queryWeight, product of:
                1.495194 = boost
                4.8825436 = idf(docFreq=914, maxDocs=44421)
                0.016938314 = queryNorm
              0.5340282 = fieldWeight in 837, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8825436 = idf(docFreq=914, maxDocs=44421)
                0.109375 = fieldNorm(doc=837)
          0.09969444 = weight(abstract_txt:cross in 837) [ClassicSimilarity], result of:
            0.09969444 = score(doc=837,freq=1.0), product of:
              0.16273306 = queryWeight, product of:
                1.7152543 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.016938314 = queryNorm
              0.6126256 = fieldWeight in 837, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.109375 = fieldNorm(doc=837)
          0.6267568 = weight(abstract_txt:clir in 837) [ClassicSimilarity], result of:
            0.6267568 = score(doc=837,freq=1.0), product of:
              0.69840044 = queryWeight, product of:
                5.0252523 = boost
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.016938314 = queryNorm
              0.8974175 = fieldWeight in 837, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.109375 = fieldNorm(doc=837)
        0.24 = coord(6/25)
    
  3. Oard, D.W.; He, D.; Wang, J.: User-assisted query translation for interactive cross-language information retrieval (2008) 0.26
    0.25528532 = sum of:
      0.25528532 = product of:
        0.91173327 = sum of:
          0.02377175 = weight(abstract_txt:paper in 3030) [ClassicSimilarity], result of:
            0.02377175 = score(doc=3030,freq=2.0), product of:
              0.062155265 = queryWeight, product of:
                1.0600579 = boost
                3.4616103 = idf(docFreq=3788, maxDocs=44421)
                0.016938314 = queryNorm
              0.38245752 = fieldWeight in 3030, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4616103 = idf(docFreq=3788, maxDocs=44421)
                0.078125 = fieldNorm(doc=3030)
          0.024079828 = weight(abstract_txt:retrieval in 3030) [ClassicSimilarity], result of:
            0.024079828 = score(doc=3030,freq=2.0), product of:
              0.06269113 = queryWeight, product of:
                1.0646176 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.016938314 = queryNorm
              0.3841026 = fieldWeight in 3030, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=3030)
          0.05093179 = weight(abstract_txt:language in 3030) [ClassicSimilarity], result of:
            0.05093179 = score(doc=3030,freq=3.0), product of:
              0.09024006 = queryWeight, product of:
                1.2772924 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.016938314 = queryNorm
              0.5644033 = fieldWeight in 3030, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.078125 = fieldNorm(doc=3030)
          0.047168277 = weight(abstract_txt:find in 3030) [ClassicSimilarity], result of:
            0.047168277 = score(doc=3030,freq=1.0), product of:
              0.12365562 = queryWeight, product of:
                1.495194 = boost
                4.8825436 = idf(docFreq=914, maxDocs=44421)
                0.016938314 = queryNorm
              0.38144872 = fieldWeight in 3030, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8825436 = idf(docFreq=914, maxDocs=44421)
                0.078125 = fieldNorm(doc=3030)
          0.1007066 = weight(abstract_txt:cross in 3030) [ClassicSimilarity], result of:
            0.1007066 = score(doc=3030,freq=2.0), product of:
              0.16273306 = queryWeight, product of:
                1.7152543 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.016938314 = queryNorm
              0.61884534 = fieldWeight in 3030, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.078125 = fieldNorm(doc=3030)
          0.031955123 = weight(abstract_txt:research in 3030) [ClassicSimilarity], result of:
            0.031955123 = score(doc=3030,freq=1.0), product of:
              0.1294556 = queryWeight, product of:
                2.4189174 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.016938314 = queryNorm
              0.24684234 = fieldWeight in 3030, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.078125 = fieldNorm(doc=3030)
          0.63311994 = weight(abstract_txt:clir in 3030) [ClassicSimilarity], result of:
            0.63311994 = score(doc=3030,freq=2.0), product of:
              0.69840044 = queryWeight, product of:
                5.0252523 = boost
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.016938314 = queryNorm
              0.90652853 = fieldWeight in 3030, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.078125 = fieldNorm(doc=3030)
        0.28 = coord(7/25)
    
  4. Xu, J.; Weischedel, R.: Empirical studies on the impact of lexical resources on CLIR performance (2005) 0.21
    0.20839977 = sum of:
      0.20839977 = product of:
        1.0419989 = sum of:
          0.016809165 = weight(abstract_txt:paper in 2020) [ClassicSimilarity], result of:
            0.016809165 = score(doc=2020,freq=1.0), product of:
              0.062155265 = queryWeight, product of:
                1.0600579 = boost
                3.4616103 = idf(docFreq=3788, maxDocs=44421)
                0.016938314 = queryNorm
              0.2704383 = fieldWeight in 2020, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4616103 = idf(docFreq=3788, maxDocs=44421)
                0.078125 = fieldNorm(doc=2020)
          0.01702701 = weight(abstract_txt:retrieval in 2020) [ClassicSimilarity], result of:
            0.01702701 = score(doc=2020,freq=1.0), product of:
              0.06269113 = queryWeight, product of:
                1.0646176 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.016938314 = queryNorm
              0.27160156 = fieldWeight in 2020, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=2020)
          0.04158563 = weight(abstract_txt:language in 2020) [ClassicSimilarity], result of:
            0.04158563 = score(doc=2020,freq=2.0), product of:
              0.09024006 = queryWeight, product of:
                1.2772924 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.016938314 = queryNorm
              0.46083337 = fieldWeight in 2020, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.078125 = fieldNorm(doc=2020)
          0.07121032 = weight(abstract_txt:cross in 2020) [ClassicSimilarity], result of:
            0.07121032 = score(doc=2020,freq=1.0), product of:
              0.16273306 = queryWeight, product of:
                1.7152543 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.016938314 = queryNorm
              0.43758973 = fieldWeight in 2020, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.078125 = fieldNorm(doc=2020)
          0.8953668 = weight(abstract_txt:clir in 2020) [ClassicSimilarity], result of:
            0.8953668 = score(doc=2020,freq=4.0), product of:
              0.69840044 = queryWeight, product of:
                5.0252523 = boost
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.016938314 = queryNorm
              1.282025 = fieldWeight in 2020, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.078125 = fieldNorm(doc=2020)
        0.2 = coord(5/25)
    
  5. Levow, G.-A.; Oard, D.W.; Resnik, P.: Dictionary-based techniques for cross-language information retrieval (2005) 0.20
    0.19901916 = sum of:
      0.19901916 = product of:
        0.8292465 = sum of:
          0.01702701 = weight(abstract_txt:retrieval in 2025) [ClassicSimilarity], result of:
            0.01702701 = score(doc=2025,freq=1.0), product of:
              0.06269113 = queryWeight, product of:
                1.0646176 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.016938314 = queryNorm
              0.27160156 = fieldWeight in 2025, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=2025)
          0.019135306 = weight(abstract_txt:been in 2025) [ClassicSimilarity], result of:
            0.019135306 = score(doc=2025,freq=1.0), product of:
              0.06776479 = queryWeight, product of:
                1.1068599 = boost
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.016938314 = queryNorm
              0.2823783 = fieldWeight in 2025, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.078125 = fieldNorm(doc=2025)
          0.04158563 = weight(abstract_txt:language in 2025) [ClassicSimilarity], result of:
            0.04158563 = score(doc=2025,freq=2.0), product of:
              0.09024006 = queryWeight, product of:
                1.2772924 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.016938314 = queryNorm
              0.46083337 = fieldWeight in 2025, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.078125 = fieldNorm(doc=2025)
          0.047168277 = weight(abstract_txt:find in 2025) [ClassicSimilarity], result of:
            0.047168277 = score(doc=2025,freq=1.0), product of:
              0.12365562 = queryWeight, product of:
                1.495194 = boost
                4.8825436 = idf(docFreq=914, maxDocs=44421)
                0.016938314 = queryNorm
              0.38144872 = fieldWeight in 2025, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8825436 = idf(docFreq=914, maxDocs=44421)
                0.078125 = fieldNorm(doc=2025)
          0.07121032 = weight(abstract_txt:cross in 2025) [ClassicSimilarity], result of:
            0.07121032 = score(doc=2025,freq=1.0), product of:
              0.16273306 = queryWeight, product of:
                1.7152543 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.016938314 = queryNorm
              0.43758973 = fieldWeight in 2025, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.078125 = fieldNorm(doc=2025)
          0.63311994 = weight(abstract_txt:clir in 2025) [ClassicSimilarity], result of:
            0.63311994 = score(doc=2025,freq=2.0), product of:
              0.69840044 = queryWeight, product of:
                5.0252523 = boost
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.016938314 = queryNorm
              0.90652853 = fieldWeight in 2025, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.078125 = fieldNorm(doc=2025)
        0.24 = coord(6/25)
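
The content-match trees above sum several per-term scores and then scale the sum by coord, the fraction of query terms that matched. The following sketch recomputes the score for document 2019 from the printed boost, docFreq and freq values, under the same assumed tf and idf formulas (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))). The constant and function names are illustrative only.

```python
import math

MAX_DOCS = 44421
QUERY_NORM = 0.016938314
FIELD_NORM = 0.109375  # fieldNorm(doc=2019)
QUERY_TERM_COUNT = 25  # denominator of coord(6/25)

# (term, boost, docFreq, freq) copied from the explain tree for doc 2019.
MATCHED_TERMS = [
    ("paper", 1.0600579, 3788, 1.0),
    ("retrieval", 1.0646176, 3732, 1.0),
    ("language", 1.2772924, 1863, 4.0),
    ("cross", 1.7152543, 445, 1.0),
    ("research", 2.4189174, 5124, 1.0),
    ("clir", 5.0252523, 32, 2.0),
]


def term_score(boost: float, doc_freq: int, freq: float) -> float:
    idf = 1.0 + math.log(MAX_DOCS / (doc_freq + 1))
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * FIELD_NORM
    return query_weight * field_weight


per_term = {t: term_score(b, df, f) for t, b, df, f in MATCHED_TERMS}
term_sum = sum(per_term.values())
coord = len(MATCHED_TERMS) / QUERY_TERM_COUNT
doc_score = term_sum * coord
print(per_term, term_sum, doc_score)
```

The rare term "clir" (docFreq=32) contributes most of the sum because idf enters the product twice, once through queryWeight and once through fieldWeight.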