Document (#33343)

Author
Klein, S.T.
Title
Processing queries with metrical constraints in XML-based IR systems
Source
Journal of the American Society for Information Science and Technology. 59(2008) no.1, pp. 86-97
Year
2008
Abstract
XML documents combine features from classical IR systems allowing free text, with explicit structures as in databases. Many query languages have been specially designed for IR applications on XML documents. This work concentrates on a special type of language for which the problem of processing queries including metrical constraints is investigated. The main question is how to define the distance between terms in different locations of the XML tree in an intuitively justifiable way, without jeopardizing the ability to get good retrieval results in terms of recall and precision. A new definition is given and its usefulness is shown on several examples from the INEX collection.
Object
XML

Similar documents (author)

  1. Klein, W.: Organisation des Wissens durch Sprache : Konsequenzen für die maschinelle Sprachanalyse (1977) 4.96
    4.9626675 = sum of:
      4.9626675 = weight(author_txt:klein in 1747) [ClassicSimilarity], result of:
        4.9626675 = fieldWeight in 1747, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.9402676 = idf(docFreq=42, maxDocs=44421)
          0.625 = fieldNorm(doc=1747)
    
  2. Klein, H.: GENIOS jetzt mit Thesaurus-Suche (1993) 4.96
    4.9626675 = sum of:
      4.9626675 = weight(author_txt:klein in 7536) [ClassicSimilarity], result of:
        4.9626675 = fieldWeight in 7536, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.9402676 = idf(docFreq=42, maxDocs=44421)
          0.625 = fieldNorm(doc=7536)
    
  3. Klein, R.D.: The problem of cataloguing world literature using the Nippon Decimal Classification (1994) 4.96
    4.9626675 = sum of:
      4.9626675 = weight(author_txt:klein in 935) [ClassicSimilarity], result of:
        4.9626675 = fieldWeight in 935, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.9402676 = idf(docFreq=42, maxDocs=44421)
          0.625 = fieldNorm(doc=935)
    
  4. Klein, G.M.: Is there a standard default keyword operator? : a bibliometric analysis of processing options chosen by libraries to execute keyword searches in online public access catalogs (1994) 4.96
    4.9626675 = sum of:
      4.9626675 = weight(author_txt:klein in 2268) [ClassicSimilarity], result of:
        4.9626675 = fieldWeight in 2268, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.9402676 = idf(docFreq=42, maxDocs=44421)
          0.625 = fieldNorm(doc=2268)
    
  5. Klein, J.T.: Interdisciplinary needs : the current context (1996) 4.96
    4.9626675 = sum of:
      4.9626675 = weight(author_txt:klein in 245) [ClassicSimilarity], result of:
        4.9626675 = fieldWeight in 245, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.9402676 = idf(docFreq=42, maxDocs=44421)
          0.625 = fieldNorm(doc=245)
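
Note on the author scores above: all five entries share the same arithmetic, because each matches the single term "klein" once in a field with the same norm. The following Python sketch reproduces the listed value. It is only an illustration: the function names are hypothetical, and the idf formula, 1 + ln(maxDocs / (docFreq + 1)), is assumed because it matches the figures printed in the listing, not taken from the catalog's configuration.

```python
import math

def tf(freq):
    # ClassicSimilarity-style term frequency: square root of the raw count.
    return math.sqrt(freq)

def idf(doc_freq, max_docs):
    # Assumed idf variant; it reproduces idf(docFreq=42, maxDocs=44421) = 7.9402676.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def field_weight(freq, doc_freq, max_docs, field_norm):
    # fieldWeight = tf * idf * fieldNorm, as in the explain tree above.
    return tf(freq) * idf(doc_freq, max_docs) * field_norm

# Values copied from the listing: termFreq=1.0, docFreq=42, maxDocs=44421, fieldNorm=0.625.
author_score = field_weight(1.0, 42, 44421, 0.625)
print(round(author_score, 4))  # 4.9627
```

The result agrees with the displayed 4.9626675 up to single-precision rounding.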
    

Similar documents (content)

  1. Klein, S.T.: On the use of negation in Boolean IR queries. (2009) 0.30
    0.30426303 = sum of:
      0.30426303 = product of:
        1.0866537 = sum of:
          0.051091466 = weight(abstract_txt:shown in 914) [ClassicSimilarity], result of:
            0.051091466 = score(doc=914,freq=1.0), product of:
              0.09749117 = queryWeight, product of:
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.01744028 = queryNorm
              0.5240625 = fieldWeight in 914, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.09375 = fieldNorm(doc=914)
          0.057195988 = weight(abstract_txt:investigated in 914) [ClassicSimilarity], result of:
            0.057195988 = score(doc=914,freq=1.0), product of:
              0.10510985 = queryWeight, product of:
                1.0383388 = boost
                5.8043137 = idf(docFreq=363, maxDocs=44421)
                0.01744028 = queryNorm
              0.5441544 = fieldWeight in 914, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8043137 = idf(docFreq=363, maxDocs=44421)
                0.09375 = fieldNorm(doc=914)
          0.06473795 = weight(abstract_txt:usefulness in 914) [ClassicSimilarity], result of:
            0.06473795 = score(doc=914,freq=1.0), product of:
              0.11415782 = queryWeight, product of:
                1.082107 = boost
                6.0489783 = idf(docFreq=284, maxDocs=44421)
                0.01744028 = queryNorm
              0.5670917 = fieldWeight in 914, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0489783 = idf(docFreq=284, maxDocs=44421)
                0.09375 = fieldNorm(doc=914)
          0.05470142 = weight(abstract_txt:terms in 914) [ClassicSimilarity], result of:
            0.05470142 = score(doc=914,freq=2.0), product of:
              0.10203099 = queryWeight, product of:
                1.4467664 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.01744028 = queryNorm
              0.53612554 = fieldWeight in 914, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.09375 = fieldNorm(doc=914)
          0.1345309 = weight(abstract_txt:queries in 914) [ClassicSimilarity], result of:
            0.1345309 = score(doc=914,freq=3.0), product of:
              0.16239923 = queryWeight, product of:
                1.8252584 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.01744028 = queryNorm
              0.82839614 = fieldWeight in 914, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.09375 = fieldNorm(doc=914)
          0.18127176 = weight(abstract_txt:constraints in 914) [ClassicSimilarity], result of:
            0.18127176 = score(doc=914,freq=1.0), product of:
              0.2857348 = queryWeight, product of:
                2.4211068 = boost
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.01744028 = queryNorm
              0.6344056 = fieldWeight in 914, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.09375 = fieldNorm(doc=914)
          0.54312414 = weight(abstract_txt:metrical in 914) [ClassicSimilarity], result of:
            0.54312414 = score(doc=914,freq=1.0), product of:
              0.59384865 = queryWeight, product of:
                3.4903603 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.01744028 = queryNorm
              0.91458344 = fieldWeight in 914, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.09375 = fieldNorm(doc=914)
        0.28 = coord(7/25)
    
  2. Schlieder, T.; Meuss, H.: Querying and ranking XML documents (2002) 0.09
    0.09315949 = sum of:
      0.09315949 = product of:
        0.38816455 = sum of:
          0.053760894 = weight(abstract_txt:classical in 1459) [ClassicSimilarity], result of:
            0.053760894 = score(doc=1459,freq=1.0), product of:
              0.13216147 = queryWeight, product of:
                1.1643131 = boost
                6.5085106 = idf(docFreq=179, maxDocs=44421)
                0.01744028 = queryNorm
              0.4067819 = fieldWeight in 1459, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5085106 = idf(docFreq=179, maxDocs=44421)
                0.0625 = fieldNorm(doc=1459)
          0.058406614 = weight(abstract_txt:combine in 1459) [ClassicSimilarity], result of:
            0.058406614 = score(doc=1459,freq=1.0), product of:
              0.1396696 = queryWeight, product of:
                1.1969287 = boost
                6.690832 = idf(docFreq=149, maxDocs=44421)
                0.01744028 = queryNorm
              0.418177 = fieldWeight in 1459, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.690832 = idf(docFreq=149, maxDocs=44421)
                0.0625 = fieldNorm(doc=1459)
          0.088613786 = weight(abstract_txt:tree in 1459) [ClassicSimilarity], result of:
            0.088613786 = score(doc=1459,freq=2.0), product of:
              0.14636977 = queryWeight, product of:
                1.2253017 = boost
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.01744028 = queryNorm
              0.60541046 = fieldWeight in 1459, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.0625 = fieldNorm(doc=1459)
          0.03646761 = weight(abstract_txt:terms in 1459) [ClassicSimilarity], result of:
            0.03646761 = score(doc=1459,freq=2.0), product of:
              0.10203099 = queryWeight, product of:
                1.4467664 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.01744028 = queryNorm
              0.35741702 = fieldWeight in 1459, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.0625 = fieldNorm(doc=1459)
          0.047353707 = weight(abstract_txt:documents in 1459) [ClassicSimilarity], result of:
            0.047353707 = score(doc=1459,freq=3.0), product of:
              0.10608796 = queryWeight, product of:
                1.4752493 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.01744028 = queryNorm
              0.4463627 = fieldWeight in 1459, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0625 = fieldNorm(doc=1459)
          0.10356194 = weight(abstract_txt:queries in 1459) [ClassicSimilarity], result of:
            0.10356194 = score(doc=1459,freq=4.0), product of:
              0.16239923 = queryWeight, product of:
                1.8252584 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.01744028 = queryNorm
              0.63769966 = fieldWeight in 1459, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.0625 = fieldNorm(doc=1459)
        0.24 = coord(6/25)
    
  3. Vilares, J.; Alonso, M.A.; Vilares, M.: Extraction of complex index terms in non-English IR : a shallow parsing based approach (2008) 0.08
    0.08350977 = sum of:
      0.08350977 = product of:
        0.29824919 = sum of:
          0.034060977 = weight(abstract_txt:shown in 3107) [ClassicSimilarity], result of:
            0.034060977 = score(doc=3107,freq=1.0), product of:
              0.09749117 = queryWeight, product of:
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.01744028 = queryNorm
              0.349375 = fieldWeight in 3107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.0625 = fieldNorm(doc=3107)
          0.053760894 = weight(abstract_txt:classical in 3107) [ClassicSimilarity], result of:
            0.053760894 = score(doc=3107,freq=1.0), product of:
              0.13216147 = queryWeight, product of:
                1.1643131 = boost
                6.5085106 = idf(docFreq=179, maxDocs=44421)
                0.01744028 = queryNorm
              0.4067819 = fieldWeight in 3107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5085106 = idf(docFreq=179, maxDocs=44421)
                0.0625 = fieldNorm(doc=3107)
          0.01547974 = weight(abstract_txt:systems in 3107) [ClassicSimilarity], result of:
            0.01547974 = score(doc=3107,freq=1.0), product of:
              0.07260719 = queryWeight, product of:
                1.2204561 = boost
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.01744028 = queryNorm
              0.21319844 = fieldWeight in 3107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.0625 = fieldNorm(doc=3107)
          0.03646761 = weight(abstract_txt:terms in 3107) [ClassicSimilarity], result of:
            0.03646761 = score(doc=3107,freq=2.0), product of:
              0.10203099 = queryWeight, product of:
                1.4467664 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.01744028 = queryNorm
              0.35741702 = fieldWeight in 3107, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.0625 = fieldNorm(doc=3107)
          0.03866414 = weight(abstract_txt:documents in 3107) [ClassicSimilarity], result of:
            0.03866414 = score(doc=3107,freq=2.0), product of:
              0.10608796 = queryWeight, product of:
                1.4752493 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.01744028 = queryNorm
              0.3644536 = fieldWeight in 3107, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0625 = fieldNorm(doc=3107)
          0.046586484 = weight(abstract_txt:processing in 3107) [ClassicSimilarity], result of:
            0.046586484 = score(doc=3107,freq=1.0), product of:
              0.15134816 = queryWeight, product of:
                1.762061 = boost
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.01744028 = queryNorm
              0.30781004 = fieldWeight in 3107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.0625 = fieldNorm(doc=3107)
          0.07322934 = weight(abstract_txt:queries in 3107) [ClassicSimilarity], result of:
            0.07322934 = score(doc=3107,freq=2.0), product of:
              0.16239923 = queryWeight, product of:
                1.8252584 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.01744028 = queryNorm
              0.45092174 = fieldWeight in 3107, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.0625 = fieldNorm(doc=3107)
        0.28 = coord(7/25)
    
  4. Pal, S.; Mitra, M.; Kamps, J.: Evaluation effort, reliability and reusability in XML retrieval (2011) 0.08
    0.07685881 = sum of:
      0.07685881 = product of:
        0.4803676 = sum of:
          0.045891803 = weight(abstract_txt:recall in 197) [ClassicSimilarity], result of:
            0.045891803 = score(doc=197,freq=2.0), product of:
              0.10318152 = queryWeight, product of:
                1.0287701 = boost
                5.750825 = idf(docFreq=383, maxDocs=44421)
                0.01744028 = queryNorm
              0.44476765 = fieldWeight in 197, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.750825 = idf(docFreq=383, maxDocs=44421)
                0.0546875 = fieldNorm(doc=197)
          0.027089544 = weight(abstract_txt:systems in 197) [ClassicSimilarity], result of:
            0.027089544 = score(doc=197,freq=4.0), product of:
              0.07260719 = queryWeight, product of:
                1.2204561 = boost
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.01744028 = queryNorm
              0.37309727 = fieldWeight in 197, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.0546875 = fieldNorm(doc=197)
          0.31676954 = weight(abstract_txt:inex in 197) [ClassicSimilarity], result of:
            0.31676954 = score(doc=197,freq=5.0), product of:
              0.27560946 = queryWeight, product of:
                1.6813743 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.01744028 = queryNorm
              1.1493421 = fieldWeight in 197, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0546875 = fieldNorm(doc=197)
          0.090616696 = weight(abstract_txt:queries in 197) [ClassicSimilarity], result of:
            0.090616696 = score(doc=197,freq=4.0), product of:
              0.16239923 = queryWeight, product of:
                1.8252584 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.01744028 = queryNorm
              0.5579872 = fieldWeight in 197, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.0546875 = fieldNorm(doc=197)
        0.16 = coord(4/25)
    
  5. Pérez Pozo, Á.; Rosa, J. de la; Ros, S.; González-Blanco, E.; Hernández, L.; Sisto, M. de: A bridge too far for artificial intelligence? : automatic classification of stanzas in Spanish poetry (2022) 0.08
    0.076465584 = sum of:
      0.076465584 = product of:
        0.4779099 = sum of:
          0.053760894 = weight(abstract_txt:classical in 1469) [ClassicSimilarity], result of:
            0.053760894 = score(doc=1469,freq=1.0), product of:
              0.13216147 = queryWeight, product of:
                1.1643131 = boost
                6.5085106 = idf(docFreq=179, maxDocs=44421)
                0.01744028 = queryNorm
              0.4067819 = fieldWeight in 1469, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5085106 = idf(docFreq=179, maxDocs=44421)
                0.0625 = fieldNorm(doc=1469)
          0.01547974 = weight(abstract_txt:systems in 1469) [ClassicSimilarity], result of:
            0.01547974 = score(doc=1469,freq=1.0), product of:
              0.07260719 = queryWeight, product of:
                1.2204561 = boost
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.01744028 = queryNorm
              0.21319844 = fieldWeight in 1469, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.0625 = fieldNorm(doc=1469)
          0.046586484 = weight(abstract_txt:processing in 1469) [ClassicSimilarity], result of:
            0.046586484 = score(doc=1469,freq=1.0), product of:
              0.15134816 = queryWeight, product of:
                1.762061 = boost
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.01744028 = queryNorm
              0.30781004 = fieldWeight in 1469, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.0625 = fieldNorm(doc=1469)
          0.36208278 = weight(abstract_txt:metrical in 1469) [ClassicSimilarity], result of:
            0.36208278 = score(doc=1469,freq=1.0), product of:
              0.59384865 = queryWeight, product of:
                3.4903603 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.01744028 = queryNorm
              0.6097223 = fieldWeight in 1469, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.0625 = fieldNorm(doc=1469)
        0.16 = coord(4/25)
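
Note on the content scores above: each matching term contributes queryWeight * fieldWeight, the contributions are summed, and the sum is scaled by coord (matched query terms / total query terms). The Python sketch below retraces this for the first content hit (doc=914). It is a hedged illustration only: the function names are hypothetical, the idf formula is the variant that matches the printed figures, and the seven per-term scores are copied from the listing, not recomputed.

```python
import math

def idf(doc_freq, max_docs):
    # Assumed idf variant; it reproduces idf(docFreq=6, maxDocs=44421) = 9.755557.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def clause_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    # queryWeight = boost * idf * queryNorm
    query_weight = boost * idf(doc_freq, max_docs) * query_norm
    # fieldWeight = sqrt(freq) * idf * fieldNorm
    field_weight = math.sqrt(freq) * idf(doc_freq, max_docs) * field_norm
    return query_weight * field_weight

# "metrical" in doc 914: freq=1.0, docFreq=6, boost=3.4903603,
# queryNorm=0.01744028, fieldNorm=0.09375 (all copied from the listing).
metrical = clause_score(1.0, 6, 44421, 3.4903603, 0.01744028, 0.09375)

# Per-term scores for doc 914 as printed in the listing.
clause_scores = [0.051091466, 0.057195988, 0.06473795, 0.05470142,
                 0.1345309, 0.18127176, 0.54312414]

# coord(7/25): 7 of the 25 query terms matched this document.
coord = 7 / 25
total = sum(clause_scores) * coord

print(round(metrical, 4))  # 0.5431
print(round(total, 4))     # 0.3043
```

The rare term "metrical" (docFreq=6) accounts for roughly half of the summed score, which is why this document ranks well ahead of the other content hits.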