Document (#32581)

Author
Niemi, T.
Jämsen, J.
Title
¬A query language for discovering semantic associations, part II : sample queries and query evaluation
Source
Journal of the American Society for Information Science and Technology. 58(2007) no.11, S.1686-1700
Year
2007
Abstract
In our query language introduced in Part I (Journal of the American Society for Information Science and Technology. 58(2007) no.11, S.1559-1568) the user can formulate queries to find out (possibly complex) semantic relationships among entities. In this article we demonstrate the usage of our query language and discuss the new applications that it supports. We categorize several query types and give sample queries. The query types are categorized based on whether the entities specified in a query are known or unknown to the user in advance, and whether text information in documents is utilized. Natural language is used to represent the results of queries in order to facilitate correct interpretation by the user. We discuss briefly the issues related to the prototype implementation of the query language and show that an independent operation like Rho (Sheth et al., 2005; Anyanwu & Sheth, 2002, 2003), which presupposes entities of interest to be known in advance, is exceedingly inefficient in emulating the behavior of our query language. The discussion also covers potential problems, and challenges for future work.
Theme
Computerlinguistik
Semantisches Umfeld in Indexierung u. Retrieval

Similar documents (author)

  1. Niemi, T.; Jämsen , J.: ¬A query language for discovering semantic associations, part I : approach and formal definition of query primitives (2007) 4.70
    4.6994414 = sum of:
      4.6994414 = weight(author_txt:niemi in 1591) [ClassicSimilarity], result of:
        4.6994414 = fieldWeight in 1591, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.398883 = idf(docFreq=9, maxDocs=44421)
          0.5 = fieldNorm(doc=1591)
    
  2. Järvelin, K.; Niemi, T.: Deductive information retrieval based on classifications (1993) 4.70
    4.6994414 = sum of:
      4.6994414 = weight(author_txt:niemi in 3229) [ClassicSimilarity], result of:
        4.6994414 = fieldWeight in 3229, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.398883 = idf(docFreq=9, maxDocs=44421)
          0.5 = fieldNorm(doc=3229)
    
  3. Niemi, T.; Hirvonen, L.; Järvelin, K.: Multidimensional data model and query language for informetrics (2003) 3.52
    3.524581 = sum of:
      3.524581 = weight(author_txt:niemi in 2753) [ClassicSimilarity], result of:
        3.524581 = fieldWeight in 2753, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.398883 = idf(docFreq=9, maxDocs=44421)
          0.375 = fieldNorm(doc=2753)
    
  4. Järvelin, K.; Ingwersen, P.; Niemi, T.: ¬A user-oriented interface for generalised informetric analysis based on applying advanced data modelling techniques (2000) 3.52
    3.524581 = sum of:
      3.524581 = weight(author_txt:niemi in 5545) [ClassicSimilarity], result of:
        3.524581 = fieldWeight in 5545, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.398883 = idf(docFreq=9, maxDocs=44421)
          0.375 = fieldNorm(doc=5545)
    
  5. Näppilä, T.; Järvelin, K.; Niemi, T.: ¬A tool for data cube construction from structurally heterogeneous XML documents (2008) 3.52
    3.524581 = sum of:
      3.524581 = weight(author_txt:niemi in 2369) [ClassicSimilarity], result of:
        3.524581 = fieldWeight in 2369, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.398883 = idf(docFreq=9, maxDocs=44421)
          0.375 = fieldNorm(doc=2369)
    

Similar documents (content)

  1. Niemi, T.; Jämsen , J.: ¬A query language for discovering semantic associations, part I : approach and formal definition of query primitives (2007) 0.31
    0.3056445 = sum of:
      0.3056445 = product of:
        0.95513916 = sum of:
          0.08512002 = weight(abstract_txt:unknown in 1591) [ClassicSimilarity], result of:
            0.08512002 = score(doc=1591,freq=2.0), product of:
              0.13372242 = queryWeight, product of:
                1.074192 = boost
                7.201658 = idf(docFreq=89, maxDocs=44421)
                0.017285815 = queryNorm
              0.6365426 = fieldWeight in 1591, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.201658 = idf(docFreq=89, maxDocs=44421)
                0.0625 = fieldNorm(doc=1591)
          0.06633242 = weight(abstract_txt:discovering in 1591) [ClassicSimilarity], result of:
            0.06633242 = score(doc=1591,freq=1.0), product of:
              0.14267361 = queryWeight, product of:
                1.1095622 = boost
                7.438788 = idf(docFreq=70, maxDocs=44421)
                0.017285815 = queryNorm
              0.46492425 = fieldWeight in 1591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.438788 = idf(docFreq=70, maxDocs=44421)
                0.0625 = fieldNorm(doc=1591)
          0.070724145 = weight(abstract_txt:semantic in 1591) [ClassicSimilarity], result of:
            0.070724145 = score(doc=1591,freq=6.0), product of:
              0.10324392 = queryWeight, product of:
                1.3348334 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.017285815 = queryNorm
              0.68501997 = fieldWeight in 1591, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.0625 = fieldNorm(doc=1591)
          0.045321465 = weight(abstract_txt:known in 1591) [ClassicSimilarity], result of:
            0.045321465 = score(doc=1591,freq=1.0), product of:
              0.13944589 = queryWeight, product of:
                1.5513067 = boost
                5.200178 = idf(docFreq=665, maxDocs=44421)
                0.017285815 = queryNorm
              0.32501113 = fieldWeight in 1591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.200178 = idf(docFreq=665, maxDocs=44421)
                0.0625 = fieldNorm(doc=1591)
          0.041759193 = weight(abstract_txt:user in 1591) [ClassicSimilarity], result of:
            0.041759193 = score(doc=1591,freq=3.0), product of:
              0.10479997 = queryWeight, product of:
                1.647104 = boost
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.017285815 = queryNorm
              0.3984657 = fieldWeight in 1591, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.0625 = fieldNorm(doc=1591)
          0.23491289 = weight(abstract_txt:entities in 1591) [ClassicSimilarity], result of:
            0.23491289 = score(doc=1591,freq=6.0), product of:
              0.2631001 = queryWeight, product of:
                2.6097622 = boost
                5.8321705 = idf(docFreq=353, maxDocs=44421)
                0.017285815 = queryNorm
              0.8928651 = fieldWeight in 1591, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.8321705 = idf(docFreq=353, maxDocs=44421)
                0.0625 = fieldNorm(doc=1591)
          0.09922069 = weight(abstract_txt:language in 1591) [ClassicSimilarity], result of:
            0.09922069 = score(doc=1591,freq=2.0), product of:
              0.26913387 = queryWeight, product of:
                3.732842 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.017285815 = queryNorm
              0.3686667 = fieldWeight in 1591, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.0625 = fieldNorm(doc=1591)
          0.31174836 = weight(abstract_txt:query in 1591) [ClassicSimilarity], result of:
            0.31174836 = score(doc=1591,freq=4.0), product of:
              0.52455384 = queryWeight, product of:
                6.3825774 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.017285815 = queryNorm
              0.5943115 = fieldWeight in 1591, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.0625 = fieldNorm(doc=1591)
        0.32 = coord(8/25)
    
  2. Owei, V.; Higa, K.: ¬A paradigm for natural language explanation of database queries : a semantic data model approach (1994) 0.24
    0.24341992 = sum of:
      0.24341992 = product of:
        1.0142497 = sum of:
          0.17028293 = weight(abstract_txt:specified in 8188) [ClassicSimilarity], result of:
            0.17028293 = score(doc=8188,freq=3.0), product of:
              0.12771486 = queryWeight, product of:
                1.0497854 = boost
                7.0380287 = idf(docFreq=105, maxDocs=44421)
                0.017285815 = queryNorm
              1.3333056 = fieldWeight in 8188, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.0380287 = idf(docFreq=105, maxDocs=44421)
                0.109375 = fieldNorm(doc=8188)
          0.05052777 = weight(abstract_txt:semantic in 8188) [ClassicSimilarity], result of:
            0.05052777 = score(doc=8188,freq=1.0), product of:
              0.10324392 = queryWeight, product of:
                1.3348334 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.017285815 = queryNorm
              0.4894019 = fieldWeight in 8188, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.109375 = fieldNorm(doc=8188)
          0.07307859 = weight(abstract_txt:user in 8188) [ClassicSimilarity], result of:
            0.07307859 = score(doc=8188,freq=3.0), product of:
              0.10479997 = queryWeight, product of:
                1.647104 = boost
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.017285815 = queryNorm
              0.697315 = fieldWeight in 8188, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.109375 = fieldNorm(doc=8188)
          0.21181215 = weight(abstract_txt:queries in 8188) [ClassicSimilarity], result of:
            0.21181215 = score(doc=8188,freq=2.0), product of:
              0.268418 = queryWeight, product of:
                3.0437965 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.017285815 = queryNorm
              0.78911304 = fieldWeight in 8188, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.109375 = fieldNorm(doc=8188)
          0.12277935 = weight(abstract_txt:language in 8188) [ClassicSimilarity], result of:
            0.12277935 = score(doc=8188,freq=1.0), product of:
              0.26913387 = queryWeight, product of:
                3.732842 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.017285815 = queryNorm
              0.45620176 = fieldWeight in 8188, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.109375 = fieldNorm(doc=8188)
          0.38576892 = weight(abstract_txt:query in 8188) [ClassicSimilarity], result of:
            0.38576892 = score(doc=8188,freq=2.0), product of:
              0.52455384 = queryWeight, product of:
                6.3825774 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.017285815 = queryNorm
              0.7354229 = fieldWeight in 8188, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.109375 = fieldNorm(doc=8188)
        0.24 = coord(6/25)
    
  3. Airio, E.: Who benefits from CLIR in web retrieval? (2008) 0.21
    0.20904642 = sum of:
      0.20904642 = product of:
        0.87102675 = sum of:
          0.052879136 = weight(abstract_txt:utilized in 3342) [ClassicSimilarity], result of:
            0.052879136 = score(doc=3342,freq=1.0), product of:
              0.12266368 = queryWeight, product of:
                1.0288162 = boost
                6.8974466 = idf(docFreq=121, maxDocs=44421)
                0.017285815 = queryNorm
              0.4310904 = fieldWeight in 3342, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8974466 = idf(docFreq=121, maxDocs=44421)
                0.0625 = fieldNorm(doc=3342)
          0.06286949 = weight(abstract_txt:formulate in 3342) [ClassicSimilarity], result of:
            0.06286949 = score(doc=3342,freq=1.0), product of:
              0.13766378 = queryWeight, product of:
                1.0899075 = boost
                7.3070183 = idf(docFreq=80, maxDocs=44421)
                0.017285815 = queryNorm
              0.45668864 = fieldWeight in 3342, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3070183 = idf(docFreq=80, maxDocs=44421)
                0.0625 = fieldNorm(doc=3342)
          0.03663104 = weight(abstract_txt:whether in 3342) [ClassicSimilarity], result of:
            0.03663104 = score(doc=3342,freq=1.0), product of:
              0.12099551 = queryWeight, product of:
                1.4450386 = boost
                4.8439536 = idf(docFreq=950, maxDocs=44421)
                0.017285815 = queryNorm
              0.3027471 = fieldWeight in 3342, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8439536 = idf(docFreq=950, maxDocs=44421)
                0.0625 = fieldNorm(doc=3342)
          0.14823763 = weight(abstract_txt:queries in 3342) [ClassicSimilarity], result of:
            0.14823763 = score(doc=3342,freq=3.0), product of:
              0.268418 = queryWeight, product of:
                3.0437965 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.017285815 = queryNorm
              0.5522641 = fieldWeight in 3342, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.0625 = fieldNorm(doc=3342)
          0.22186422 = weight(abstract_txt:language in 3342) [ClassicSimilarity], result of:
            0.22186422 = score(doc=3342,freq=10.0), product of:
              0.26913387 = queryWeight, product of:
                3.732842 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.017285815 = queryNorm
              0.8243638 = fieldWeight in 3342, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.0625 = fieldNorm(doc=3342)
          0.34854525 = weight(abstract_txt:query in 3342) [ClassicSimilarity], result of:
            0.34854525 = score(doc=3342,freq=5.0), product of:
              0.52455384 = queryWeight, product of:
                6.3825774 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.017285815 = queryNorm
              0.6644604 = fieldWeight in 3342, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.0625 = fieldNorm(doc=3342)
        0.24 = coord(6/25)
    
  4. Lewandowski, D.: Evaluating the retrieval effectiveness of web search engines using a representative query sample (2015) 0.21
    0.20829177 = sum of:
      0.20829177 = product of:
        0.8678824 = sum of:
          0.10513361 = weight(abstract_txt:correct in 3157) [ClassicSimilarity], result of:
            0.10513361 = score(doc=3157,freq=3.0), product of:
              0.11588851 = queryWeight, product of:
                6.704255 = idf(docFreq=147, maxDocs=44421)
                0.017285815 = queryNorm
              0.9071961 = fieldWeight in 3157, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.704255 = idf(docFreq=147, maxDocs=44421)
                0.078125 = fieldNorm(doc=3157)
          0.0358642 = weight(abstract_txt:types in 3157) [ClassicSimilarity], result of:
            0.0358642 = score(doc=3157,freq=1.0), product of:
              0.102810435 = queryWeight, product of:
                1.3320282 = boost
                4.4651284 = idf(docFreq=1388, maxDocs=44421)
                0.017285815 = queryNorm
              0.34883815 = fieldWeight in 3157, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4651284 = idf(docFreq=1388, maxDocs=44421)
                0.078125 = fieldNorm(doc=3157)
          0.030137103 = weight(abstract_txt:user in 3157) [ClassicSimilarity], result of:
            0.030137103 = score(doc=3157,freq=1.0), product of:
              0.10479997 = queryWeight, product of:
                1.647104 = boost
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.017285815 = queryNorm
              0.28756785 = fieldWeight in 3157, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.078125 = fieldNorm(doc=3157)
          0.09722033 = weight(abstract_txt:sample in 3157) [ClassicSimilarity], result of:
            0.09722033 = score(doc=3157,freq=2.0), product of:
              0.1586443 = queryWeight, product of:
                1.6546534 = boost
                5.5466094 = idf(docFreq=470, maxDocs=44421)
                0.017285815 = queryNorm
              0.61281955 = fieldWeight in 3157, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5466094 = idf(docFreq=470, maxDocs=44421)
                0.078125 = fieldNorm(doc=3157)
          0.26204962 = weight(abstract_txt:queries in 3157) [ClassicSimilarity], result of:
            0.26204962 = score(doc=3157,freq=6.0), product of:
              0.268418 = queryWeight, product of:
                3.0437965 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.017285815 = queryNorm
              0.9762743 = fieldWeight in 3157, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.078125 = fieldNorm(doc=3157)
          0.33747754 = weight(abstract_txt:query in 3157) [ClassicSimilarity], result of:
            0.33747754 = score(doc=3157,freq=3.0), product of:
              0.52455384 = queryWeight, product of:
                6.3825774 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.017285815 = queryNorm
              0.6433611 = fieldWeight in 3157, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.078125 = fieldNorm(doc=3157)
        0.24 = coord(6/25)
    
  5. Rozinajová, V.; Macko, P.: Using natural language to search linked data (2017) 0.20
    0.20237556 = sum of:
      0.20237556 = product of:
        0.8432315 = sum of:
          0.0358642 = weight(abstract_txt:types in 4488) [ClassicSimilarity], result of:
            0.0358642 = score(doc=4488,freq=1.0), product of:
              0.102810435 = queryWeight, product of:
                1.3320282 = boost
                4.4651284 = idf(docFreq=1388, maxDocs=44421)
                0.017285815 = queryNorm
              0.34883815 = fieldWeight in 4488, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4651284 = idf(docFreq=1388, maxDocs=44421)
                0.078125 = fieldNorm(doc=4488)
          0.036091264 = weight(abstract_txt:semantic in 4488) [ClassicSimilarity], result of:
            0.036091264 = score(doc=4488,freq=1.0), product of:
              0.10324392 = queryWeight, product of:
                1.3348334 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.017285815 = queryNorm
              0.34957278 = fieldWeight in 4488, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.078125 = fieldNorm(doc=4488)
          0.060274206 = weight(abstract_txt:user in 4488) [ClassicSimilarity], result of:
            0.060274206 = score(doc=4488,freq=4.0), product of:
              0.10479997 = queryWeight, product of:
                1.647104 = boost
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.017285815 = queryNorm
              0.5751357 = fieldWeight in 4488, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.078125 = fieldNorm(doc=4488)
          0.1512944 = weight(abstract_txt:queries in 4488) [ClassicSimilarity], result of:
            0.1512944 = score(doc=4488,freq=2.0), product of:
              0.268418 = queryWeight, product of:
                3.0437965 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.017285815 = queryNorm
              0.56365216 = fieldWeight in 4488, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.078125 = fieldNorm(doc=4488)
          0.12402587 = weight(abstract_txt:language in 4488) [ClassicSimilarity], result of:
            0.12402587 = score(doc=4488,freq=2.0), product of:
              0.26913387 = queryWeight, product of:
                3.732842 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.017285815 = queryNorm
              0.46083337 = fieldWeight in 4488, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.078125 = fieldNorm(doc=4488)
          0.43568158 = weight(abstract_txt:query in 4488) [ClassicSimilarity], result of:
            0.43568158 = score(doc=4488,freq=5.0), product of:
              0.52455384 = queryWeight, product of:
                6.3825774 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.017285815 = queryNorm
              0.8305755 = fieldWeight in 4488, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.078125 = fieldNorm(doc=4488)
        0.24 = coord(6/25)