Document (#10114)

Author
Yannakoudakis, E.J.
Daraki, J.J.
Title
Lexical clustering and retrieval of bibliographic records
Source
Information retrieval: new systems and current research. Proceedings of the 15th Research Colloquium of the British Computer Society Information Retrieval Specialist Group, Glasgow 1993. Ed.: Ruben Leon
Imprint
London : Taylor Graham
Year
1994
Pages
pp. 137-149
Abstract
Presents a new system that enables users to retrieve catalogue entries on the basis of their lexical similarities and to cluster records dynamically. Describes the information retrieval system developed by the Department of Informatics, Athens University of Economics and Business, Greece. The system also offers the means for cyclic retrieval of records from each cluster, while allowing the user to define the field to be used in each case. The approach is based on logical keys, which are derived from pertinent bibliographic fields and are used for all clustering and information retrieval functions.
Theme
Computational linguistics

Similar documents (content)

  1. Leazer, G.H.: A conceptual schema for the control of bibliographic works (1994) 0.15
    0.15174119 = sum of:
      0.15174119 = product of:
        0.47419125 = sum of:
          0.0496559 = weight(abstract_txt:retrieve in 3101) [ClassicSimilarity], result of:
            0.0496559 = score(doc=3101,freq=1.0), product of:
              0.13209668 = queryWeight, product of:
                1.0434705 = boost
                6.014492 = idf(docFreq=294, maxDocs=44421)
                0.021048093 = queryNorm
              0.37590575 = fieldWeight in 3101, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.014492 = idf(docFreq=294, maxDocs=44421)
                0.0625 = fieldNorm(doc=3101)
          0.052652504 = weight(abstract_txt:enables in 3101) [ClassicSimilarity], result of:
            0.052652504 = score(doc=3101,freq=1.0), product of:
              0.13735907 = queryWeight, product of:
                1.0640521 = boost
                6.133123 = idf(docFreq=261, maxDocs=44421)
                0.021048093 = queryNorm
              0.38332018 = fieldWeight in 3101, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.133123 = idf(docFreq=261, maxDocs=44421)
                0.0625 = fieldNorm(doc=3101)
          0.053455215 = weight(abstract_txt:logical in 3101) [ClassicSimilarity], result of:
            0.053455215 = score(doc=3101,freq=1.0), product of:
              0.13875161 = queryWeight, product of:
                1.0694321 = boost
                6.1641335 = idf(docFreq=253, maxDocs=44421)
                0.021048093 = queryNorm
              0.38525835 = fieldWeight in 3101, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1641335 = idf(docFreq=253, maxDocs=44421)
                0.0625 = fieldNorm(doc=3101)
          0.017271725 = weight(abstract_txt:used in 3101) [ClassicSimilarity], result of:
            0.017271725 = score(doc=3101,freq=1.0), product of:
              0.0823149 = queryWeight, product of:
                1.1648995 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.021048093 = queryNorm
              0.20982501 = fieldWeight in 3101, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.0625 = fieldNorm(doc=3101)
          0.045069914 = weight(abstract_txt:each in 3101) [ClassicSimilarity], result of:
            0.045069914 = score(doc=3101,freq=2.0), product of:
              0.12383284 = queryWeight, product of:
                1.4287858 = boost
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.021048093 = queryNorm
              0.3639577 = fieldWeight in 3101, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.0625 = fieldNorm(doc=3101)
          0.13386846 = weight(abstract_txt:bibliographic in 3101) [ClassicSimilarity], result of:
            0.13386846 = score(doc=3101,freq=15.0), product of:
              0.1307203 = queryWeight, product of:
                1.467982 = boost
                4.230674 = idf(docFreq=1755, maxDocs=44421)
                0.021048093 = queryNorm
              1.0240831 = fieldWeight in 3101, product of:
                3.8729835 = tf(freq=15.0), with freq of:
                  15.0 = termFreq=15.0
                4.230674 = idf(docFreq=1755, maxDocs=44421)
                0.0625 = fieldNorm(doc=3101)
          0.045500692 = weight(abstract_txt:system in 3101) [ClassicSimilarity], result of:
            0.045500692 = score(doc=3101,freq=3.0), product of:
              0.12462064 = queryWeight, product of:
                1.7554556 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.021048093 = queryNorm
              0.36511362 = fieldWeight in 3101, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.0625 = fieldNorm(doc=3101)
          0.07671684 = weight(abstract_txt:retrieval in 3101) [ClassicSimilarity], result of:
            0.07671684 = score(doc=3101,freq=4.0), product of:
              0.1765381 = queryWeight, product of:
                2.4125896 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.021048093 = queryNorm
              0.4345625 = fieldWeight in 3101, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=3101)
        0.32 = coord(8/25)
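
The per-term figures in the explanation above can be reproduced by hand. The following is a minimal Python sketch, assuming the usual ClassicSimilarity definitions (tf as the square root of the term frequency, idf as 1 + ln(maxDocs / (docFreq + 1))); the function names are illustrative and are not part of Lucene. The input values are copied from the entries for "retrieve" and "bibliographic" in doc 3101. Small differences in the last digits are to be expected, since Lucene works in single precision.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq: float) -> float:
    """Term frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def term_score(boost, freq, doc_freq, max_docs, query_norm, field_norm):
    """Score of one term = queryWeight * fieldWeight, as in the explanation."""
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm             # query side
    field_weight = classic_tf(freq) * idf * field_norm  # document side
    return query_weight * field_weight


MAX_DOCS = 44421
QUERY_NORM = 0.021048093
FIELD_NORM_3101 = 0.0625

# "retrieve" in doc 3101: freq=1, docFreq=294
retrieve = term_score(1.0434705, 1.0, 294, MAX_DOCS, QUERY_NORM, FIELD_NORM_3101)

# "bibliographic" in doc 3101: freq=15, docFreq=1755
bibliographic = term_score(1.467982, 15.0, 1755, MAX_DOCS, QUERY_NORM, FIELD_NORM_3101)

print(f"retrieve:      {retrieve:.7f}")
print(f"bibliographic: {bibliographic:.7f}")
```

Because idf enters both queryWeight and fieldWeight, each term score grows with the square of idf, which is why rare terms dominate the ranking even at low frequency.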
    
  2. Dunlavy, D.M.; O'Leary, D.P.; Conroy, J.M.; Schlesinger, J.D.: QCS: A system for querying, clustering and summarizing documents (2007) 0.13
    0.12796439 = sum of:
      0.12796439 = product of:
        0.533185 = sum of:
          0.026176067 = weight(abstract_txt:used in 1947) [ClassicSimilarity], result of:
            0.026176067 = score(doc=1947,freq=3.0), product of:
              0.0823149 = queryWeight, product of:
                1.1648995 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.021048093 = queryNorm
              0.31799912 = fieldWeight in 1947, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1947)
          0.062354077 = weight(abstract_txt:each in 1947) [ClassicSimilarity], result of:
            0.062354077 = score(doc=1947,freq=5.0), product of:
              0.12383284 = queryWeight, product of:
                1.4287858 = boost
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.021048093 = queryNorm
              0.50353426 = fieldWeight in 1947, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1947)
          0.05630424 = weight(abstract_txt:system in 1947) [ClassicSimilarity], result of:
            0.05630424 = score(doc=1947,freq=6.0), product of:
              0.12462064 = queryWeight, product of:
                1.7554556 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.021048093 = queryNorm
              0.45180508 = fieldWeight in 1947, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1947)
          0.13597897 = weight(abstract_txt:clustering in 1947) [ClassicSimilarity], result of:
            0.13597897 = score(doc=1947,freq=2.0), product of:
              0.28263143 = queryWeight, product of:
                2.1585367 = boost
                6.2208285 = idf(docFreq=239, maxDocs=44421)
                0.021048093 = queryNorm
              0.48111764 = fieldWeight in 1947, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2208285 = idf(docFreq=239, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1947)
          0.19423775 = weight(abstract_txt:cluster in 1947) [ClassicSimilarity], result of:
            0.19423775 = score(doc=1947,freq=3.0), product of:
              0.3131588 = queryWeight, product of:
                2.2721214 = boost
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.021048093 = queryNorm
              0.6202532 = fieldWeight in 1947, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1947)
          0.058133885 = weight(abstract_txt:retrieval in 1947) [ClassicSimilarity], result of:
            0.058133885 = score(doc=1947,freq=3.0), product of:
              0.1765381 = queryWeight, product of:
                2.4125896 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.021048093 = queryNorm
              0.3292994 = fieldWeight in 1947, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1947)
        0.24 = coord(6/25)
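
The last step of each explanation, the coordination factor, can be checked in the same way. This sketch sums the six per-term scores listed for doc 1947 and multiplies the sum by coord(6/25), the fraction of the 25 query terms that matched. The variable names are illustrative; the numbers are copied from the explanation above.

```python
# Per-term scores for doc 1947, copied from the explanation.
term_scores = {
    "used": 0.026176067,
    "each": 0.062354077,
    "system": 0.05630424,
    "clustering": 0.13597897,
    "cluster": 0.19423775,
    "retrieval": 0.058133885,
}

TOTAL_QUERY_TERMS = 25  # denominator shown in coord(6/25)


def coord(matched: int, total: int) -> float:
    """Fraction of query terms found in the document."""
    return matched / total


raw_sum = sum(term_scores.values())
final_score = raw_sum * coord(len(term_scores), TOTAL_QUERY_TERMS)

print(f"sum of term scores: {raw_sum:.6f}")
print(f"coord factor:       {coord(len(term_scores), TOTAL_QUERY_TERMS):.2f}")
print(f"final score:        {final_score:.6f}")
```

The factor rewards documents that match more of the query: doc 3101 matches 8 of 25 terms and is ranked first even though its raw sum (0.47419125) is lower than the raw sum for doc 1947.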
    
  3. Pao, M.L.: Retrieval differences between term and citation indexing (1989) 0.12
    0.12393977 = sum of:
      0.12393977 = product of:
        0.5164157 = sum of:
          0.08689783 = weight(abstract_txt:retrieve in 3634) [ClassicSimilarity], result of:
            0.08689783 = score(doc=3634,freq=1.0), product of:
              0.13209668 = queryWeight, product of:
                1.0434705 = boost
                6.014492 = idf(docFreq=294, maxDocs=44421)
                0.021048093 = queryNorm
              0.65783507 = fieldWeight in 3634, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.014492 = idf(docFreq=294, maxDocs=44421)
                0.109375 = fieldNorm(doc=3634)
          0.030225517 = weight(abstract_txt:used in 3634) [ClassicSimilarity], result of:
            0.030225517 = score(doc=3634,freq=1.0), product of:
              0.0823149 = queryWeight, product of:
                1.1648995 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.021048093 = queryNorm
              0.36719376 = fieldWeight in 3634, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.109375 = fieldNorm(doc=3634)
          0.21590579 = weight(abstract_txt:keys in 3634) [ClassicSimilarity], result of:
            0.21590579 = score(doc=3634,freq=1.0), product of:
              0.24232346 = queryWeight, product of:
                1.4132922 = boost
                8.146119 = idf(docFreq=34, maxDocs=44421)
                0.021048093 = queryNorm
              0.8909818 = fieldWeight in 3634, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.146119 = idf(docFreq=34, maxDocs=44421)
                0.109375 = fieldNorm(doc=3634)
          0.055771176 = weight(abstract_txt:each in 3634) [ClassicSimilarity], result of:
            0.055771176 = score(doc=3634,freq=1.0), product of:
              0.12383284 = queryWeight, product of:
                1.4287858 = boost
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.021048093 = queryNorm
              0.4503747 = fieldWeight in 3634, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.109375 = fieldNorm(doc=3634)
          0.060488198 = weight(abstract_txt:bibliographic in 3634) [ClassicSimilarity], result of:
            0.060488198 = score(doc=3634,freq=1.0), product of:
              0.1307203 = queryWeight, product of:
                1.467982 = boost
                4.230674 = idf(docFreq=1755, maxDocs=44421)
                0.021048093 = queryNorm
              0.46272993 = fieldWeight in 3634, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.230674 = idf(docFreq=1755, maxDocs=44421)
                0.109375 = fieldNorm(doc=3634)
          0.067127235 = weight(abstract_txt:retrieval in 3634) [ClassicSimilarity], result of:
            0.067127235 = score(doc=3634,freq=1.0), product of:
              0.1765381 = queryWeight, product of:
                2.4125896 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.021048093 = queryNorm
              0.3802422 = fieldWeight in 3634, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.109375 = fieldNorm(doc=3634)
        0.24 = coord(6/25)
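
The term "retrieve" occurs once in both doc 3101 and doc 3634, with the same queryWeight and idf, yet the two scores differ. The only factor that changes is fieldNorm. The sketch below isolates that effect. It assumes, based on general ClassicSimilarity behaviour rather than anything stated in this record, that fieldNorm is a length normalization stored at reduced precision, so that shorter abstracts receive larger values. The helper name is illustrative.

```python
import math

QUERY_WEIGHT = 0.13209668  # queryWeight for "retrieve", from the explanation
IDF = 6.014492             # idf(docFreq=294, maxDocs=44421)

# fieldNorm values reported for the two documents.
FIELD_NORMS = {3101: 0.0625, 3634: 0.109375}


def field_weight(freq: float, idf: float, field_norm: float) -> float:
    """Document-side weight: tf * idf * fieldNorm, with tf = sqrt(freq)."""
    return math.sqrt(freq) * idf * field_norm


scores = {
    doc_id: QUERY_WEIGHT * field_weight(1.0, IDF, norm)
    for doc_id, norm in FIELD_NORMS.items()
}

# With tf, idf and queryWeight equal, the score ratio equals the norm ratio.
norm_ratio = FIELD_NORMS[3634] / FIELD_NORMS[3101]
score_ratio = scores[3634] / scores[3101]

for doc_id, score in scores.items():
    print(f"doc {doc_id}: {score:.7f}")
print(f"norm ratio:  {norm_ratio}")
print(f"score ratio: {score_ratio:.4f}")
```

Under that assumption, a single match in a short abstract (doc 3634) counts for more than the same match in a longer one (doc 3101).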
    
  4. Huang, L.; Milne, D.; Frank, E.; Witten, I.H.: Learning a concept-based document similarity measure (2012) 0.12
    0.12023818 = sum of:
      0.12023818 = product of:
        0.60119087 = sum of:
          0.039836556 = weight(abstract_txt:each in 1372) [ClassicSimilarity], result of:
            0.039836556 = score(doc=1372,freq=1.0), product of:
              0.12383284 = queryWeight, product of:
                1.4287858 = boost
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.021048093 = queryNorm
              0.32169622 = fieldWeight in 1372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.078125 = fieldNorm(doc=1372)
          0.19425566 = weight(abstract_txt:clustering in 1372) [ClassicSimilarity], result of:
            0.19425566 = score(doc=1372,freq=2.0), product of:
              0.28263143 = queryWeight, product of:
                2.1585367 = boost
                6.2208285 = idf(docFreq=239, maxDocs=44421)
                0.021048093 = queryNorm
              0.68731093 = fieldWeight in 1372, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2208285 = idf(docFreq=239, maxDocs=44421)
                0.078125 = fieldNorm(doc=1372)
          0.15894605 = weight(abstract_txt:lexical in 1372) [ClassicSimilarity], result of:
            0.15894605 = score(doc=1372,freq=1.0), product of:
              0.31151655 = queryWeight, product of:
                2.266156 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.021048093 = queryNorm
              0.5102331 = fieldWeight in 1372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.078125 = fieldNorm(doc=1372)
          0.1602046 = weight(abstract_txt:cluster in 1372) [ClassicSimilarity], result of:
            0.1602046 = score(doc=1372,freq=1.0), product of:
              0.3131588 = queryWeight, product of:
                2.2721214 = boost
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.021048093 = queryNorm
              0.51157624 = fieldWeight in 1372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.078125 = fieldNorm(doc=1372)
          0.04794802 = weight(abstract_txt:retrieval in 1372) [ClassicSimilarity], result of:
            0.04794802 = score(doc=1372,freq=1.0), product of:
              0.1765381 = queryWeight, product of:
                2.4125896 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.021048093 = queryNorm
              0.27160156 = fieldWeight in 1372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=1372)
        0.2 = coord(5/25)
    
  5. Evens, M.: Thesaural relations in information retrieval (2002) 0.11
    0.113383375 = sum of:
      0.113383375 = product of:
        0.5669169 = sum of:
          0.044873256 = weight(abstract_txt:used in 2201) [ClassicSimilarity], result of:
            0.044873256 = score(doc=2201,freq=3.0), product of:
              0.0823149 = queryWeight, product of:
                1.1648995 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.021048093 = queryNorm
              0.54514134 = fieldWeight in 2201, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.09375 = fieldNorm(doc=2201)
          0.039404754 = weight(abstract_txt:system in 2201) [ClassicSimilarity], result of:
            0.039404754 = score(doc=2201,freq=1.0), product of:
              0.12462064 = queryWeight, product of:
                1.7554556 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.021048093 = queryNorm
              0.31619766 = fieldWeight in 2201, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.09375 = fieldNorm(doc=2201)
          0.19073527 = weight(abstract_txt:lexical in 2201) [ClassicSimilarity], result of:
            0.19073527 = score(doc=2201,freq=1.0), product of:
              0.31151655 = queryWeight, product of:
                2.266156 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.021048093 = queryNorm
              0.6122797 = fieldWeight in 2201, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.09375 = fieldNorm(doc=2201)
          0.19224553 = weight(abstract_txt:cluster in 2201) [ClassicSimilarity], result of:
            0.19224553 = score(doc=2201,freq=1.0), product of:
              0.3131588 = queryWeight, product of:
                2.2721214 = boost
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.021048093 = queryNorm
              0.6138915 = fieldWeight in 2201, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.09375 = fieldNorm(doc=2201)
          0.09965809 = weight(abstract_txt:retrieval in 2201) [ClassicSimilarity], result of:
            0.09965809 = score(doc=2201,freq=3.0), product of:
              0.1765381 = queryWeight, product of:
                2.4125896 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.021048093 = queryNorm
              0.5645132 = fieldWeight in 2201, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.09375 = fieldNorm(doc=2201)
        0.2 = coord(5/25)