Document (#30977)

Author
Pfister, J.
Title
Clustering von Patent-Dokumenten am Beispiel der Datenbanken des Fachinformationszentrums Karlsruhe
Source
Effektive Information Retrieval Verfahren in Theorie und Praxis: ausgewählte und erweiterte Beiträge des Vierten Hildesheimer Evaluierungs- und Retrievalworkshop (HIER 2005), Hildesheim, 20.7.2005. Eds.: T. Mandl and C. Womser-Hacker
Imprint
Konstanz : UVK Verlagsgesellschaft
Year
2006
Pages
pp. 129-146
Series
Schriften zur Informationswissenschaft; vol. 45
Abstract
This article, situated in the application area of patent searching and patent information, examines the automatic grouping of patent documents (so-called clustering) as a tool for organizing the result set of a database query. The focus is on evaluating three clustering methods by means of user ratings.
Theme
Automatic classification
Field
Patent information

Similar documents (author)

  1. Pfister, D. Schmidt- => Schmidt-Pfister, D.: 4.98
    4.9845104 = sum of:
      4.9845104 = weight(author_txt:pfister in 6982) [ClassicSimilarity], result of:
        4.9845104 = fieldWeight in 6982, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          9.398883 = idf(docFreq=9, maxDocs=44421)
          0.375 = fieldNorm(doc=6982)
    
  2. Pfister, R.-D.: Ware oder öffentliches Gut? : Über den Charakter von Information; am Beispiel Internet (1994) 4.70
    4.6994414 = sum of:
      4.6994414 = weight(author_txt:pfister in 127) [ClassicSimilarity], result of:
        4.6994414 = fieldWeight in 127, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.398883 = idf(docFreq=9, maxDocs=44421)
          0.5 = fieldNorm(doc=127)
    
  3. Pfister, R.-D.: Neue Produkte auf der Basis von Multimedia (1995) 4.70
    4.6994414 = sum of:
      4.6994414 = weight(author_txt:pfister in 1459) [ClassicSimilarity], result of:
        4.6994414 = fieldWeight in 1459, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.398883 = idf(docFreq=9, maxDocs=44421)
          0.5 = fieldNorm(doc=1459)
    
  4. Pfister, H.-R.: Eröffnung des CSCL-Kompetenzzentrums am GMD-IPSI in Darmstadt : Kooperatives computerunterstütztes Lernen (CSCL) - Was ist das und wozu nützt es? (2000) 4.70
    4.6994414 = sum of:
      4.6994414 = weight(author_txt:pfister in 5833) [ClassicSimilarity], result of:
        4.6994414 = fieldWeight in 5833, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.398883 = idf(docFreq=9, maxDocs=44421)
          0.5 = fieldNorm(doc=5833)
    
  5. Hangel, N.; Schmidt-Pfister, D.: Why do you publish? : on the tensions between generating scientific knowledge and publication pressure (2017) 4.11
    4.1120114 = sum of:
      4.1120114 = weight(author_txt:pfister in 54) [ClassicSimilarity], result of:
        4.1120114 = fieldWeight in 54, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.398883 = idf(docFreq=9, maxDocs=44421)
          0.4375 = fieldNorm(doc=54)
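
The author scores above can be reproduced by hand. The following is a minimal sketch, assuming the usual ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); this idf form is the one that matches the printed value 9.398883. The function names are illustrative and not part of Lucene's API, and fieldNorm is taken from the listing rather than recomputed.

```python
import math

def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw term frequency."""
    return math.sqrt(freq)

def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency in the form that matches the listing."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the explain trees above."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm

# Values copied from the listing (docFreq=9, maxDocs=44421 for "pfister").
score_hit_1 = field_weight(2.0, 9, 44421, 0.375)   # listing shows 4.9845104
score_hit_2 = field_weight(1.0, 9, 44421, 0.5)     # listing shows 4.6994414
score_hit_5 = field_weight(1.0, 9, 44421, 0.4375)  # listing shows 4.1120114

print(score_hit_1, score_hit_2, score_hit_5)
```

Small differences in the last digits are expected, because the listing appears to use single-precision floats while Python uses double precision.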
    

Similar documents (content)

  1. Schramm, R.: Patentinformation (2004) 0.23
    0.23275574 = sum of:
      0.23275574 = product of:
        1.9396312 = sum of:
          0.04321659 = weight(abstract_txt:verfahren in 3955) [ClassicSimilarity], result of:
            0.04321659 = score(doc=3955,freq=2.0), product of:
              0.113103345 = queryWeight, product of:
                1.2300966 = boost
                5.7639313 = idf(docFreq=378, maxDocs=44421)
                0.015952084 = queryNorm
              0.38209826 = fieldWeight in 3955, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7639313 = idf(docFreq=378, maxDocs=44421)
                0.046875 = fieldNorm(doc=3955)
          0.07513477 = weight(abstract_txt:patent in 3955) [ClassicSimilarity], result of:
            0.07513477 = score(doc=3955,freq=2.0), product of:
              0.16353187 = queryWeight, product of:
                1.4791175 = boost
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.015952084 = queryNorm
              0.45945033 = fieldWeight in 3955, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.046875 = fieldNorm(doc=3955)
          1.8212799 = weight(title_txt:patentinformation in 3955) [ClassicSimilarity], result of:
            1.8212799 = score(doc=3955,freq=1.0), product of:
              0.22435224 = queryWeight, product of:
                1.7324739 = boost
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.015952084 = queryNorm
              8.117949 = fieldWeight in 3955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.117949 = idf(docFreq=35, maxDocs=44421)
                1.0 = fieldNorm(doc=3955)
        0.12 = coord(3/25)
    
  2. STN baut Patentinformation aus (2004) 0.17
    0.1703643 = sum of:
      0.1703643 = product of:
        1.0647769 = sum of:
          0.036150355 = weight(abstract_txt:datenbanken in 3304) [ClassicSimilarity], result of:
            0.036150355 = score(doc=3304,freq=1.0), product of:
              0.11415518 = queryWeight, product of:
                1.2358031 = boost
                5.790671 = idf(docFreq=368, maxDocs=44421)
                0.015952084 = queryNorm
              0.3166773 = fieldWeight in 3304, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.790671 = idf(docFreq=368, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3304)
          0.036562417 = weight(abstract_txt:beispiel in 3304) [ClassicSimilarity], result of:
            0.036562417 = score(doc=3304,freq=1.0), product of:
              0.115021005 = queryWeight, product of:
                1.2404809 = boost
                5.8125896 = idf(docFreq=360, maxDocs=44421)
                0.015952084 = queryNorm
              0.31787598 = fieldWeight in 3304, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8125896 = idf(docFreq=360, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3304)
          0.08142413 = weight(abstract_txt:karlsruhe in 3304) [ClassicSimilarity], result of:
            0.08142413 = score(doc=3304,freq=1.0), product of:
              0.19615047 = queryWeight, product of:
                1.6199297 = boost
                7.590594 = idf(docFreq=60, maxDocs=44421)
                0.015952084 = queryNorm
              0.4151106 = fieldWeight in 3304, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.590594 = idf(docFreq=60, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3304)
          0.91063994 = weight(title_txt:patentinformation in 3304) [ClassicSimilarity], result of:
            0.91063994 = score(doc=3304,freq=1.0), product of:
              0.22435224 = queryWeight, product of:
                1.7324739 = boost
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.015952084 = queryNorm
              4.0589743 = fieldWeight in 3304, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.5 = fieldNorm(doc=3304)
        0.16 = coord(4/25)
    
  3. Gerick, T.: Content-based Information Retrieval auf Basis semantischer Abfragenetze : Kooperative Technologien am Beispiel der Dokumentenrecherche in GENIOS Wirtschaftsdatenbanken (1999) 0.13
    0.13039598 = sum of:
      0.13039598 = product of:
        0.4656999 = sum of:
          0.027363185 = weight(abstract_txt:dabei in 4874) [ClassicSimilarity], result of:
            0.027363185 = score(doc=4874,freq=1.0), product of:
              0.07474756 = queryWeight, product of:
                4.6857553 = idf(docFreq=1113, maxDocs=44421)
                0.015952084 = queryNorm
              0.36607462 = fieldWeight in 4874, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6857553 = idf(docFreq=1113, maxDocs=44421)
                0.078125 = fieldNorm(doc=4874)
          0.05093124 = weight(abstract_txt:verfahren in 4874) [ClassicSimilarity], result of:
            0.05093124 = score(doc=4874,freq=1.0), product of:
              0.113103345 = queryWeight, product of:
                1.2300966 = boost
                5.7639313 = idf(docFreq=378, maxDocs=44421)
                0.015952084 = queryNorm
              0.45030713 = fieldWeight in 4874, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7639313 = idf(docFreq=378, maxDocs=44421)
                0.078125 = fieldNorm(doc=4874)
          0.051643364 = weight(abstract_txt:datenbanken in 4874) [ClassicSimilarity], result of:
            0.051643364 = score(doc=4874,freq=1.0), product of:
              0.11415518 = queryWeight, product of:
                1.2358031 = boost
                5.790671 = idf(docFreq=368, maxDocs=44421)
                0.015952084 = queryNorm
              0.45239615 = fieldWeight in 4874, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.790671 = idf(docFreq=368, maxDocs=44421)
                0.078125 = fieldNorm(doc=4874)
          0.052232023 = weight(abstract_txt:beispiel in 4874) [ClassicSimilarity], result of:
            0.052232023 = score(doc=4874,freq=1.0), product of:
              0.115021005 = queryWeight, product of:
                1.2404809 = boost
                5.8125896 = idf(docFreq=360, maxDocs=44421)
                0.015952084 = queryNorm
              0.45410857 = fieldWeight in 4874, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8125896 = idf(docFreq=360, maxDocs=44421)
                0.078125 = fieldNorm(doc=4874)
          0.07116866 = weight(abstract_txt:dokumenten in 4874) [ClassicSimilarity], result of:
            0.07116866 = score(doc=4874,freq=1.0), product of:
              0.14136605 = queryWeight, product of:
                1.3752259 = boost
                6.443972 = idf(docFreq=191, maxDocs=44421)
                0.015952084 = queryNorm
              0.5034353 = fieldWeight in 4874, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.443972 = idf(docFreq=191, maxDocs=44421)
                0.078125 = fieldNorm(doc=4874)
          0.08854718 = weight(abstract_txt:patent in 4874) [ClassicSimilarity], result of:
            0.08854718 = score(doc=4874,freq=1.0), product of:
              0.16353187 = queryWeight, product of:
                1.4791175 = boost
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.015952084 = queryNorm
              0.5414674 = fieldWeight in 4874, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.078125 = fieldNorm(doc=4874)
          0.123814255 = weight(abstract_txt:werkzeug in 4874) [ClassicSimilarity], result of:
            0.123814255 = score(doc=4874,freq=1.0), product of:
              0.20448731 = queryWeight, product of:
                1.6539968 = boost
                7.750224 = idf(docFreq=51, maxDocs=44421)
                0.015952084 = queryNorm
              0.6054863 = fieldWeight in 4874, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.750224 = idf(docFreq=51, maxDocs=44421)
                0.078125 = fieldNorm(doc=4874)
        0.28 = coord(7/25)
    
  4. Panyr, J.: Vektorraum-Modell und Clusteranalyse in Information-Retrieval-Systemen (1987) 0.09
    0.09174131 = sum of:
      0.09174131 = product of:
        0.5733832 = sum of:
          0.04004347 = weight(abstract_txt:diesem in 2321) [ClassicSimilarity], result of:
            0.04004347 = score(doc=2321,freq=1.0), product of:
              0.076987766 = queryWeight, product of:
                1.0148745 = boost
                4.7554536 = idf(docFreq=1038, maxDocs=44421)
                0.015952084 = queryNorm
              0.5201277 = fieldWeight in 2321, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7554536 = idf(docFreq=1038, maxDocs=44421)
                0.109375 = fieldNorm(doc=2321)
          0.14090677 = weight(abstract_txt:dokumenten in 2321) [ClassicSimilarity], result of:
            0.14090677 = score(doc=2321,freq=2.0), product of:
              0.14136605 = queryWeight, product of:
                1.3752259 = boost
                6.443972 = idf(docFreq=191, maxDocs=44421)
                0.015952084 = queryNorm
              0.99675107 = fieldWeight in 2321, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.443972 = idf(docFreq=191, maxDocs=44421)
                0.109375 = fieldNorm(doc=2321)
          0.12351379 = weight(abstract_txt:automatische in 2321) [ClassicSimilarity], result of:
            0.12351379 = score(doc=2321,freq=1.0), product of:
              0.16313389 = queryWeight, product of:
                1.4773166 = boost
                6.922344 = idf(docFreq=118, maxDocs=44421)
                0.015952084 = queryNorm
              0.7571314 = fieldWeight in 2321, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.922344 = idf(docFreq=118, maxDocs=44421)
                0.109375 = fieldNorm(doc=2321)
          0.2689192 = weight(abstract_txt:clustering in 2321) [ClassicSimilarity], result of:
            0.2689192 = score(doc=2321,freq=1.0), product of:
              0.39523512 = queryWeight, product of:
                3.9828126 = boost
                6.2208285 = idf(docFreq=239, maxDocs=44421)
                0.015952084 = queryNorm
              0.6804031 = fieldWeight in 2321, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2208285 = idf(docFreq=239, maxDocs=44421)
                0.109375 = fieldNorm(doc=2321)
        0.16 = coord(4/25)
    
  5. Geiß, D.: Aus der Praxis der Patentinformation : Teil 1: Übersicht über die Entwicklung der elektronischen Medien bei Patentbehörden (2004) 0.09
    0.08672443 = sum of:
      0.08672443 = product of:
        0.5420277 = sum of:
          0.016417913 = weight(abstract_txt:dabei in 3366) [ClassicSimilarity], result of:
            0.016417913 = score(doc=3366,freq=1.0), product of:
              0.07474756 = queryWeight, product of:
                4.6857553 = idf(docFreq=1113, maxDocs=44421)
                0.015952084 = queryNorm
              0.21964478 = fieldWeight in 3366, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6857553 = idf(docFreq=1113, maxDocs=44421)
                0.046875 = fieldNorm(doc=3366)
          0.017161489 = weight(abstract_txt:diesem in 3366) [ClassicSimilarity], result of:
            0.017161489 = score(doc=3366,freq=1.0), product of:
              0.076987766 = queryWeight, product of:
                1.0148745 = boost
                4.7554536 = idf(docFreq=1038, maxDocs=44421)
                0.015952084 = queryNorm
              0.2229119 = fieldWeight in 3366, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7554536 = idf(docFreq=1038, maxDocs=44421)
                0.046875 = fieldNorm(doc=3366)
          0.053128306 = weight(abstract_txt:patent in 3366) [ClassicSimilarity], result of:
            0.053128306 = score(doc=3366,freq=1.0), product of:
              0.16353187 = queryWeight, product of:
                1.4791175 = boost
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.015952084 = queryNorm
              0.32488045 = fieldWeight in 3366, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.046875 = fieldNorm(doc=3366)
          0.45531997 = weight(title_txt:patentinformation in 3366) [ClassicSimilarity], result of:
            0.45531997 = score(doc=3366,freq=1.0), product of:
              0.22435224 = queryWeight, product of:
                1.7324739 = boost
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.015952084 = queryNorm
              2.0294871 = fieldWeight in 3366, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.25 = fieldNorm(doc=3366)
        0.16 = coord(4/25)
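
The content scores combine per-term scores with a coordination factor. As a rough sketch, the first hit (Schramm, doc 3955) can be recomputed as follows, assuming each term score is queryWeight * fieldWeight, with queryWeight = boost * idf * queryNorm and fieldWeight = tf * idf * fieldNorm, and that the sum is scaled by coord = matched terms / query terms. The helper names are illustrative; boost, queryNorm, and fieldNorm are copied from the listing, not derived.

```python
import math

QUERY_NORM = 0.015952084  # queryNorm as printed in the listing
MAX_DOCS = 44421          # maxDocs as printed in the listing
QUERY_TERMS = 25          # denominator of coord(n/25)

def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """idf form that reproduces the printed values (an assumption)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(boost: float, freq: float, doc_freq: int, field_norm: float) -> float:
    """Score of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight

# (boost, freq, docFreq, fieldNorm) for the three terms matched in doc 3955.
matched_terms = [
    (1.2300966, 2.0, 378, 0.046875),  # abstract_txt:verfahren
    (1.4791175, 2.0, 117, 0.046875),  # abstract_txt:patent
    (1.7324739, 1.0, 35, 1.0),        # title_txt:patentinformation
]

raw_sum = sum(term_score(*term) for term in matched_terms)  # listing: 1.9396312
coord = len(matched_terms) / QUERY_TERMS                    # listing: 0.12
final_score = raw_sum * coord                               # listing: 0.23275574

print(raw_sum, coord, final_score)
```

The title match dominates the sum because its fieldNorm is 1.0, while the abstract terms are damped by a much smaller fieldNorm. The coord factor then penalizes the hit for matching only 3 of the 25 query terms.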