Document (#30153)

Author
Watters, C.
Amoudi, A.
Title
Geosearcher : location-based ranking of search engine results
Source
Journal of the American Society for Information Science and technology. 54(2003) no.2, S.140-151
Year
2003
Abstract
Waters and Amoudi describe GeoSearcher, a prototype ranking program that arranges search engine results along a geo-spatial dimension without the provision of geo-spatial meta-tags or the use of geo-spatial feature extraction. GeoSearcher uses URL analysis, IptoLL, Whois, and the Getty Thesaurus of Geographic Names to determine site location. It accepts the first 200 sites returned by a search engine, identifies the coordinates, calculates their distance from a reference point and ranks in ascending order by this value. For any retrieved site the system checks if it has already been located in the current session, then sends the domain name to Whois to generate a return of a two letter country code and an area code. With no success the name is stripped one level and resent. If this fails the top level domain is tested for being a country code. Any remaining unmatched names go to IptoLL. Distance is calculated using the center point of the geographic area and a provided reference location. A test run on a set of 100 URLs from a search was successful in locating 90 sites. Eighty three pages could be manually found and 68 had sufficient information to verify location determination. Of these 65 ( 95%) had been assigned reasonably correct geographic locations. A random set of URLs used instead of a search result, yielded 80% success.
Theme
Suchmaschinen
Retrievalalgorithmen
Object
GeoSearcher

Similar documents (author)

  1. Watters, C.: Extending the multimedia class hierarchy for hypermedia applications (1996) 5.62
    5.620886 = sum of:
      5.620886 = weight(author_txt:watters in 605) [ClassicSimilarity], result of:
        5.620886 = fieldWeight in 605, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.993418 = idf(docFreq=14, maxDocs=44421)
          0.625 = fieldNorm(doc=605)
    
  2. Watters, C.: Information retrieval and the virtual document (1999) 5.62
    5.620886 = sum of:
      5.620886 = weight(author_txt:watters in 5319) [ClassicSimilarity], result of:
        5.620886 = fieldWeight in 5319, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.993418 = idf(docFreq=14, maxDocs=44421)
          0.625 = fieldNorm(doc=5319)
    
  3. Watters, C.; Shepherd, M.A.: Shifting the information paradigm from data-centered to user-centered (1994) 4.50
    4.496709 = sum of:
      4.496709 = weight(author_txt:watters in 7289) [ClassicSimilarity], result of:
        4.496709 = fieldWeight in 7289, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.993418 = idf(docFreq=14, maxDocs=44421)
          0.5 = fieldNorm(doc=7289)
    
  4. Carrick, C.; Watters, C.: Automatic association of news items (1997) 4.50
    4.496709 = sum of:
      4.496709 = weight(author_txt:watters in 2549) [ClassicSimilarity], result of:
        4.496709 = fieldWeight in 2549, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.993418 = idf(docFreq=14, maxDocs=44421)
          0.5 = fieldNorm(doc=2549)
    
  5. Watters, C.; Wang, H.: Rating new documents for similarity (2000) 4.50
    4.496709 = sum of:
      4.496709 = weight(author_txt:watters in 5856) [ClassicSimilarity], result of:
        4.496709 = fieldWeight in 5856, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.993418 = idf(docFreq=14, maxDocs=44421)
          0.5 = fieldNorm(doc=5856)
    

Similar documents (content)

  1. Hill, L.L.; Frew, J.; Zheng, Q.: Geographic names : the implementation of a gazetteer in a georeferenced digital library (1999) 0.19
    0.19418152 = sum of:
      0.19418152 = product of:
        0.80908966 = sum of:
          0.018042207 = weight(abstract_txt:reference in 2240) [ClassicSimilarity], result of:
            0.018042207 = score(doc=2240,freq=1.0), product of:
              0.086201414 = queryWeight, product of:
                4.4651284 = idf(docFreq=1388, maxDocs=44421)
                0.019305473 = queryNorm
              0.2093029 = fieldWeight in 2240, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4651284 = idf(docFreq=1388, maxDocs=44421)
                0.046875 = fieldNorm(doc=2240)
          0.085957915 = weight(abstract_txt:name in 2240) [ClassicSimilarity], result of:
            0.085957915 = score(doc=2240,freq=5.0), product of:
              0.14273217 = queryWeight, product of:
                1.2867783 = boost
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.019305473 = queryNorm
              0.6022322 = fieldWeight in 2240, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.046875 = fieldNorm(doc=2240)
          0.13373339 = weight(abstract_txt:names in 2240) [ClassicSimilarity], result of:
            0.13373339 = score(doc=2240,freq=11.0), product of:
              0.14735006 = queryWeight, product of:
                1.3074285 = boost
                5.8378363 = idf(docFreq=351, maxDocs=44421)
                0.019305473 = queryNorm
              0.9075897 = fieldWeight in 2240, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                5.8378363 = idf(docFreq=351, maxDocs=44421)
                0.046875 = fieldNorm(doc=2240)
          0.33235124 = weight(abstract_txt:geographic in 2240) [ClassicSimilarity], result of:
            0.33235124 = score(doc=2240,freq=15.0), product of:
              0.2790741 = queryWeight, product of:
                2.2036784 = boost
                6.559804 = idf(docFreq=170, maxDocs=44421)
                0.019305473 = queryNorm
              1.1909068 = fieldWeight in 2240, product of:
                3.8729835 = tf(freq=15.0), with freq of:
                  15.0 = termFreq=15.0
                6.559804 = idf(docFreq=170, maxDocs=44421)
                0.046875 = fieldNorm(doc=2240)
          0.096058175 = weight(abstract_txt:spatial in 2240) [ClassicSimilarity], result of:
            0.096058175 = score(doc=2240,freq=1.0), product of:
              0.30086705 = queryWeight, product of:
                2.2881038 = boost
                6.8111186 = idf(docFreq=132, maxDocs=44421)
                0.019305473 = queryNorm
              0.31927118 = fieldWeight in 2240, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8111186 = idf(docFreq=132, maxDocs=44421)
                0.046875 = fieldNorm(doc=2240)
          0.14294674 = weight(abstract_txt:location in 2240) [ClassicSimilarity], result of:
            0.14294674 = score(doc=2240,freq=2.0), product of:
              0.34258693 = queryWeight, product of:
                2.8193123 = boost
                6.294296 = idf(docFreq=222, maxDocs=44421)
                0.019305473 = queryNorm
              0.41725683 = fieldWeight in 2240, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.294296 = idf(docFreq=222, maxDocs=44421)
                0.046875 = fieldNorm(doc=2240)
        0.24 = coord(6/25)
    
  2. Hill, L.L.: Geographic indexing for bibliographic databases (1989) 0.17
    0.16759409 = sum of:
      0.16759409 = product of:
        0.83797044 = sum of:
          0.036084414 = weight(abstract_txt:reference in 3716) [ClassicSimilarity], result of:
            0.036084414 = score(doc=3716,freq=1.0), product of:
              0.086201414 = queryWeight, product of:
                4.4651284 = idf(docFreq=1388, maxDocs=44421)
                0.019305473 = queryNorm
              0.4186058 = fieldWeight in 3716, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4651284 = idf(docFreq=1388, maxDocs=44421)
                0.09375 = fieldNorm(doc=3716)
          0.11034848 = weight(abstract_txt:country in 3716) [ClassicSimilarity], result of:
            0.11034848 = score(doc=3716,freq=1.0), product of:
              0.18161242 = queryWeight, product of:
                1.451495 = boost
                6.481112 = idf(docFreq=184, maxDocs=44421)
                0.019305473 = queryNorm
              0.60760427 = fieldWeight in 3716, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.481112 = idf(docFreq=184, maxDocs=44421)
                0.09375 = fieldNorm(doc=3716)
          0.29726398 = weight(abstract_txt:geographic in 3716) [ClassicSimilarity], result of:
            0.29726398 = score(doc=3716,freq=3.0), product of:
              0.2790741 = queryWeight, product of:
                2.2036784 = boost
                6.559804 = idf(docFreq=170, maxDocs=44421)
                0.019305473 = queryNorm
              1.0651793 = fieldWeight in 3716, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.559804 = idf(docFreq=170, maxDocs=44421)
                0.09375 = fieldNorm(doc=3716)
          0.19211635 = weight(abstract_txt:spatial in 3716) [ClassicSimilarity], result of:
            0.19211635 = score(doc=3716,freq=1.0), product of:
              0.30086705 = queryWeight, product of:
                2.2881038 = boost
                6.8111186 = idf(docFreq=132, maxDocs=44421)
                0.019305473 = queryNorm
              0.63854235 = fieldWeight in 3716, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8111186 = idf(docFreq=132, maxDocs=44421)
                0.09375 = fieldNorm(doc=3716)
          0.2021572 = weight(abstract_txt:location in 3716) [ClassicSimilarity], result of:
            0.2021572 = score(doc=3716,freq=1.0), product of:
              0.34258693 = queryWeight, product of:
                2.8193123 = boost
                6.294296 = idf(docFreq=222, maxDocs=44421)
                0.019305473 = queryNorm
              0.5900902 = fieldWeight in 3716, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.294296 = idf(docFreq=222, maxDocs=44421)
                0.09375 = fieldNorm(doc=3716)
        0.2 = coord(5/25)
    
  3. Koehler, W.C.: Internet search note : specialized retrieval and Web search engines (1997) 0.13
    0.13346389 = sum of:
      0.13346389 = product of:
        0.55609953 = sum of:
          0.07439445 = weight(abstract_txt:level in 1769) [ClassicSimilarity], result of:
            0.07439445 = score(doc=1769,freq=3.0), product of:
              0.08736216 = queryWeight, product of:
                1.0067103 = boost
                4.4950905 = idf(docFreq=1347, maxDocs=44421)
                0.019305473 = queryNorm
              0.8515637 = fieldWeight in 1769, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4950905 = idf(docFreq=1347, maxDocs=44421)
                0.109375 = fieldNorm(doc=1769)
          0.0500074 = weight(abstract_txt:domain in 1769) [ClassicSimilarity], result of:
            0.0500074 = score(doc=1769,freq=1.0), product of:
              0.09668512 = queryWeight, product of:
                1.059065 = boost
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.019305473 = queryNorm
              0.5172192 = fieldWeight in 1769, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.109375 = fieldNorm(doc=1769)
          0.060161978 = weight(abstract_txt:area in 1769) [ClassicSimilarity], result of:
            0.060161978 = score(doc=1769,freq=1.0), product of:
              0.10936664 = queryWeight, product of:
                1.1263808 = boost
                5.0294347 = idf(docFreq=789, maxDocs=44421)
                0.019305473 = queryNorm
              0.5500944 = fieldWeight in 1769, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0294347 = idf(docFreq=789, maxDocs=44421)
                0.109375 = fieldNorm(doc=1769)
          0.089696944 = weight(abstract_txt:name in 1769) [ClassicSimilarity], result of:
            0.089696944 = score(doc=1769,freq=1.0), product of:
              0.14273217 = queryWeight, product of:
                1.2867783 = boost
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.019305473 = queryNorm
              0.62842834 = fieldWeight in 1769, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.109375 = fieldNorm(doc=1769)
          0.08160907 = weight(abstract_txt:search in 1769) [ClassicSimilarity], result of:
            0.08160907 = score(doc=1769,freq=2.0), product of:
              0.14436626 = queryWeight, product of:
                2.0461886 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.019305473 = queryNorm
              0.5652918 = fieldWeight in 1769, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.109375 = fieldNorm(doc=1769)
          0.20022969 = weight(abstract_txt:geographic in 1769) [ClassicSimilarity], result of:
            0.20022969 = score(doc=1769,freq=1.0), product of:
              0.2790741 = queryWeight, product of:
                2.2036784 = boost
                6.559804 = idf(docFreq=170, maxDocs=44421)
                0.019305473 = queryNorm
              0.7174786 = fieldWeight in 1769, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.559804 = idf(docFreq=170, maxDocs=44421)
                0.109375 = fieldNorm(doc=1769)
        0.24 = coord(6/25)
    
  4. Wisniewski, J.: Authority work, Internet resources, and a cataloguer's home page (1998) 0.13
    0.13189486 = sum of:
      0.13189486 = product of:
        0.65947425 = sum of:
          0.12064349 = weight(abstract_txt:sites in 3534) [ClassicSimilarity], result of:
            0.12064349 = score(doc=3534,freq=2.0), product of:
              0.12628005 = queryWeight, product of:
                1.2103478 = boost
                5.4043584 = idf(docFreq=542, maxDocs=44421)
                0.019305473 = queryNorm
              0.9553646 = fieldWeight in 3534, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4043584 = idf(docFreq=542, maxDocs=44421)
                0.125 = fieldNorm(doc=3534)
          0.09996037 = weight(abstract_txt:site in 3534) [ClassicSimilarity], result of:
            0.09996037 = score(doc=3534,freq=1.0), product of:
              0.14035484 = queryWeight, product of:
                1.2760171 = boost
                5.6975803 = idf(docFreq=404, maxDocs=44421)
                0.019305473 = queryNorm
              0.71219754 = fieldWeight in 3534, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6975803 = idf(docFreq=404, maxDocs=44421)
                0.125 = fieldNorm(doc=3534)
          0.10251079 = weight(abstract_txt:name in 3534) [ClassicSimilarity], result of:
            0.10251079 = score(doc=3534,freq=1.0), product of:
              0.14273217 = queryWeight, product of:
                1.2867783 = boost
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.019305473 = queryNorm
              0.7182038 = fieldWeight in 3534, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.125 = fieldNorm(doc=3534)
          0.10752569 = weight(abstract_txt:names in 3534) [ClassicSimilarity], result of:
            0.10752569 = score(doc=3534,freq=1.0), product of:
              0.14735006 = queryWeight, product of:
                1.3074285 = boost
                5.8378363 = idf(docFreq=351, maxDocs=44421)
                0.019305473 = queryNorm
              0.72972953 = fieldWeight in 3534, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8378363 = idf(docFreq=351, maxDocs=44421)
                0.125 = fieldNorm(doc=3534)
          0.22883393 = weight(abstract_txt:geographic in 3534) [ClassicSimilarity], result of:
            0.22883393 = score(doc=3534,freq=1.0), product of:
              0.2790741 = queryWeight, product of:
                2.2036784 = boost
                6.559804 = idf(docFreq=170, maxDocs=44421)
                0.019305473 = queryNorm
              0.8199755 = fieldWeight in 3534, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.559804 = idf(docFreq=170, maxDocs=44421)
                0.125 = fieldNorm(doc=3534)
        0.2 = coord(5/25)
    
  5. Schaefer, M.T.: Project Aristotle & Cyberstacks : automating the virtual Internet library (1998) 0.12
    0.117539756 = sum of:
      0.117539756 = product of:
        0.58769876 = sum of:
          0.08746533 = weight(abstract_txt:site in 1337) [ClassicSimilarity], result of:
            0.08746533 = score(doc=1337,freq=1.0), product of:
              0.14035484 = queryWeight, product of:
                1.2760171 = boost
                5.6975803 = idf(docFreq=404, maxDocs=44421)
                0.019305473 = queryNorm
              0.6231729 = fieldWeight in 1337, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6975803 = idf(docFreq=404, maxDocs=44421)
                0.109375 = fieldNorm(doc=1337)
          0.0878082 = weight(abstract_txt:success in 1337) [ClassicSimilarity], result of:
            0.0878082 = score(doc=1337,freq=1.0), product of:
              0.14072141 = queryWeight, product of:
                1.2776823 = boost
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.019305473 = queryNorm
              0.62398607 = fieldWeight in 1337, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.109375 = fieldNorm(doc=1337)
          0.11886881 = weight(abstract_txt:engine in 1337) [ClassicSimilarity], result of:
            0.11886881 = score(doc=1337,freq=1.0), product of:
              0.19712687 = queryWeight, product of:
                1.8520869 = boost
                5.5132036 = idf(docFreq=486, maxDocs=44421)
                0.019305473 = queryNorm
              0.60300666 = fieldWeight in 1337, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5132036 = idf(docFreq=486, maxDocs=44421)
                0.109375 = fieldNorm(doc=1337)
          0.057706323 = weight(abstract_txt:search in 1337) [ClassicSimilarity], result of:
            0.057706323 = score(doc=1337,freq=1.0), product of:
              0.14436626 = queryWeight, product of:
                2.0461886 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.019305473 = queryNorm
              0.39972165 = fieldWeight in 1337, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.109375 = fieldNorm(doc=1337)
          0.23585007 = weight(abstract_txt:location in 1337) [ClassicSimilarity], result of:
            0.23585007 = score(doc=1337,freq=1.0), product of:
              0.34258693 = queryWeight, product of:
                2.8193123 = boost
                6.294296 = idf(docFreq=222, maxDocs=44421)
                0.019305473 = queryNorm
              0.6884386 = fieldWeight in 1337, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.294296 = idf(docFreq=222, maxDocs=44421)
                0.109375 = fieldNorm(doc=1337)
        0.2 = coord(5/25)