Document (#26821)

Author
Karakos, A.
Title
Greeklish : an experimental interface for automatic transliteration
Source
Journal of the American Society for Information Science and Technology. 54(2003) no.11, p.1069-1074
Year
2003
Abstract
"Transliteration" in linguistics means the system of conveying as nearly as possible by means of one set of letters or characters the pronunciation of the words in languages written and printed in a totally different script. This term may be applied to a transcription in Latin letters of Greek, Hebrew, or the Slavonic languages written in the Cyrillic alphabet. We present in this article Greeklish, a Windows application that automatically produces English to Greek transliteration and back-transliteration (retransliteration). This transliteration is based on an algorithm with a table of associations between the two character sets. This table can be modified by the user so that it can cover personal preferences or formal present and future rules. The novelty of this system is its speed of operation, its simplicity, and its ease of use. Our examples use a Greek to Latin (English) alphabet mapping, but the Greeklish application can easily use any X to Latin mapping, where X is any non-Latin alphabet.
Theme
Computational linguistics
Field
Linguistics
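
The abstract describes transliteration driven by a user-editable table of associations between two character sets. The article's actual table and algorithm are not reproduced in this record, so the following is only a minimal sketch of the general idea: the mapping `GREEK_TO_LATIN`, the helper `invert`, and the function `transliterate` are hypothetical names chosen for illustration, and the letter pairs are one informal convention, not the one shipped with Greeklish.

```python
# Illustrative association table (an assumption, NOT the table from the article).
# Lowercase, unaccented letters only; anything not in the table passes through.
GREEK_TO_LATIN = {
    "α": "a", "β": "v", "γ": "g", "δ": "d", "ε": "e", "ζ": "z",
    "η": "h", "θ": "8", "ι": "i", "κ": "k", "λ": "l", "μ": "m",
    "ν": "n", "ξ": "ks", "ο": "o", "π": "p", "ρ": "r", "σ": "s",
    "ς": "s", "τ": "t", "υ": "y", "φ": "f", "χ": "x", "ψ": "ps",
    "ω": "w",
}


def invert(table):
    """Build the back-transliteration table; the first source letter wins on ties."""
    inverse = {}
    for source, target in table.items():
        inverse.setdefault(target, source)
    return inverse


def transliterate(text, table):
    """Replace the longest matching table key at each position (greedy match)."""
    max_len = max(len(key) for key in table)
    out = []
    i = 0
    while i < len(text):
        for size in range(max_len, 0, -1):
            chunk = text[i:i + size]
            if chunk in table:
                out.append(table[chunk])
                i += len(chunk)
                break
        else:
            # No table entry starts here: copy the character unchanged.
            out.append(text[i])
            i += 1
    return "".join(out)


LATIN_TO_GREEK = invert(GREEK_TO_LATIN)

latin = transliterate("καλημερα", GREEK_TO_LATIN)
greek = transliterate(latin, LATIN_TO_GREEK)
```

Because the table is plain data, a user preference such as `GREEK_TO_LATIN["θ"] = "th"` followed by a fresh call to `invert` changes both directions without touching the matching code. Round trips are not guaranteed for every word in this sketch: "σ" and "ς" share one Latin letter, and "ks" is always read back as "ξ".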

Similar documents (content)

  1. Beloozerov, V.N.; Radkovskii, G.N.; Kosarskaya, Y.P.: Prakticheskaya transliteratsiya russkogo teksta latinskim alfavitom (1997) 0.37
    0.37131387 = sum of:
      0.37131387 = product of:
        1.8565693 = sum of:
          0.1429411 = weight(abstract_txt:characters in 3260) [ClassicSimilarity], result of:
            0.1429411 = score(doc=3260,freq=3.0), product of:
              0.08941205 = queryWeight, product of:
                1.0477434 = boost
                7.3839793 = idf(docFreq=74, maxDocs=44421)
                0.011557146 = queryNorm
              1.5986784 = fieldWeight in 3260, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.3839793 = idf(docFreq=74, maxDocs=44421)
                0.125 = fieldNorm(doc=3260)
          0.2488815 = weight(abstract_txt:cyrillic in 3260) [ClassicSimilarity], result of:
            0.2488815 = score(doc=3260,freq=2.0), product of:
              0.14813241 = queryWeight, product of:
                1.3485963 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.011557146 = queryNorm
              1.6801286 = fieldWeight in 3260, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.125 = fieldNorm(doc=3260)
          0.07102248 = weight(abstract_txt:english in 3260) [ClassicSimilarity], result of:
            0.07102248 = score(doc=3260,freq=1.0), product of:
              0.10192301 = queryWeight, product of:
                1.5820057 = boost
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.011557146 = queryNorm
              0.6968248 = fieldWeight in 3260, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.125 = fieldNorm(doc=3260)
          0.3905524 = weight(abstract_txt:latin in 3260) [ClassicSimilarity], result of:
            0.3905524 = score(doc=3260,freq=1.0), product of:
              0.40007174 = queryWeight, product of:
                4.432573 = boost
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.011557146 = queryNorm
              0.9762059 = fieldWeight in 3260, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.125 = fieldNorm(doc=3260)
          1.0031718 = weight(abstract_txt:transliteration in 3260) [ClassicSimilarity], result of:
            1.0031718 = score(doc=3260,freq=3.0), product of:
              0.56044304 = queryWeight, product of:
                5.8655353 = boost
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.011557146 = queryNorm
              1.789962 = fieldWeight in 3260, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.125 = fieldNorm(doc=3260)
        0.2 = coord(5/25)
    
  2. Fattah, M. Abdel; Ren, F.: English-Arabic proper-noun transliteration-pairs creation (2008) 0.26
    0.25558347 = sum of:
      0.25558347 = product of:
        0.91279805 = sum of:
          0.023793442 = weight(abstract_txt:present in 2999) [ClassicSimilarity], result of:
            0.023793442 = score(doc=2999,freq=2.0), product of:
              0.061942663 = queryWeight, product of:
                1.2332947 = boost
                4.3458266 = idf(docFreq=1564, maxDocs=44421)
                0.011557146 = queryNorm
              0.38412043 = fieldWeight in 2999, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3458266 = idf(docFreq=1564, maxDocs=44421)
                0.0625 = fieldNorm(doc=2999)
          0.025863992 = weight(abstract_txt:means in 2999) [ClassicSimilarity], result of:
            0.025863992 = score(doc=2999,freq=1.0), product of:
              0.08250724 = queryWeight, product of:
                1.4233705 = boost
                5.015607 = idf(docFreq=800, maxDocs=44421)
                0.011557146 = queryNorm
              0.31347543 = fieldWeight in 2999, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.015607 = idf(docFreq=800, maxDocs=44421)
                0.0625 = fieldNorm(doc=2999)
          0.040520273 = weight(abstract_txt:languages in 2999) [ClassicSimilarity], result of:
            0.040520273 = score(doc=2999,freq=2.0), product of:
              0.08833509 = queryWeight, product of:
                1.4727824 = boost
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.011557146 = queryNorm
              0.45871094 = fieldWeight in 2999, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.0625 = fieldNorm(doc=2999)
          0.07102248 = weight(abstract_txt:english in 2999) [ClassicSimilarity], result of:
            0.07102248 = score(doc=2999,freq=4.0), product of:
              0.10192301 = queryWeight, product of:
                1.5820057 = boost
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.011557146 = queryNorm
              0.6968248 = fieldWeight in 2999, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.0625 = fieldNorm(doc=2999)
          0.13917772 = weight(abstract_txt:letters in 2999) [ClassicSimilarity], result of:
            0.13917772 = score(doc=2999,freq=2.0), product of:
              0.20109357 = queryWeight, product of:
                2.222138 = boost
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.011557146 = queryNorm
              0.6921043 = fieldWeight in 2999, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.0625 = fieldNorm(doc=2999)
          0.20287696 = weight(abstract_txt:alphabet in 2999) [ClassicSimilarity], result of:
            0.20287696 = score(doc=2999,freq=1.0), product of:
              0.37286124 = queryWeight, product of:
                3.7058787 = boost
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.011557146 = queryNorm
              0.54410845 = fieldWeight in 2999, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.0625 = fieldNorm(doc=2999)
          0.40954316 = weight(abstract_txt:transliteration in 2999) [ClassicSimilarity], result of:
            0.40954316 = score(doc=2999,freq=2.0), product of:
              0.56044304 = queryWeight, product of:
                5.8655353 = boost
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.011557146 = queryNorm
              0.73074895 = fieldWeight in 2999, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.0625 = fieldNorm(doc=2999)
        0.28 = coord(7/25)
    
  3. Aissing, A.A.: Cyrillic transliteration and its users (1995) 0.18
    0.17948005 = sum of:
      0.17948005 = product of:
        1.1217504 = sum of:
          0.18666112 = weight(abstract_txt:cyrillic in 1924) [ClassicSimilarity], result of:
            0.18666112 = score(doc=1924,freq=2.0), product of:
              0.14813241 = queryWeight, product of:
                1.3485963 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.011557146 = queryNorm
              1.2600964 = fieldWeight in 1924, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.09375 = fieldNorm(doc=1924)
          0.042978242 = weight(abstract_txt:languages in 1924) [ClassicSimilarity], result of:
            0.042978242 = score(doc=1924,freq=1.0), product of:
              0.08833509 = queryWeight, product of:
                1.4727824 = boost
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.011557146 = queryNorm
              0.48653644 = fieldWeight in 1924, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.09375 = fieldNorm(doc=1924)
          0.13973206 = weight(abstract_txt:table in 1924) [ClassicSimilarity], result of:
            0.13973206 = score(doc=1924,freq=2.0), product of:
              0.15387033 = queryWeight, product of:
                1.9437901 = boost
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.011557146 = queryNorm
              0.9081157 = fieldWeight in 1924, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.09375 = fieldNorm(doc=1924)
          0.7523789 = weight(abstract_txt:transliteration in 1924) [ClassicSimilarity], result of:
            0.7523789 = score(doc=1924,freq=3.0), product of:
              0.56044304 = queryWeight, product of:
                5.8655353 = boost
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.011557146 = queryNorm
              1.3424716 = fieldWeight in 1924, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.09375 = fieldNorm(doc=1924)
        0.16 = coord(4/25)
    
  4. Harrison, S.E.: Chinese names in English (1992) 0.16
    0.15854912 = sum of:
      0.15854912 = product of:
        0.66062135 = sum of:
          0.077651 = weight(abstract_txt:script in 3051) [ClassicSimilarity], result of:
            0.077651 = score(doc=3051,freq=1.0), product of:
              0.10400532 = queryWeight, product of:
                1.1300163 = boost
                7.963798 = idf(docFreq=41, maxDocs=44421)
                0.011557146 = queryNorm
              0.74660605 = fieldWeight in 3051, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.963798 = idf(docFreq=41, maxDocs=44421)
                0.09375 = fieldNorm(doc=3051)
          0.025236756 = weight(abstract_txt:present in 3051) [ClassicSimilarity], result of:
            0.025236756 = score(doc=3051,freq=1.0), product of:
              0.061942663 = queryWeight, product of:
                1.2332947 = boost
                4.3458266 = idf(docFreq=1564, maxDocs=44421)
                0.011557146 = queryNorm
              0.40742123 = fieldWeight in 3051, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3458266 = idf(docFreq=1564, maxDocs=44421)
                0.09375 = fieldNorm(doc=3051)
          0.05326686 = weight(abstract_txt:english in 3051) [ClassicSimilarity], result of:
            0.05326686 = score(doc=3051,freq=1.0), product of:
              0.10192301 = queryWeight, product of:
                1.5820057 = boost
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.011557146 = queryNorm
              0.5226186 = fieldWeight in 3051, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.09375 = fieldNorm(doc=3051)
          0.059371073 = weight(abstract_txt:written in 3051) [ClassicSimilarity], result of:
            0.059371073 = score(doc=3051,freq=1.0), product of:
              0.10956809 = queryWeight, product of:
                1.6402649 = boost
                5.779889 = idf(docFreq=372, maxDocs=44421)
                0.011557146 = queryNorm
              0.54186463 = fieldWeight in 3051, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.779889 = idf(docFreq=372, maxDocs=44421)
                0.09375 = fieldNorm(doc=3051)
          0.010709528 = weight(abstract_txt:this in 3051) [ClassicSimilarity], result of:
            0.010709528 = score(doc=3051,freq=1.0), product of:
              0.04747457 = queryWeight, product of:
                1.7071531 = boost
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.011557146 = queryNorm
              0.2255845 = fieldWeight in 3051, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.09375 = fieldNorm(doc=3051)
          0.43438613 = weight(abstract_txt:transliteration in 3051) [ClassicSimilarity], result of:
            0.43438613 = score(doc=3051,freq=1.0), product of:
              0.56044304 = queryWeight, product of:
                5.8655353 = boost
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.011557146 = queryNorm
              0.7750763 = fieldWeight in 3051, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.09375 = fieldNorm(doc=3051)
        0.24 = coord(6/25)
    
  5. Toivonen, J.; Pirkola, A.; Keskustalo, H.; Visala, K.; Järvelin, K.: Translating cross-lingual spelling variants using transformation rules (2005) 0.14
    0.14133057 = sum of:
      0.14133057 = product of:
        0.5888774 = sum of:
          0.02103063 = weight(abstract_txt:present in 2052) [ClassicSimilarity], result of:
            0.02103063 = score(doc=2052,freq=1.0), product of:
              0.061942663 = queryWeight, product of:
                1.2332947 = boost
                4.3458266 = idf(docFreq=1564, maxDocs=44421)
                0.011557146 = queryNorm
              0.3395177 = fieldWeight in 2052, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3458266 = idf(docFreq=1564, maxDocs=44421)
                0.078125 = fieldNorm(doc=2052)
          0.050650343 = weight(abstract_txt:languages in 2052) [ClassicSimilarity], result of:
            0.050650343 = score(doc=2052,freq=2.0), product of:
              0.08833509 = queryWeight, product of:
                1.4727824 = boost
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.011557146 = queryNorm
              0.5733887 = fieldWeight in 2052, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.078125 = fieldNorm(doc=2052)
          0.04438905 = weight(abstract_txt:english in 2052) [ClassicSimilarity], result of:
            0.04438905 = score(doc=2052,freq=1.0), product of:
              0.10192301 = queryWeight, product of:
                1.5820057 = boost
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.011557146 = queryNorm
              0.4355155 = fieldWeight in 2052, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.078125 = fieldNorm(doc=2052)
          0.008924606 = weight(abstract_txt:this in 2052) [ClassicSimilarity], result of:
            0.008924606 = score(doc=2052,freq=1.0), product of:
              0.04747457 = queryWeight, product of:
                1.7071531 = boost
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.011557146 = queryNorm
              0.18798709 = fieldWeight in 2052, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.078125 = fieldNorm(doc=2052)
          0.21978751 = weight(abstract_txt:greek in 2052) [ClassicSimilarity], result of:
            0.21978751 = score(doc=2052,freq=1.0), product of:
              0.3389384 = queryWeight, product of:
                3.5332792 = boost
                8.30027 = idf(docFreq=29, maxDocs=44421)
                0.011557146 = queryNorm
              0.6484586 = fieldWeight in 2052, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.30027 = idf(docFreq=29, maxDocs=44421)
                0.078125 = fieldNorm(doc=2052)
          0.24409525 = weight(abstract_txt:latin in 2052) [ClassicSimilarity], result of:
            0.24409525 = score(doc=2052,freq=1.0), product of:
              0.40007174 = queryWeight, product of:
                4.432573 = boost
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.011557146 = queryNorm
              0.6101287 = fieldWeight in 2052, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.078125 = fieldNorm(doc=2052)
        0.24 = coord(6/25)
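
The score trees above follow the structure of Lucene's ClassicSimilarity (TF-IDF) explanation: each matching term contributes queryWeight × fieldWeight, the contributions are summed, and the sum is multiplied by the coord factor. As a sketch, the first result (doc 3260) can be recomputed from the values printed in its tree. The function names `idf` and `term_score` are hypothetical, and the result agrees with the listing only up to rounding, since Lucene computes in single precision.

```python
import math

QUERY_NORM = 0.011557146   # queryNorm, as printed in the listing
MAX_DOCS = 44421           # maxDocs, as printed in the listing


def idf(doc_freq, max_docs=MAX_DOCS):
    """ClassicSimilarity inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost, doc_freq, freq, field_norm):
    """queryWeight * fieldWeight for a single matching term."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm  # tf = sqrt(freq)
    return query_weight * field_weight


# (boost, docFreq, termFreq) copied from the tree for doc 3260
terms_3260 = {
    "characters":      (1.0477434, 74, 3.0),
    "cyrillic":        (1.3485963, 8, 2.0),
    "english":         (1.5820057, 457, 1.0),
    "latin":           (4.432573, 48, 1.0),
    "transliteration": (5.8655353, 30, 3.0),
}

FIELD_NORM_3260 = 0.125
COORD_3260 = 5 / 25  # 5 of the 25 query terms matched

total = sum(
    term_score(boost, doc_freq, freq, FIELD_NORM_3260)
    for boost, doc_freq, freq in terms_3260.values()
)
score = COORD_3260 * total
print(round(score, 4))
```

The same function applies to the other four results by substituting their term lists, fieldNorm, and coord values.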