Search (1 results, page 1 of 1)

  • × author_ss:"Wu, J."
  1. Radford, A.; Wu, J.; Child, R.; Luan, D.; Amode, D.; Sutskever, I.: Language models are unsupervised multitask learners 10.60
    10.602856 = weight(object_ss:GPT-2 in 1872) [ClassicSimilarity], result of:
      10.602856 = fieldWeight in 1872, product of:
        1.0 = tf(freq=1.0), with freq of:
          1.0 = termFreq=1.0
        10.602856 = idf(docFreq=2, maxDocs=44421)
        1.0 = fieldNorm(doc=1872)
    
    Object
    GPT-2