References

Language models are few-shot learners

Tom B. Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, Sandhini Agarwal, Ariel Herbert-Voss, Gretchen Krueger, Tom Henighan, Rewon Child, Aditya Ramesh, Daniel M. Ziegler, Jeffrey Wu, Clemens Winter, Christopher Hesse, Mark Chen, Eric Sigler, Mateusz Litwin, Scott Gray, Benjamin Chess, Jack Clark, Christopher Berner, Sam McCandlish, Alec Radford, Ilya Sutskever, & Dario Amodei (2020)

Advances in Neural Information Processing Systems, 33, 1877-1901.

URL: https://papers.nips.cc/paper/2020/hash/1457c0d6bfcb4967418bfb8ac142f64a-Abstract.html

Abstract. Brown et al.'s GPT-3 paper — the foundational demonstration that language models scale to 175B parameters and acquire emergent few-shot learning capability. The reference cited as the modern foundation-model era's starting point.

Tags: ai-usability llm foundational gpt

This site is currently in Beta. Contact: Chris Paton

Textbook of AI · Textbook of Digital Health

Auckland Maths and Science Tutoring