This is a document from a talk I gave titled Understanding GPT usage from Transformer.
I have been working with natural language for 2 months, so please let me know if there are any mistakes!
10. 参考文献(より詳しく知りたい方へ)
• Ashish Vaswani, et al., “Attention Is All You Need”, 12 June 2017,
https://arxiv.org/abs/1706.03762#.
• David Foster, “生成Deep Learning”, OREILY, pp229-pp.306, 2020.
• 深層学習界の大前提Transformerの論文解説!
https://qiita.com/omiita/items/07e69aef6c156d23c538