题目
Transformer架构是由哪篇论文首次提出的?A. Attention Is All You NeedB. Deep Learning for NLPC. Sequence to Sequence Learning with Neural NetworksD. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Transformer架构是由哪篇论文首次提出的?
A. Attention Is All You Need
B. Deep Learning for NLP
C. Sequence to Sequence Learning with Neural Networks
D. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
题目解答
答案
A. Attention Is All You Need