题目
Transformer架构是由哪篇论文首次提出的?A. Deep Learning for NLPB. BERT: Pre-training of Deep Bidirectional Transformers for Language UnderstandingC. Attention Is All You NeedD. Sequence to Sequence Learning with Neural Networks
Transformer架构是由哪篇论文首次提出的?
A. Deep Learning for NLP
B. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
C. Attention Is All You Need
D. Sequence to Sequence Learning with Neural Networks
题目解答
答案
C. Attention Is All You Need