PanGu-α: Large-scale Autoregressive Pretrained Chinese Language Models with Auto-parallel Computation
Wei Zeng, Xiaozhe Ren, Teng Su, Hui Wang, Yi Liao, Zhiwei Wang, Xin Jiang, ZhenZhang Yang, Kaisheng Wang, Xiaoda Zhang, Chen Li, Ziyan Gong, Yifan Yao, Xinjing Huang, Jun Wang, Jianfeng Yu, Qi Guo, Yue Yu, Yan Zhang, Jin Wang, Hengtao Tao, Dasen Yan, Zexuan Yi, Fang Peng, Fangqing Jiang, Han Zhang, Lingfeng Deng, Yehong Zhang, Zhe Lin, Chao Zhang, Shaojie Zhang, Mingyue Guo, Shanzhi Gu, Gaojun Fan, Yaowei Wang, Xuefeng Jin, Qun Liu, Yonghong Tian
arXiv (Cornell University), 2021
Keywords
Language Modeling, Multilingual Neural Machine Translation, Language Understanding, Neural Machine Translation, Attention Mechanism
