RetNet

Retentive Network: A Successor to Transformer for Large Language Models