Enable javascript in your browser for better experience. Need to know to enable it? Go here.
Published : Apr 26, 2023
NOT ON THE CURRENT EDITION
This blip is not on the current edition of the Radar. If it was on one of the last few editions, it is likely that it is still relevant. If the blip is older, it might no longer be relevant and our assessment might be different today. Unfortunately, we simply don't have the bandwidth to continuously review blips from previous editions of the Radar. Understand more
Apr 2023
Assess ?

nanoGPT 是一个用于对中等规模的生成式预训练 Transformer(GPT)进行训练和调优的框架。其作者 Andrej Karpathy 基于注意力机制OpenAI 的 GPT-3 两篇论文的理论,使用 PyTorch 从零开始构建一个 GPT。在生成式人工智能火热的趋势下,我们想要强调 nanoGPT 的简洁性,并且注重对 GPT 架构的构建模块进行清晰呈现。

Download the PDF

 

 

 

English | Português 

Sign up for the Technology Radar newsletter

 

 

Subscribe now

Visit our archive to read previous volumes