阿里社区博客(重点在transformer的激活值参数量估计):https://developer.aliyun.com/article/1496103
推理时显存占用(GitHub):
https://github.com/Hoper-J/I-Guide-and-Demos-zh_CN/blob/master/Guide/07.%20%E6%8E%A2%E7%A9%B6%E6%A8%A1%E5%9E%8B%E5%8F%82%E6%95%B0%E4%B8%8E%E6%98%BE%E5%AD%98%E7%9A%84%E5%85%B3%E7%B3%BB%E4%BB%A5%E5%8F%8A%E4%B8%8D%E5%90%8C%E7%B2%BE%E5%BA%A6%E9%80%A0%E6%88%90%E7%9A%84%E5%BD%B1%E5%93%8D.md#%E8%AE%AD%E7%BB%83%E6%97%B6%E7%9A%84%E6%98%BE%E5%AD%98%E5%8D%A0%E7%94%A8
显存评估器:https://vram.asmirnov.xyz/?ref=blog.runpod.io
显存评估器中文版(APX):https://apxml.com/zh/tools/vram-calculator