Yann LeCun retweets: "The pace of progress in large language model optimization is simply astounding!"

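Today, Aleksa Gordić, an ML and NLP researcher and former Google DeepMind / Microsoft ML engineer, posted an opinion thread on Twitter about the optimization of large language models (LLMs). The thread was also retweeted by Turing Award winner Yann LeCun.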

His points in detail:

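1. Running a 13B LLM such as LLaMA on an edge device (for example, a MacBook Pro with an M1 chip) is now almost trivial. It is only a matter of time before we can run 13B+ LLMs locally on mobile devices.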

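2. The first major piece of open-source work that made all of this possible came from @ggerganov, with his llama.cpp project (a C++ port of LLaMA). The project gathered nearly 21k stars within a few weeks.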

3. Check it out here: https://github.com/ggerganov/llama.cpp

The latest optimization makes loading LLaMA roughly 10-100x faster! (The trick is to use mmap.)

See @JustineTunney's blog post for more details: https://justine.lol/mmap/

Also take a look at this cool demo: https://twitter.com/ggerganov/status/1640022482307502085

4. The saying "1 hour in machine learning is like 7 years in other tech fields" is becoming more and more true.

Conclusion: being a good software engineer is becoming increasingly important in the machine learning world.

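5. I have always thought that a solid software engineering background is an excellent starting point in machine learning (it seems obvious to me), and that is now truer than ever. For some time to come, innovation will be driven mainly by builders rather than by machine learning scientists.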
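
To make the mmap trick from point 3 more concrete, here is a minimal Python sketch of the general idea: map a weights file into memory and read only the bytes you need, instead of copying the whole file into a buffer first. This is an illustration only, not llama.cpp's actual loader (which is written in C/C++). The file layout (a flat array of little-endian float32 values) and the helper names `write_dummy_weights` and `read_weight_mmap` are assumptions made up for this example.

```python
import mmap
import os
import struct
import tempfile

FLOAT_SIZE = 4  # bytes per float32 value (assumed file layout)


def write_dummy_weights(path, count):
    """Write `count` float32 values (0.0, 1.0, 2.0, ...) as stand-in 'weights'."""
    with open(path, "wb") as f:
        for i in range(count):
            f.write(struct.pack("<f", float(i)))


def read_weight_mmap(path, index):
    """Return the float32 at position `index` without reading the whole file.

    The file is mapped read-only; the operating system pages data in on
    demand, so only the touched region has to be fetched from disk.
    """
    with open(path, "rb") as f:
        with mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) as mapped:
            (value,) = struct.unpack_from("<f", mapped, index * FLOAT_SIZE)
            return value


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "weights.bin")
        write_dummy_weights(path, 1000)
        print(read_weight_mmap(path, 42))
```

With a real multi-gigabyte model file, the same pattern means that opening the file is nearly instant and pages already in the operating system's file cache can be reused across runs, which is the kind of effect the load-time speedup above relies on.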

