Tsinghua University Releases CodeGeeX, a Large Model for Code Generation That Helps 83.4% of Users Code More Efficiently

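[Why we recommend it] This paper introduces CodeGeeX, a 13-billion-parameter multilingual model for code generation. According to the authors' user study, CodeGeeX helps 83.4% of its users code more efficiently.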

CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Evaluations on HumanEval-X

Qinkai Zheng, Xiao Xia, Xu Zou, Yuxiao Dong, Shan Wang, Yufei Xue, Zihan Wang, Lei Shen, Andi Wang, Yang Li, Teng Su, Zhilin Yang, Jie Tang

[Tsinghua University & Zhipu.AI]

[Paper link] https://arxiv.org/pdf/2303.17568.pdf

[Project link] https://github.com/THUDM/CodeGeeX

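[Abstract] Large pre-trained code generation models, such as OpenAI Codex, can generate syntactically and functionally correct code, making programmers more productive and bringing the pursuit of artificial general intelligence a step closer. In this paper, the authors introduce CodeGeeX, a multilingual model with 13 billion parameters for code generation. CodeGeeX is pre-trained on 850 billion tokens of 23 programming languages as of June 2022. The authors' extensive experiments suggest that CodeGeeX outperforms multilingual code models of similar scale on both the code generation and the code translation tasks of HumanEval-X. Building on HumanEval (Python only), the authors develop the HumanEval-X benchmark for evaluating multilingual models by hand-writing solutions in C++, Java, JavaScript, and Go. In addition, they build CodeGeeX-based extensions for Visual Studio Code, JetBrains, and Cloud Studio, which generate 4.7 billion tokens per week for tens of thousands of active users. The user study shows that CodeGeeX helps 83.4% of its users code more efficiently.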
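HumanEval-style benchmarks are conventionally scored with pass@k: the probability that at least one of k sampled solutions to a problem passes that problem's unit tests. The abstract does not name the metric, so treating pass@k as the score here is an assumption, and the function names below are hypothetical. A minimal sketch of the commonly used unbiased estimator, given n samples per problem of which c pass:

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k for one problem.

    n: number of samples generated for the problem
    c: number of those samples that passed all unit tests
    k: sampling budget being evaluated (1 <= k <= n)
    """
    if not 0 <= c <= n:
        raise ValueError("c must satisfy 0 <= c <= n")
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= n")
    # Fewer than k failing samples: every size-k subset contains a pass.
    if n - c < k:
        return 1.0
    # 1 - P(all k drawn samples fail), drawing without replacement.
    return 1.0 - comb(n - c, k) / comb(n, k)


def mean_pass_at_k(results: list[tuple[int, int]], k: int) -> float:
    """Average pass@k over problems; results holds (n, c) per problem."""
    if not results:
        raise ValueError("results must not be empty")
    return sum(pass_at_k(n, c, k) for n, c in results) / len(results)
```

For example, `pass_at_k(10, 3, 1)` is the fraction of passing samples (3 out of 10), while larger k rewards a model that solves a problem in at least one of several attempts. A multilingual benchmark would presumably report this value separately for each language.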

Posted in: 智源

March 31, 2023
