diff --git a/README.md b/README.md
index 7dd69b8..1920060 100644
--- a/README.md
+++ b/README.md
@@ -1,7 +1,7 @@

- 🏠 Homepage | 📖 Blog | 🪧 DEMO | 🤖 Download Model | 🌐 中文
+ 🏠 Homepage | 📖 Blog | 🪧 DEMO | 🤖 Download Model | 📄 Paper | 🌐 中文

 🛠 VS Code, Jetbrains, Cloud Studio supported | 👋 Join our Discord, Slack, Telegram, WeChat
@@ -26,6 +26,7 @@
  - [Crosslingual Code Translation](#crosslingual-code-translation)
  - [How to use HumanEval-X and contribute to it?](#how-to-use-humaneval-x-and-contribute-to-it)
  - [License](#license)
+ - [Citation](#citation)

 # CodeGeeX: A Multilingual Code Generation Model

@@ -43,6 +44,8 @@ We introduce CodeGeeX, a large-scale multilingual code generation model with 13

 ## News

+* **2023-03-30**: CodeGeeX paper is now available at [arxiv](https://arxiv.org/abs/2303.17568).
+
 * **2023-02-14**: CodeGeeX now supports [Cloud Studio](https://cloudstudio.net/), a fantastic web IDE from Tencent. Click on the badge on top of this page to quickly launch an environment to test CodeGeeX.

 * **2023-02-13**: Thanks a lot to [OneFlow](https://github.com/Oneflow-Inc/oneflow) team for adding oneflow backend for CodeGeeX's inference (Even faster than FasterTransformer under FP16!). Check more details [here](https://github.com/THUDM/CodeGeeX/pull/65).
@@ -201,3 +204,18 @@ Ruijie Cheng (Tsinghua), Peinan Yu (Tsinghua), Jingyao Zhang (Zhipu\.AI), Bowen
 Our code is licensed under the [Apache-2.0 license](LICENSE).

 Our model is licensed under the [license](MODEL_LICENSE).
+
+## Citation
+
+If you find our work useful, please cite:
+
+```
+@misc{zheng2023codegeex,
+      title={CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Evaluations on HumanEval-X},
+      author={Qinkai Zheng and Xiao Xia and Xu Zou and Yuxiao Dong and Shan Wang and Yufei Xue and Zihan Wang and Lei Shen and Andi Wang and Yang Li and Teng Su and Zhilin Yang and Jie Tang},
+      year={2023},
+      eprint={2303.17568},
+      archivePrefix={arXiv},
+      primaryClass={cs.LG}
+}
+```
\ No newline at end of file
diff --git a/README_zh.md b/README_zh.md
index fc3c38a..14360c9 100644
--- a/README_zh.md
+++ b/README_zh.md
@@ -1,7 +1,7 @@

- 🏠 主页 | 📖 博客 | 🪧 示例 | 🤖 模型下载 | 📒 API申请 | 🌐 English
+ 🏠 主页 | 📖 博客 | 🪧 示例 | 🤖 模型下载 | 📄 论文 | 🌐 English

 🛠 VS Code, Jetbrains, Cloud Studio 插件 | 👋 欢迎加入 微信开发者交流群
@@ -26,6 +26,7 @@
  - [多语言代码生成](#多语言代码生成)
  - [跨语言代码翻译](#跨语言代码翻译)
  - [许可证](#许可证)
+ - [引用](#引用)

 # CodeGeeX: 多语言代码生成模型
 CodeGeeX是一个具有130亿参数的多编程语言代码生成预训练模型。CodeGeeX采用华为MindSpore框架实现,在鹏城实验室“鹏城云脑II”中的192个节点(共1536个国产[昇腾910 AI处理器](https://e.huawei.com/cn/products/servers/ascend))上训练而成。截至2022年6月22日,CodeGeeX历时两个月在20多种编程语言的代码语料库(>8500亿Token)上预训练得到。CodeGeeX有以下特点:
@@ -42,6 +43,8 @@ CodeGeeX是一个具有130亿参数的多编程语言代码生成预训练模型

 ## 新闻

+* **2023-03-30**: CodeGeeX 论文已发表在 [arxiv](https://arxiv.org/abs/2303.17568)。
+
 * **2023-02-14**: CodeGeeX 现已支持 [Cloud Studio](https://cloudstudio.net/), 一款腾讯推出、十分好用的在线编辑器。单击此页面顶部的徽章可快速启动环境测试 CodeGeeX。

 * **2023-02-13**: 感谢 [OneFlow](https://github.com/Oneflow-Inc/oneflow) 加入了oneflow版推理支持,在FP16下比FasterTransformer还要快!更多优化细节请点击[这里](https://github.com/THUDM/CodeGeeX/pull/65).
@@ -198,7 +201,22 @@ HumanEval-X中每个语言的样本,包含了声明、描述和解答,它们
 如果遇到问题或有任何建议,欢迎通过邮件与我们联系[codegeex@aminer.cn](mailto:codegeex@aminer.cn).

-### 许可证
+## 许可证

 代码使用[Apache-2.0许可证](LICENSE)

 模型使用[许可证](MODEL_LICENSE)
+
+## 引用
+
+如果觉得我们的工作有帮助,欢迎引用以下论文:
+
+```
+@misc{zheng2023codegeex,
+      title={CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Evaluations on HumanEval-X},
+      author={Qinkai Zheng and Xiao Xia and Xu Zou and Yuxiao Dong and Shan Wang and Yufei Xue and Zihan Wang and Lei Shen and Andi Wang and Yang Li and Teng Su and Zhilin Yang and Jie Tang},
+      year={2023},
+      eprint={2303.17568},
+      archivePrefix={arXiv},
+      primaryClass={cs.LG}
+}
+```
\ No newline at end of file