数据集:
AhmedSSoliman/DJANGO
Django数据集是Oda等人在2015年的ASE会议上使用的数据集。
Django数据集是用于代码生成的数据集,包括16000个训练样本、1000个开发样本和1805个测试样本。每个数据点由一行Python代码和一个手动创建的自然语言描述组成。
@inproceedings{oda2015ase:pseudogen1,
author = {Oda, Yusuke and Fudaba, Hiroyuki and Neubig, Graham and Hata, Hideaki and Sakti, Sakriani and Toda, Tomoki and Nakamura, Satoshi},
title = {Learning to Generate Pseudo-code from Source Code Using Statistical Machine Translation},
booktitle = {Proceedings of the 2015 30th IEEE/ACM International Conference on Automated Software Engineering (ASE)},
series = {ASE '15},
month = {November},
year = {2015},
isbn = {978-1-5090-0025-8},
pages = {574--584},
numpages = {11},
url = {https://doi.org/10.1109/ASE.2015.36},
doi = {10.1109/ASE.2015.36},
acmid = {2916173},
publisher = {IEEE Computer Society},
address = {Lincoln, Nebraska, USA}
}