数据集:
xcopa
任务:
子任务:
multiple-choice-qa计算机处理:
multilingual语言创建人:
expert-generated批注创建人:
expert-generated源数据集:
extended|copa许可:
XCOPA(Causal Commonsense Reasoning跨语言选择可行替代)数据集是一个评估机器学习模型在不同语言之间迁移常识推理能力的基准。该数据集是英语COPA(Roemmele等人,2011)的翻译和重新注释,涵盖了来自全球11个家族的11种语言以及多个地区。该数据集具有挑战性,因为它要求同时掌握世界知识和泛化到新语言的能力。有关XCOPA的创建细节和基线实现的所有详细信息,请参阅论文。
Xcopa语言et
"验证"的示例如下所示。
{
"changed": false,
"choice1": "Ta kallas piima kaussi.",
"choice2": "Ta kaotas oma isu.",
"idx": 1,
"label": 1,
"premise": "Tüdruk leidis oma helveste seest putuka.",
"question": "effect"
}
ht
"验证"的示例如下所示。
{
"changed": false,
"choice1": "Ta kallas piima kaussi.",
"choice2": "Ta kaotas oma isu.",
"idx": 1,
"label": 1,
"premise": "Tüdruk leidis oma helveste seest putuka.",
"question": "effect"
}
id
"验证"的示例如下所示。
{
"changed": false,
"choice1": "Ta kallas piima kaussi.",
"choice2": "Ta kaotas oma isu.",
"idx": 1,
"label": 1,
"premise": "Tüdruk leidis oma helveste seest putuka.",
"question": "effect"
}
it
"验证"的示例如下所示。
{
"changed": false,
"choice1": "Ta kallas piima kaussi.",
"choice2": "Ta kaotas oma isu.",
"idx": 1,
"label": 1,
"premise": "Tüdruk leidis oma helveste seest putuka.",
"question": "effect"
}
qu
"验证"的示例如下所示。
{
"changed": false,
"choice1": "Ta kallas piima kaussi.",
"choice2": "Ta kaotas oma isu.",
"idx": 1,
"label": 1,
"premise": "Tüdruk leidis oma helveste seest putuka.",
"question": "effect"
}
数据字段在所有拆分中都是相同的。
et| name | validation | test |
|---|---|---|
| et | 100 | 500 |
| ht | 100 | 500 |
| id | 100 | 500 |
| it | 100 | 500 |
| qu | 100 | 500 |
Creative Commons Attribution 4.0 International (CC BY 4.0) .
@article{ponti2020xcopa,
title={{XCOPA: A} Multilingual Dataset for Causal Commonsense Reasoning},
author={Edoardo M. Ponti, Goran Glava
{s}, Olga Majewska, Qianchu Liu, Ivan Vuli'{c} and Anna Korhonen},
journal={arXiv preprint},
year={2020},
url={https://ducdauge.github.io/files/xcopa.pdf}
}
@inproceedings{roemmele2011choice,
title={Choice of plausible alternatives: An evaluation of commonsense causal reasoning},
author={Roemmele, Melissa and Bejan, Cosmin Adrian and Gordon, Andrew S},
booktitle={2011 AAAI Spring Symposium Series},
year={2011},
url={https://people.ict.usc.edu/~gordon/publications/AAAI-SPRING11A.PDF},
}
感谢 @patrickvonplaten , @lewtun , @thomwolf 提供此数据集。