Dataset:

art

Tasks:

multiple-choice

text-classification

Sub-tasks:

natural-language-inference

Languages:

Multilinguality:

monolingual

Size:

100K<n<1M

Language creators:

found

Annotation creators:

crowdsourced

Source datasets:

original

Preprint:

arxiv:1908.05739

Other:

abductive-natural-language-inference

License:

license:unknown

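Dataset Card for "art"

Dataset Summary

ART consists of over 20,000 commonsense narrative contexts and 200,000 explanations.

It is the Abductive Natural Language Inference (ANLI) dataset from AI2.

Supported Tasks and Leaderboards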

More Information Needed

Languages

More Information Needed

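Dataset Structure

Data Instances

anli

Size of downloaded dataset files: 5.12 MB
Size of the generated dataset: 34.36 MB
Total amount of disk used: 39.48 MB

An example from 'train' looks as follows.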

{
    "hypothesis_1": "Chad's car had all sorts of other problems besides alignment.",
    "hypothesis_2": "Chad's car had all sorts of benefits other than being sexy.",
    "label": 1,
    "observation_1": "Chad went to get the wheel alignment measured on his car.",
    "observation_2": "The mechanic provided a working alignment with new body work."
}

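Data Fields

The data fields are the same across all splits.

anli

observation_1: a string feature.
observation_2: a string feature.
hypothesis_1: a string feature.
hypothesis_2: a string feature.
label: a classification label, with possible values including 0 (0), 1 (1), 2 (2).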
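The fields above can be combined into a single narrative once a hypothesis is chosen. The following is a minimal sketch, not part of the dataset's tooling: the names `ArtExample`, `chosen_hypothesis` and `as_narrative` are hypothetical, and the mapping "label 1 selects hypothesis_1, label 2 selects hypothesis_2" is an assumption that this card does not state explicitly. It uses the 'train' example shown in this card.

```python
from typing import TypedDict


class ArtExample(TypedDict):
    """One record with the five fields listed in this card."""
    observation_1: str
    observation_2: str
    hypothesis_1: str
    hypothesis_2: str
    label: int


def chosen_hypothesis(example: ArtExample) -> str:
    # ASSUMPTION: label 1 points at hypothesis_1 and label 2 at hypothesis_2.
    # The card only lists the possible label values, not their meaning.
    if example["label"] == 1:
        return example["hypothesis_1"]
    if example["label"] == 2:
        return example["hypothesis_2"]
    raise ValueError(f"no hypothesis is selected by label {example['label']!r}")


def as_narrative(example: ArtExample) -> str:
    # Place the chosen hypothesis between the two observations.
    parts = [
        example["observation_1"],
        chosen_hypothesis(example),
        example["observation_2"],
    ]
    return " ".join(parts)


# The 'train' example from this card.
example: ArtExample = {
    "hypothesis_1": "Chad's car had all sorts of other problems besides alignment.",
    "hypothesis_2": "Chad's car had all sorts of benefits other than being sexy.",
    "label": 1,
    "observation_1": "Chad went to get the wheel alignment measured on his car.",
    "observation_2": "The mechanic provided a working alignment with new body work.",
}

print(as_narrative(example))
```

Under the stated assumption, the helper raises `ValueError` for label 0 instead of guessing, since the card gives no interpretation for that value.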

Data Splits

name	train	validation
anli	169654	1532
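As a quick sanity check, the split counts in the table can be compared against the size category (100K<n<1M) given in the card's metadata. This is a small arithmetic sketch; the variable names are illustrative only.

```python
# Split sizes copied from the table above.
SPLIT_SIZES = {"train": 169654, "validation": 1532}

total = sum(SPLIT_SIZES.values())
validation_share = SPLIT_SIZES["validation"] / total

# The card's metadata lists the size category as 100K<n<1M.
in_size_bucket = 100_000 < total < 1_000_000

print(f"total examples: {total}")
print(f"validation share: {validation_share:.2%}")
print(f"within 100K<n<1M: {in_size_bucket}")
```

The validation split is under one percent of the data, so metrics computed on it rest on a comparatively small sample.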

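Dataset Creation

Curation Rationale

More Information Needed

Source Data

Initial Data Collection and Normalization

More Information Needed

Who are the source language producers?

More Information Needed

Annotations

Annotation process

More Information Needed

Who are the annotators?

More Information Needed

Personal and Sensitive Information

More Information Needed

Considerations for Using the Data

Additional Information

Dataset Curators

More Information Needed

Licensing Information

More Information Needed

Citation Information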

@inproceedings{Bhagavatula2020Abductive,
  title={Abductive Commonsense Reasoning},
  author={Chandra Bhagavatula and Ronan Le Bras and Chaitanya Malaviya and Keisuke Sakaguchi and Ari Holtzman and Hannah Rashkin and Doug Downey and Wen-tau Yih and Yejin Choi},
  booktitle={International Conference on Learning Representations},
  year={2020},
  url={https://openreview.net/forum?id=Byg1v1HKDB}
}

Contributions

Thanks to @patrickvonplaten, @thomwolf, @mariamabarham, @lewtun, @lhoestq for adding this dataset.
