数据集:
conv_ai_3
子任务:
text-scoring语言:
计算机处理:
monolingual大小:
10K<n<100K语言创建人:
crowdsourced批注创建人:
crowdsourced源数据集:
original预印本库:
arxiv:2009.11352许可:
Conv AI 3挑战赛是2020年在Search-oriented Conversational AI (SCAI) EMNLP研讨会中举办的。对话系统的主要目标是针对用户的请求返回适当的回答。然而,某些用户请求可能存在歧义。在信息检索(IR)设置中,这种情况主要通过多样化的搜索结果页来处理。但是在对话设置中更具挑战性。因此,我们的目标是研究对话设置下的以下情况:
[需要更多信息]
[需要更多信息]
这里是数据集中的几个示例:
{'topic_id': 8,
'facet_id': 'F0968',
'initial_request': 'I want to know about appraisals.',
'topic_desc': 'Find information about the appraisals in nearby companies.',
'clarification_need': 2,
'question_id': 'F0001',
'question': 'are you looking for a type of appraiser',
'answer': 'im looking for nearby companies that do home appraisals',
'facet_desc': 'Get the TYPE of Appraisals'
'conversation_context': [],
'context_id': 968}
{'topic_id': 8,
'facet_id': 'F0969',
'initial_request': 'I want to know about appraisals.',
'topic_desc': 'Find information about the type of appraisals.',
'clarification_need': 2,
'question_id': 'F0005',
'question': 'are you looking for a type of appraiser',
'facet_desc': 'Get the TYPE of Appraisals'
'answer': 'yes jewelry',
'conversation_context': [],
'context_id': 969}
{'topic_id': 293,
'facet_id': 'F0729',
'initial_request': 'Tell me about the educational advantages of social networking sites.',
'topic_desc': 'Find information about the educational benefits of the social media sites',
'clarification_need': 2,
'question_id': 'F0009'
'question': 'which social networking sites would you like information on',
'answer': 'i don have a specific one in mind just overall educational benefits to social media sites',
'facet_desc': 'Detailed information about the Networking Sites.'
'conversation_context': [{'question': 'what level of schooling are you interested in gaining the advantages to social networking sites', 'answer': 'all levels'}, {'question': 'what type of educational advantages are you seeking from social networking', 'answer': 'i just want to know if there are any'}],
'context_id': 976573}
[需要更多信息]
[需要更多信息]
[需要更多信息]
谁是源语言制作方?[需要更多信息]
[需要更多信息]
注释者是谁?[需要更多信息]
[需要更多信息]
[需要更多信息]
[需要更多信息]
[需要更多信息]
[需要更多信息]
[需要更多信息]
@misc{aliannejadi2020convai3,title={ConvAI3: Generating Clarifying Questions for Open-Domain Dialogue Systems (ClariQ)},author={Mohammad Aliannejadi and Julia Kiseleva and Aleksandr Chuklin and Jeff Dalton and Mikhail Burtsev},year={2020},eprint={2009.11352},archivePrefix={arXiv},primaryClass={cs.CL}}
感谢 @rkc007 添加此数据集。