数据集:
metaeval/sts-companion
https://ixa2.si.ehu.eus/stswiki/index.php/STSbenchmark
STS基准测试的伴随数据集包括我们在2012年至2017年SemEval语境中使用的其他英语数据集。作者汇编了两个数据集,一个数据集包含与机器翻译评估相关的句子对,另一个数据集包含用于领域自适应研究的其他数据集。
@inproceedings{cer-etal-2017-semeval,
title = "{S}em{E}val-2017 Task 1: Semantic Textual Similarity Multilingual and Crosslingual Focused Evaluation",
author = "Cer, Daniel and
Diab, Mona and
Agirre, Eneko and
Lopez-Gazpio, I{\~n}igo and
Specia, Lucia",
booktitle = "Proceedings of the 11th International Workshop on Semantic Evaluation ({S}em{E}val-2017)",
month = aug,
year = "2017",
address = "Vancouver, Canada",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/S17-2001",
doi = "10.18653/v1/S17-2001",
pages = "1--14",
}