数据集:

metaeval/sts-companion

英文

https://ixa2.si.ehu.eus/stswiki/index.php/STSbenchmark

STS基准测试的伴随数据集包括我们在2012年至2017年SemEval语境中使用的其他英语数据集。作者汇编了两个数据集,一个数据集包含与机器翻译评估相关的句子对,另一个数据集包含用于领域自适应研究的其他数据集。

@inproceedings{cer-etal-2017-semeval,
    title = "{S}em{E}val-2017 Task 1: Semantic Textual Similarity Multilingual and Crosslingual Focused Evaluation",
    author = "Cer, Daniel  and
      Diab, Mona  and
      Agirre, Eneko  and
      Lopez-Gazpio, I{\~n}igo  and
      Specia, Lucia",
    booktitle = "Proceedings of the 11th International Workshop on Semantic Evaluation ({S}em{E}val-2017)",
    month = aug,
    year = "2017",
    address = "Vancouver, Canada",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/S17-2001",
    doi = "10.18653/v1/S17-2001",
    pages = "1--14",
}