数据集:
bigbio/tmvar_v1
该数据集包含500篇手动注释的PubMed文章,涵盖了各种类型的突变提及。该数据集仅用于命名实体识别任务。数据集分为训练集(334篇)和测试集(166篇)。
@article{wei2013tmvar,
title={tmVar: a text mining approach for extracting sequence variants in biomedical literature},
author={Wei, Chih-Hsuan and Harris, Bethany R and Kao, Hung-Yu and Lu, Zhiyong},
journal={Bioinformatics},
volume={29},
number={11},
pages={1433--1439},
year={2013},
publisher={Oxford University Press}
}