数据集:
bigbio/ehr_rel
EHR-Rel是一个新颖的开源1生物医学概念相关性数据集,包含了3630对概念,是现有数据集的六倍。与以往的工作不同,该数据集是从电子健康记录(EHRs)中采样得到的,以确保概念与EHR概念检索任务相关。对数据集中概念的详细分析显示,其覆盖范围远远超过现有数据集。
@inproceedings{schulz-etal-2020-biomedical,
title = {Biomedical Concept Relatedness {--} A large {EHR}-based benchmark},
author = {Schulz, Claudia and
Levy-Kramer, Josh and
Van Assel, Camille and
Kepes, Miklos and
Hammerla, Nils},
booktitle = {Proceedings of the 28th International Conference on Computational Linguistics},
month = {dec},
year = {2020},
address = {Barcelona, Spain (Online)},
publisher = {International Committee on Computational Linguistics},
url = {https://aclanthology.org/2020.coling-main.577},
doi = {10.18653/v1/2020.coling-main.577},
pages = {6565--6575},
}