Dataset:
bigbio/gad
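The GAD dataset identifies associations between genes and diseases through a semi-automatic annotation procedure based on the Genetic Association Database.
The dataset's homepage is no longer reachable, but its link is recorded here. The data was originally downloaded from a Google Drive folder (the one linked in the BLURB benchmark data download script). For more reliable download and access, we host the data on the Hugging Face Hub.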
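
Since the data is hosted on the Hugging Face Hub, it can presumably be loaded with the `datasets` library. The following is a minimal sketch, not an official loading recipe: the helper `load_gad` and its `config_name` parameter are hypothetical names introduced here, and the card above does not state which configurations exist or which `datasets` version is required.

```python
from typing import Optional

# Repository identifier as given in the dataset card.
DATASET_ID = "bigbio/gad"


def load_gad(config_name: Optional[str] = None):
    """Load GAD from the Hugging Face Hub (requires network access).

    `config_name` is forwarded as the `name` argument of `load_dataset`.
    Assumption: the available configuration names are not listed in the
    card, so the default (None) lets the library choose.
    """
    # Imported lazily so that defining this helper does not require
    # the `datasets` package to be installed.
    from datasets import load_dataset

    # Assumption: depending on the installed `datasets` version, a
    # repository backed by a loading script may need additional arguments.
    return load_dataset(DATASET_ID, name=config_name)
```

Calling `load_gad()` should return the dataset object produced by `load_dataset`; printing it is a quick way to inspect the available splits and columns before relying on any particular field.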
@article{Bravo2015,
doi = {10.1186/s12859-015-0472-9},
url = {https://doi.org/10.1186/s12859-015-0472-9},
year = {2015},
month = feb,
publisher = {Springer Science and Business Media {LLC}},
volume = {16},
number = {1},
author = {{\`{A}}lex Bravo and Janet Pi{\~{n}}ero and N{\'{u}}ria Queralt-Rosinach and Michael Rautschka and Laura I Furlong},
title = {Extraction of relations between genes and diseases from text and large-scale data analysis: implications for translational research},
journal = {{BMC} Bioinformatics}
}