Humangenphen: scalable extraction of human genetic and phenotypic data from peer-reviewed literature

This project will extend and integrate the participants' existing text-mining tools into a reusable workflow for extracting human genotype-phenotype associations from the full text, tables, and supplementary materials of scientific articles. The extracted data will be imported into GWAS Central and DisGeNET, accelerating FAIR access to pioneering findings such as COVID-19 GWAS. An annotated GWAS corpus based on full-text articles will also be developed, enabling the evaluation of existing and future text-mining methods for extracting genotype-phenotype associations and metadata. The project is funded by ELIXIR for the period 2022-23.
