Primary tabs

Boise State University logo

Boise State University is a Carnegie-classified doctoral research university and our students have opportunities to work with talented and accomplished faculty on research, even as undergraduates. Our students go into the workforce better prepared, with expertise outside of their major by taking advantage of opportunities such as certificates in business anthropology, entrepreneurship, or cybersecurity.

Learn more at


GNU Affero General Public License v3.0

Other Access

The information on this page (the dataset metadata) is also available in these formats.


via the DKAN API


v0.1.0 release of the G2PMineR Package as described in Genes manuscript "G2PMineR: A Genome to Phenome Literature Review Approach" (Wojahn et al., 2021). For further information and details about the package release visit the G2PMineR Project Webpage.

There is a gap in the conceptual framework linking genes to phenotypes (G2P) for non-model organisms, as most non-model organisms do not yet have genomic resources readily available. To address this, researchers often perform literature reviews to understand G2P linkages by curating a list of likely gene candidates, hinging upon other studies already conducted in closely related systems. Sifting through hundreds to thousands of articles is a cumbersome task that slows down the scientific process and may introduce bias into a study. To fill this gap, we created G2PMineR, a free and open-source literature mining tool developed specifically for G2P research. This package uses automation to make the G2P review process efficient and unbiased, while also generating hypothesized associations between genes and phenotypes within a taxonomical framework. We applied the package to a literature review for drought-tolerance in plants. The analysis provides biologically meaningful the results within the known framework of drought tolerance in plants. Overall, the package is useful for conducting literature reviews for genome-to-phenome projects and also has broad appeal to scientists investigating a wide range of study systems as it can conduct analyses under the auspices of three different kingdoms (Plantae, Animalia, and Fungi).

Data Use
GNU Affero General Public License v3.0
Recommended Citation
Wojahn JMA, Buerki S. 2021. G2PMineR (Version 0.1.0). GitHub.

US National Science Foundation and Idaho EPSCoR: OIA-1757324

Release Date
Homepage URL
Temporal Coverage
Monday, June 1, 2020 - 00:00 to Wednesday, February 3, 2021 - 00:00
English (United States)
GNU Affero General Public License v3.0
John Michael Adrian Wojahn and Sven Buerki
Contact Name
Michael Wojahn
Contact Email
Public Access Level
Data available on:: 
Sunday, June 6, 2021