Partnerships to Improve FAIRness
Authors:
Kjiersten Fagnan1,2* ([email protected], PI), Chris Beecroft1, Neil Byers1, Chuck Parker1, Emiley Eloe-Fadrosh1,2, Elisha Wood-Charlson3, A. J. Ireland3, Jeffrey Johnson4, Nigel Mouncey1
Institutions:
1DOE Joint Genome Institute; 2National Microbiome Data Collaborative; 3KBase, Lawrence Berkeley National Laboratory; 4Cohere Consulting, LLC
URLs:
Goals
DOE Joint Genome Institute, National Microbiome Data Collaborative, and KBase are collaborating to make data discovery and transfers easier.
Abstract
Data discoverability is a challenge. BER supports the generation of petabytes of high-quality environmental data leading to exciting discoveries. Once the data are archived in national repositories the return on the original investment is amplified through reuse. In order to create more opportunities for data to be identified and utilized by the scientific community, organizations have pursued implementation of the FAIR (findable, accessible, interoperable, reusable) principles. Data and metadata quality are improving, however biological data requires more structure to ensure interoperability across studies.
DOE JGI, NMDC, and KBase are collaborating on software infrastructure including common data models, application programming interfaces, and transfer protocols to improve data flow between resources. The group’s shared vision is a consistent data discovery experience across BER platforms. This talk will share details of the team’s efforts to work toward this vision, including JGI’s planned reuse of the NMDC Submission Portal, KBase and JGI’s co-development of the Data Transfer Service (DTS), and exploration of data reuse through JGI’s Data Citation Explorer.
Image
JGI Known Data and Storage Usage. Genetics and microbiology comprise the largest share of DOE Joint Genome Institute's data and storage usage. Altogether, the majority of data and storage usage is attributed to the biosciences. [Courtesy DOE Joint Genome Institute]
Funding Information
The work conducted by the DOE Joint Genome Institute (ror.org/04xm1d337), a DOE Office of Science user facility, is supported by the DOE Office of Science operated under contract number DE-AC02-05CH11231.