A Unified Data Infrastructure for Biological and Environmental Research
Authors:
Kerstin Kleese van Dam* ([email protected], PI)
Institutions:
Brookhaven National Laboratory; Chair, BERAC Subcommittee and Working Group on Unified Data Infrastructure
Abstract
In October 2022, the BER Advisory Committee (BERAC) received a charge letter from the DOE Office of Science director requesting a review of existing capabilities in data management and infrastructure relevant to BER science. The charge also requested a recommended strategy for next-generation data management and analysis within a unified framework. Further goals included identifying new science opportunities that could be enabled by increased integration of BER’s facilities while considering advances in artificial intelligence and machine learning (AI/ML).
The charge also asked BERAC to examine synergistic investments within DOE and at other agencies, as well as the impact of a more unified data infrastructure on the scientific workforce. To address these goals, the appointed subcommittee established five working groups focusing on (1) environmental science; (2) biological science; (3) BER data infrastructure services; (4) workforce development, inclusion, and diversity; and (5) data infrastructure technologies. The subcommittee organized a two-day virtual community workshop that included discussions on new science opportunities enabled by a unified data infrastructure, barriers to broader inclusion of minorities, support for early career scientists, and potential unified data infrastructure solutions for BER. The subcommittee and its working groups evaluated the results of the workshop and of a public request for information and summarized their findings in a report. The talk will outline the initial science opportunities that a new unified data infrastructure for BER sciences could enable. It will also review how a unified data infrastructure may increase the accessibility of BER science and support early career and minority scientists. Finally, it will discuss existing data infrastructure capabilities available at BER and elsewhere and outline the subcommittee's recommendations.
A Unified Data Infrastructure for Biological and Environmental Research. This 2024 report details a review of existing capabilities in data management and infrastructure relevant to BER science and provides a recommended strategy for next-generation data management and analysis within a unified framework. Read the full BERAC report here.