Principles of the Battery Data Genome


Logan Ward, Susan Babinec, Eric J. Dufek, David A. Howey, Venkatasubramanian Viswanathan, Muratahan Aykol, David A. C. Beck, Benjamin Blauszik, Bor-Rong Chen, George Crabtree, Simon Clark, Valerio De Angelis, Philipp André Dechant, Matthieu Dubarry, Erica E. Eggleton, Donal P. Finegan, Ian Foster, Chirranjeevi Balaji, Patrick K. Herring, Victor W. Hu, Noah H. Paulson, Yuliya Preger, Dirk Uwe Sauer, Kandler Smith, Seth W. Snyder, Shashank Sripad, Tanvir R. Tanim, Linnette Teo, Joule, 3 October 2022.



Batteries are central to modern society. They are no longer just a convenience but a critical enabler of the transition to a resilient, low-carbon economy. Battery development capabilities are provided by communities spanning materials discovery, battery chemistry and electrochemistry, cell and pack design, scale-up, manufacturing, and deployments. Despite their relative maturity, data-science practices among these diverse groups are far behind the state of the art in other fields, which have demonstrated an ability to significantly improve innovation and economic impact. The negative consequences of the present paradigm include incremental improvements but few breakthroughs, significant manufacturing uncertainties, and cascading investment risks that collectively slow deployments. The primary roadblock to a battery-data-science renaissance is the requirement for large amounts of high-quality data, which are not available in the current fragmented ecosystem. Here, we identify gaps and propose principles that enable the solution by building a robust community of data hubs with standardized practices and flexible sharing options that will seed advanced tools spanning innovation to deployment. Precedents are offered that demonstrate that both public good and immense economic gains will arise from sharing valuable battery data. The proposed Battery Data Genome looks to broadly transform innovations and revolutionize their translation from research to societal impact.