Enabling Petascale Science: Data Management, Troubleshooting and Scalable Science Services
|Title||Enabling Petascale Science: Data Management, Troubleshooting and Scalable Science Services|
|Publication Type||Conference Paper|
|Year of Publication||2007|
|Authors||Baranovski, A; Beattie, K; Bharathi, S; Boverhof, J; Bresnahan, J; Chervenak, AL; Foster, IT; Freeman, T; Gunter, D; Keahey, K; Kesselman, C; Kettimuthu, R; Leroy, N; Link, M; Livny, M; Madduri, R; Oleynik, G; Pearlman, L; Schuler, R; Tierney, B|
DOE science applications in areas as diverse as astrophysics, biology, chemistry, combustion, fusion, high energy physics, nanoscience, and nuclear physics generate and analyze up to petabytes of data per year. The Center for Enabling Distributed Petascale Science (CEDPS) project works with DOE application science communities to provide the services required to move data sets to where they are needed, to enable analysis near the data, and to detect and recover from failures in the distributed environment. Each of these tasks is challenging in a petascale environment because it requires coordinating numerous shared resources, including CPUs, storage, and networks.