• A fault-tolerance protocol for parallel applications with communication imbalance 

      Meneses-Rojas, Esteban (IEEE, 2015)
      The predicted failure rates of future supercomputers loom over the groundbreaking research large machines are expected to foster. Therefore, resilient extreme-scale applications are an absolute necessity to effectively use ...
    • Camel: collective-aware message logging 

      Kalé, Laxmikant; Meneses-Rojas, Esteban (Kluwer Academic Publishers, 2015-03)
      The continuous progress in the performance of supercomputers has made possible the understanding of many fundamental problems in science. Simulation, the third scientific pillar, constantly demands more powerful machines ...
    • Power, Reliability, Performance: One System to Rule Them All 

      Acun, Bilge; Langer, Akhil; Meneses-Rojas, Esteban; Menon, Harshitha; Sarood, Osman; Totoni, Ehsan; Kalé, Laxmikant (IEEE, 2016)
      In a design based on the Charm++ parallel programming framework, an adaptive runtime system dynamically interacts with a data center's resource manager to control power ...
    • Simulation-based evaluation of school reopening strategies during COVID-19: a case study of São Paulo, Brazil 

      Cruz, E. H. M.; Maciel, J. M.; Clozato, C.; Serpa, M. S.; Navaux, P. O. A.; Meneses-Rojas, Esteban; Abdalah, M.; Diener, M. (Instituto Tecnológico de Costa Rica, 2021)
      During the COVID-19 pandemic, many countries opted for strict public health measures, including closing schools. After some time, they have started relaxing some of those restrictions. To avoid overwhelming health systems, ...
    • Using migratable objects to enhance fault tolerance schemes in supercomputers 

      Mendes, Celso; Meneses-Rojas, Esteban; Ni, Xiang; Zheng, Gengbin (IEEE Computer Society, 2015-07)
      Supercomputers have seen an exponential increase in their size in the last two decades. Such a high growth rate is expected to take us to exascale in the timeframe 2018-2022. But, to bring a productive exascale environment ...