Mostrar el registro sencillo del ítem

dc.contributor.authorMeneses-Rojas, Esteban
dc.date.accessioned2018-04-06T14:33:39Z
dc.date.available2018-04-06T14:33:39Z
dc.date.issued2015
dc.identifierhttps://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7379847es
dc.identifier.citationProceedings - Symposium on Computer Architecture and High Performance Computing. Vol 2016 - January p. 162-169es
dc.identifier.urihttps://hdl.handle.net/2238/9676
dc.descriptionArticuloes
dc.description.abstractThe predicted failure rates of future supercomputers loom the groundbreaking research large machines are expected to foster. Therefore, resilient extreme-scale applications are an absolute necessity to effectively use the new generation of supercomputers. Rollback-recovery techniques have been traditionally used in HPC to provide resilience. Among those techniques, message logging provides the appealing features of saving energy, accelerating recovery, and having low performance penalty. Its increased memory consumption is, however, an important downside. This paper introduces memory-constrained message logging (MCML), a general framework for decreasing the memory footprint of message-logging protocols. In particular, we demonstrate the effectiveness of MCML in maintaining message logging feasible for applications with substantial communication imbalance. This type of applications appear in many scientific fields. We present experimental results with several parallel codes running on up to 4,096 cores. Using those results and an analytical model, we predict MCML can reduce execution time up to 25% and energy consumption up to 15%, at extreme scale.es
dc.language.isoenges
dc.publisherIEEEes
dc.relation.hasversion10.1109/SBAC-PAD.2015.25es
dc.source27th International Symposium on Computer Architecture and High Performance Computinges
dc.subjectMensajeses
dc.subjectComunicaciónes
dc.subjectMemoriaes
dc.subjectMCMLes
dc.subjectResearch Subject Categories::TECHNOLOGY::Information technology::Computer engineeringes
dc.titleA fault-tolerance protocol for parallel applications with communication imbalancees
dc.typeartículo originales


Ficheros en el ítem

Thumbnail

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem