BEGIN:VCALENDAR
VERSION:2.0
PRODID:Linklings LLC
BEGIN:VTIMEZONE
TZID:America/Chicago
X-LIC-LOCATION:America/Chicago
BEGIN:DAYLIGHT
TZOFFSETFROM:-0600
TZOFFSETTO:-0500
TZNAME:CDT
DTSTART:19700308T020000
RRULE:FREQ=YEARLY;BYMONTH=3;BYDAY=2SU
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0500
TZOFFSETTO:-0600
TZNAME:CST
DTSTART:19701101T020000
RRULE:FREQ=YEARLY;BYMONTH=11;BYDAY=1SU
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTAMP:20181221T160742Z
LOCATION:D221
DTSTART;TZID=America/Chicago:20181113T154500
DTEND;TZID=America/Chicago:20181113T160000
UID:submissions.supercomputing.org_SC18_sess279_drs124@linklings.com
SUMMARY:High Performance Middlewares for Next Generation Architectures: Ch
 allenges and Solutions
DESCRIPTION:Doctoral Showcase\nArchitectures, Memory, Runtime Systems, Sto
 rage, Workshop Reg Pass, Tutorial Reg Pass, Tech Program Reg Pass, Exhibit
 s Reg Pass, Exhibits - Exhibit Hall Only Reg Pass, Doctoral Showcase\n\nHi
 gh Performance Middlewares for Next Generation Architectures: Challenges a
 nd Solutions\n\nChakraborty, Panda\n\nThe emergence of modern multi-/many-
 core architectures and high-performance interconnects has fueled the grow
 th of large-scale supercomputing clusters. Due to this unprecedented growt
 h in scale and compute density, high performance computing (HPC) middlewar
 es now face a plethora of new challenges to solve in order to extract the 
 best performance from such systems. In this work, we study four such chall
 enges - a) launching and bootstrapping jobs on very large scale clusters, 
 b) contention in collective communication, c) point-to-point communication
  protocols, and d) scalable fault-tolerance and recovery - and propose eff
 icient solutions for them. The proposed solutions have been implemented on
  MVAPICH2, a popular MPI and PGAS runtime used by scientists and HPC clust
 ers around the world.
URL:https://sc18.supercomputing.org/presentation/?id=drs124&sess=sess279
END:VEVENT
END:VCALENDAR

