Predictive and distributed routing balancing, an application-aware approach

Carlos Núñez Castillo*, Diego Lugones, Daniel Franco, Emilio Luque, Martin Collier

*Corresponding author for this work

Research output: Contribution to journalArticleResearchpeer-review

1 Citation (Scopus)


The interconnection design in computing clusters and data centers is expected to change significantly in the near future to sustain the increasing communication demand at controlled capitalization and operational cost. In particular, a shift from typical and expensive full-bisection bandwidth interconnects (which safely cover the worst communication cases) to application oriented designs (which may provide cost-efficient data movement at larger system scales) is devised in academic research and industry initiatives. Having information of communication dynamics of applications (e.g. repetitiveness, computing and communication phases, traffic pattern and bandwidth, etc. ) allows for efficiently managing and provisioning of network resources at reduced cost. This paper presents an Application-Aware Predictive and Distributed Routing Balancing technique (PR-DRB), a new method that controls network inefficiencies based on communication patterns of applications and speculative routing, PR-DRB monitors increments in the communication latency and, then, dynamically re-distributes the network traffic over multiple paths (path expansion) to deal with load unbalances. Additionally, PR-DRB stores the number of paths used to balance the traffic (solution) and links it to the application's pattern that caused the unbalance (problem). This information allows PR-DRB to respond to similar situations in repetitive patterns, quickly converging to a stable solution. Evaluation results show latency and completion time reductions of up to 37% for experiments conducted on 64 nodes executing the NAS benchmarks and the Lammps application.

Original languageEnglish
Pages (from-to)179-188
Number of pages10
JournalProcedia Computer Science
Publication statusPublished - 2013


  • Application-aware routing
  • High performance computing
  • Hpc clusters
  • Interconnection networks
  • Parallel scientific applications
  • Predictive routing


Dive into the research topics of 'Predictive and distributed routing balancing, an application-aware approach'. Together they form a unique fingerprint.

Cite this