TY - JOUR
T1 - Predictive and distributed routing balancing for high speed interconnection networks
AU - Castillo, Carlos Núñez
AU - Lugones, Diego
AU - Franco, Daniel
AU - Luque, Emilio
PY - 2011
Y1 - 2011
N2 - Current parallel applications in parallel computing systems require an interconnection network to provide low and bounded communication delays. Communication characteristics such as traffic pattern and communication load change over time and, eventually, they may exceed available network capacity causing congestion and performance degradation. Congestion control based on adaptive routing should be applied in order to adapt quickly to changing traffic conditions. Studies on a vast range of parallel applications show repetitive behavior and can be characterized by a set of representative phases. This work presents a Predictive and Distributed Routing Balancing technique (PR-DRB) to control network congestion based on adaptive traffic distribution. PR-DRB uses speculative routing based on application repetitiveness. PR-DRB monitors messages latencies on routers and logs solutions to congestion, to quickly respond in future similar situations. Experimental results show that the predictive approach could be used to improve performance.
AB - Current parallel applications in parallel computing systems require an interconnection network to provide low and bounded communication delays. Communication characteristics such as traffic pattern and communication load change over time and, eventually, they may exceed available network capacity causing congestion and performance degradation. Congestion control based on adaptive routing should be applied in order to adapt quickly to changing traffic conditions. Studies on a vast range of parallel applications show repetitive behavior and can be characterized by a set of representative phases. This work presents a Predictive and Distributed Routing Balancing technique (PR-DRB) to control network congestion based on adaptive traffic distribution. PR-DRB uses speculative routing based on application repetitiveness. PR-DRB monitors messages latencies on routers and logs solutions to congestion, to quickly respond in future similar situations. Experimental results show that the predictive approach could be used to improve performance.
KW - application aware routing
KW - Interconnection networks
KW - parallel applications
KW - predictive routing
UR - http://www.scopus.com/inward/record.url?scp=80955126846&partnerID=8YFLogxK
U2 - 10.1109/CLUSTER.2011.66
DO - 10.1109/CLUSTER.2011.66
M3 - Article
AN - SCOPUS:80955126846
SN - 1552-5244
SP - 552
EP - 556
JO - Proceedings - IEEE International Conference on Cluster Computing, ICCC
JF - Proceedings - IEEE International Conference on Cluster Computing, ICCC
ER -