TY - GEN
T1 - A novel fault-tolerant router architecture for network-on-chip reconfiguration
AU - Yan, Pengzhan
AU - Jiang, Shixiong
AU - Sridhar, Ramalingam
N1 - Publisher Copyright:
© 2015 IEEE.
PY - 2016/2/12
Y1 - 2016/2/12
N2 - In Network-on-Chip (NoC) architectures, a faulty router can isolate a functional processing element (PE) from other nodes, severely restricting the performance of the system. This paper presents a fault-tolerant router architecture that can avoid PE isolation even if the router fails. In this design, we connect the local port of the router with one of the other four ports through a fault tolerant control unit that works with the fault detection signal. If the router fails, the control unit will turn on and connect the PE with the neighboring router directly, thus protecting the system functioning. A revised XY-Routing algorithm is also presented to achieve the NoC reconfiguration when router fails. Theoretical analysis show that for a 10∗10 mesh NoC, the reliability at year 10 is 20 times better than NoC implemented with traditional router. Also our design has greatly improved value for mean time to failure (MTTF). In video and DSP applications, simulation results show better power control with faulty PEs. The new architecture also has other advantages over annealing re-mapping algorithm. Our design has only 12% area overhead, which is significantly better than other approaches to deal with faults.
AB - In Network-on-Chip (NoC) architectures, a faulty router can isolate a functional processing element (PE) from other nodes, severely restricting the performance of the system. This paper presents a fault-tolerant router architecture that can avoid PE isolation even if the router fails. In this design, we connect the local port of the router with one of the other four ports through a fault tolerant control unit that works with the fault detection signal. If the router fails, the control unit will turn on and connect the PE with the neighboring router directly, thus protecting the system functioning. A revised XY-Routing algorithm is also presented to achieve the NoC reconfiguration when router fails. Theoretical analysis show that for a 10∗10 mesh NoC, the reliability at year 10 is 20 times better than NoC implemented with traditional router. Also our design has greatly improved value for mean time to failure (MTTF). In video and DSP applications, simulation results show better power control with faulty PEs. The new architecture also has other advantages over annealing re-mapping algorithm. Our design has only 12% area overhead, which is significantly better than other approaches to deal with faults.
UR - https://www.scopus.com/pages/publications/84962429595
U2 - 10.1109/SOCC.2015.7406966
DO - 10.1109/SOCC.2015.7406966
M3 - Conference contribution
AN - SCOPUS:84962429595
T3 - International System on Chip Conference
SP - 292
EP - 297
BT - Proceedings - 28th IEEE International System on Chip Conference, SOCC 2015
A2 - Buchner, Thomas
A2 - Zhao, Danella
A2 - Bhatia, Karan
A2 - Sridhar, Ramalingam
PB - IEEE Computer Society
T2 - 28th IEEE International System on Chip Conference, SOCC 2015
Y2 - 8 September 2015 through 11 September 2015
ER -