Online Traffic-Aware Fault Detection for Networks-on-Chip

Research output: Contribution to journal › Article › peer-review

48 Citations (Scopus)

Abstract

A key requirement for modern Networks-on-Chip (NoCs) is the ability to detect and diagnose faults and failures. This paper addresses the challenge of fault diagnosis using online testing, in which interruption of the runtime operation (performance) of the network under diagnosis is minimised. A novel Monitor Module (MM) is proposed that detects NoC interconnect faults while minimising intrusion on regular NoC traffic throughput by (1) using a channel tester which only examines NoC channels when they are idle; and (2) using a testing interval parameter based on the Binary Exponential Backoff algorithm to dynamically balance the level of testing when recovering from temporary faults. The paper presents results showing the minimal impact on NoC throughput for a range of testing conditions and also highlights the minimal area overhead of the MM (11.56%) compared with an adaptive NoC router implemented on FPGA hardware. Simulation results demonstrate that NoC runtime traffic throughput is not intruded upon when channels are fault-free, and show how throughput loss is minimised when faults are identified.
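The two mechanisms named in the abstract, idle-only channel testing and a testing interval driven by binary exponential backoff, can be illustrated with a small sketch. This is not the paper's Monitor Module: the class name `ChannelMonitor`, the parameters `base_interval` and `max_interval`, and the policy of doubling the interval after a failed test and resetting it after a passed test are all assumptions made for illustration, since the abstract does not specify them.

```python
from dataclasses import dataclass, field


@dataclass
class ChannelMonitor:
    """Hypothetical per-channel monitor; names and defaults are illustrative only."""

    base_interval: int = 4    # assumed wait (in cycles) between tests on a healthy channel
    max_interval: int = 64    # assumed cap on the backed-off interval
    interval: int = field(init=False)
    countdown: int = field(init=False, default=0)
    faulty: bool = field(init=False, default=False)
    tests_run: int = field(init=False, default=0)

    def __post_init__(self) -> None:
        self.interval = self.base_interval

    def step(self, channel_idle: bool, test_passes: bool) -> bool:
        """Advance one cycle and return True if a channel test ran this cycle."""
        if self.countdown > 0:
            self.countdown -= 1
            return False
        if not channel_idle:
            # Idle-only testing: a busy channel is never interrupted.
            return False
        self.tests_run += 1
        if test_passes:
            # Assumed policy: a passing test clears the fault and resets the interval.
            self.faulty = False
            self.interval = self.base_interval
        else:
            # Assumed policy: a failing test doubles the interval, up to the cap.
            self.faulty = True
            self.interval = min(self.interval * 2, self.max_interval)
        self.countdown = self.interval
        return True
```

In this sketch `test_passes` stands in for the result of whatever test pattern the hardware would apply to the channel. A temporary fault leads to progressively less frequent retests, and the first passing test returns the channel to the base testing rate.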
Original language: English
Pages (from-to): 1984-1993
Number of pages: 10
Journal: Journal of Parallel and Distributed Computing
Volume: 74
Issue number: 1
Early online date: 16 Sept 2013
Publication status: Published (in print/issue) - 1 Jan 2014
