ROBUST: A new self-healing fault-tolerant NoC router

Jacques Henri Collet, Ahmed Louri, Vivek Tulsidas Bhat, Pavan Poluri

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

11 Scopus citations

Abstract

This work addresses the general problem of making Network-on-Chip (NoC) routers totally self-healing in massively defective technologies. There are three main contributions. First, we propose a new hardware approach based on Built-In Self-Test techniques and multi-functional blocks (called Universal Logic Blocks, ULBs) to autonomously diagnose permanent faults and repair faulty units. Through simple reconfiguration, ULBs can assume the functionality of various functional units within the router, which enables the repair of multiple permanent faults within the NoC router. Second, we propose a new reliability metric and introduce a probabilistic model to estimate the router reliability improvement achieved by the protection circuitry. Third, we compare our architecture to two router architectures (Vicis and Bulletproof) and show that our design provides superior reliability improvement, especially in extremely defective nanoscale technologies (i.e., typically above 30% of faulty routers). The most striking result is that self-healing of the routers maintains communications at fault levels where it is normally impossible to preserve them.

Original language: English (US)
Title of host publication: 4th International Workshop on Network on Chip Architectures, NoCArc 2011 - In Conjunction with the 44th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO-44
Pages: 11-16
Number of pages: 6
State: Published - 2011
Externally published: Yes
Event: 4th International Workshop on Network on Chip Architectures, NoCArc 2011, in Conjunction with the 44th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO-44 - Porto Alegre, Brazil
Duration: Dec 4 2011 to Dec 4 2011

Publication series

Name: ACM International Conference Proceeding Series

Other

Other: 4th International Workshop on Network on Chip Architectures, NoCArc 2011, in Conjunction with the 44th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO-44
Country/Territory: Brazil
City: Porto Alegre
Period: 12/4/11 to 12/4/11

Keywords

  • fault-tolerance
  • multi-core architectures
  • network-on-chip
  • self-healing

ASJC Scopus subject areas

  • Software
  • Human-Computer Interaction
  • Computer Vision and Pattern Recognition
  • Computer Networks and Communications
