Skip to content

VLSIFacts

Let's Program the Transistors

  • Home
  • DHD
    • Digital Electronics
    • Fault Tolerant System Design
    • TLM
    • Verification
    • Verilog
    • VHDL
    • Xilinx
  • Embedded System
    • 8085 uP
    • 8086 uP
    • 8051 uC
  • VLSI Technology
    • Analog Electronics
    • Memory Devices
    • VLSI Circuits
  • Interview
    • Interview Experience
    • Training Experience
    • Question Bank
  • Notifications
  • QUIZ
  • Community
  • Job Board
  • Contact Us

Need of Fault Tolerant VLSI System Design

Posted on October 28, 2017June 17, 2025 By vlsifacts No Comments on Need of Fault Tolerant VLSI System Design

In recent few years VLSI design has achieved remarkable growth. High performance (peta-scale) computing is a reality now and we are expecting exa-scale computing by 2020. We talk about many core processor now a days. Intel’s Xeon Phi (Knights Landing) with 72 cores and IBM’s Kilocore processors with more than 1000 cores are great examples of many core processors. Design scientists and manufacturing engineers are working hard to come up with efficient designs which meet the area, power, performance demand of the market. At the same time the highly embedded units need to be robust and the design cycle should meet the time to market needs.

No doubt the designers are trying hard to develop fault free systems, but no matter how robust the design is, 100% fault free design is impossible. Computer Scientists and engineers have introduced variety of tools and techniques to reduce the number of faults in the system they build. However we need to build systems that will acknowledge the existence of faults as a fact of life, and incorporate techniques to tolerate these faults while still delivering an acceptable level of service [1].

Image Source

Different application areas demand fault tolerance a necessity to be incorporated during design phase:

  1. Critical Application: Aircraft, Nuclear reactor, Medical equipments
  2. Harsh Environment: Systems open to high vibration, temperature, humidity, electromagnetic disturbances, particle hits
  3. High Computing Systems: Complex systems consisting of millions of devices

Fault and error in one part of the unit can spread across the whole system. For example a stuck-at-zero at the data output of a memory module (permanent ground connection) might provide a wrong “0” data (while logic “1” is the correct data) to the processor. This wrong value of data would be processed by the processor and it may end up in a wrong result generation. In this scenario, the processor unit was not faulty, but the error in the result propagated due to the fault in the memory module.

Faults in electronic systems can be transient or permanent.

Transient Faults – disappear after a relatively short time. For example, a memory cell whose contents are changed spuriously due to some electromagnetic interference. Overwriting the memory cell with the right content will make the fault go away.

Permanent Faults – never go away, component has to be repaired or replaced

We would discuss about different Redundancy techniques in the future posts to deal with faults in VLSI systems.

[1] Fault-Tolerant Systems, by Israel Koren and C. Mani Krishna

Spread the Word

  • Click to share on Facebook (Opens in new window) Facebook
  • Click to share on X (Opens in new window) X
  • Click to share on LinkedIn (Opens in new window) LinkedIn
  • Click to share on Pinterest (Opens in new window) Pinterest
  • Click to share on Tumblr (Opens in new window) Tumblr
  • Click to share on Pocket (Opens in new window) Pocket
  • Click to share on Reddit (Opens in new window) Reddit
  • Click to email a link to a friend (Opens in new window) Email
  • Click to print (Opens in new window) Print

Like this:

Like Loading...

Related posts:

  1. Defects, Errors, and Faults
  2. Redundancy in Fault Tolerance
  3. Reliability vs. Availability in Fault Tolerance
  4. What are Observability and Controllability
Fault Tolerant System Design Tags:Error, Fault, Fault Tolerance, Permanent Fault, Transient Fault

Post navigation

Previous Post: Device Drivers and Linux Architecture
Next Post: Device Drivers : Role & Types

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Top Posts & Pages

  • ASCII Code
  • Different Coding Styles of Verilog Language
  • Truth Tables, Characteristic Equations and Excitation Tables of Different Flipflops
  • NAND and NOR gate using CMOS Technology
  • Difference between $display, $monitor, $write and $strobe in Verilog

Copyright © 2025 VLSIFacts.

Powered by PressBook WordPress theme

Subscribe to Our Newsletter

%d