Redundancy in System Reliability. Engineering

gajjal 161 views 17 slides Sep 15, 2025
Slide 1
Slide 1 of 17
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17

About This Presentation

Redundancy in reliability engineering is the inclusion of extra (duplicate or backup) components or subsystems in a system, so that if one component fails, another can take over its function.


Slide Content

Redundancy in Reliability By Dr. Priya S Gajjal 1

Redundancy in Reliability Definition: Redundancy in reliability engineering is the inclusion of extra (duplicate or backup) components or subsystems in a system, so that if one component fails, another can take over its function. This increases the overall reliability and availability of the system. Purpose of Redundancy To reduce the probability of system failure. To maintain system functionality even after component failures. To improve fault tolerance and system safety. To achieve higher reliability without needing to improve individual component reliability. 2

Types of Redundancy Active Redundancy (Parallel Redundancy) All redundant components operate simultaneously. If one component fails, the others continue to operate. Used in critical systems requiring continuous operation. Example: Parallel power supply units. Reliability Formula (for n parallel components): Reliability Formula (for n parallel components): = 1- (1- ) ​where 𝑅𝑖 = reliability of each component. Β  3

2. Standby (Cold) Redundancy Only one component operates at a time; others are kept in standby mode. A switching mechanism activates a standby component upon failure. Standby parts are not subject to wear until activated. Reliability depends on: Component reliability Switch reliability Switching time and success probability 3. Hot Redundancy (Warm Redundancy) Standby components are kept powered on but not fully loaded. Failures can still occur at lower rates while idle. Offers faster switchover than cold standby. 4

4. Mixed / Hybrid Redundancy Combination of series and parallel subsystems. Real systems often use series-parallel networks of components. Impact on System Reliability A single component with reliability 𝑅=0.9 Two such components in parallel: 𝑅𝑠𝑦𝑠 =1βˆ’(1βˆ’0.9)Β² =1βˆ’0.01=0.99 ​Reliability improves from 0.9 to 0.99. Trade-offs in Redundancy Advantage Disadvantage Improves reliability & safety Increases cost and weight Reduces downtime Requires more space & power Enhances fault tolerance May add complexity to design Increases mission success Maintenance of extra parts needed 5

Real-Life Examples Aircraft flight control systems with multiple redundant computers. Power plants having parallel backup generators. Data centers with RAID (redundant arrays of disks). Medical devices with redundant sensors. Summary Redundancy is a key reliability engineering strategy where extra components are added to ensure the system still functions if one part fails. It is crucial in safety-critical and mission-critical systems where downtime or failure is unacceptable. 6

⚑ Switching in Standby Redundancy In standby redundancy, only one unit is active at a time, and the others are kept off (cold) until needed. If the active unit fails, a switching mechanism connects a standby unit to keep the system running. The effectiveness of this switch is crucial and is described as perfect or imperfect switching. βœ… Perfect Switching (Perfect Stitching) Definition: Perfect switching means the standby unit is always connected instantly and flawlessly when the active unit fails. Key Points: Switch never fails. Switching is instantaneous (no delay). No loss of functionality or operation during switchover. Assumption: Switch reliability 𝑅𝑠𝑀 = 1 Effect on Reliability: System reliability depends only on the reliability of the units, not on the switch. Used as an ideal assumption in theoretical reliability calculations. 7

Imperfect Switching (Imperfect Stitching) Definition: Imperfect switching means the switching mechanism may fail or introduce delay, so the standby unit might not activate when the primary fails. Key Points: Switch has failure probability. Possible time delay in switching. Can cause system downtime or total failure during switchover. Switch reliability is less than 1 (𝑅𝑠𝑀<1). Effect on Reliability: System reliability depends on both: component reliabilities switching mechanism reliability Overall system reliability is lower than in perfect switching case. Formula:𝑅𝑠𝑦𝑠=𝑅₁ + (1βˆ’π‘…β‚) Γ— 𝑅𝑠𝑀 Γ— 𝑅₂ 8

Aspect Perfect Switching Imperfect Switching Switch Reliability Rsw = 1 (always works) Rsw < 1 (can fail) Switching Delay None Possible delay Continuity Always continuous May cause interruption Used in Ideal/theoretical models Realistic/practical models Effect on Reliability Higher Lower 9

βš™οΈ Reliability Apportionment and Allocation Context: When designing a complex system (made of many subsystems/components), the system must meet a required overall reliability goal. To achieve this, the designer must distribute (apportion or allocate) the system’s reliability requirement among its subsystems and components. πŸ“Œ 1. Reliability Apportionment Definition: Reliability apportionment is the process of breaking down the overall system reliability goal into reliability goals for each subsystem or component. Purpose: To ensure the entire system achieves the desired reliability. To identify critical components requiring higher reliability. To guide design and quality-control efforts. 10

Example: Suppose a system has 5 components in series and must achieve 𝑅𝑠𝑦𝑠=0.90 Then each component might be apportioned: = = = 0.979 So each component must have reliability β‰₯ 0.979. Β  Key Point : Apportionment is done at the design stage. πŸ“Œ 2. Reliability Allocation Definition: Reliability allocation is the practical assignment of reliability requirements to components, considering their importance, complexity, and cost. Unlike apportionment, which is a purely mathematical split, allocation: Considers weighting factors (importance, cost, maturity, environment).Uses engineering judgment and trade-offs. 11

Common Methods: Equal apportionment method: Equal reliability to all components. ARINC method: Assign based on importance and environment factors. Feasibility-of-objectives method: Based on how realistic the targets are.Repair rate or failure rate allocation. Example: If some components are easy to improve, more reliability can be allocated to them, while critical and hard-to-improve ones may get less. 12

πŸ“Š Summary Table Aspect Reliability Apportionment Reliability Allocation Meaning Breaking down system goal Assigning realistic targets Basis Mathematical distribution Engineering judgment + factors Consider constraints No Yes (cost, weight, environment, etc.) Stage Early design stage Detailed design stage Goal Set numerical reliability targets Ensure targets are achievable πŸ“Œ Why This Matters Ensures balanced design: No under- or over-designed parts. Helps meet mission reliability requirements. Reduces development time and cost by planning reliability early. 13

Reliability allocation method is divided into 2 types βš–οΈ Weighting Factor (𝑀 i ) Definition: A weighting factor is a numerical value assigned to each component/subsystem to indicate its relative importance or criticality in achieving the overall system reliability goal. These are used in methods like the AGREE or ARINC reliability allocation method to distribute the system reliability among components. πŸ“Œ Purpose of Weighting Factors To give more reliability to critical components To give less reliability to non-critical or easily repairable parts To reflect design difficulty, complexity, stress level, operating environment, and safety importance 14

πŸ“Š Typical Factors Considered Criterion Description Functional importance How critical it is for system operation Complexity Number of parts / chance of failure Environmental severity Harsh conditions (heat, vibration, etc.) State of the art Technological maturity of the component Repairability / accessibility Ease of replacement or repair Example: Assign scores (1–5) for each criterion Multiply or sum to get total 𝑀𝑖 for each component 15

πŸ“Œ ARINC Allocation Method ( Aerospace Recommended Numbering Identification Code.) ​ ​= 1- ) * } / where: 𝑅𝑖 = allocated reliability of component I 𝑀𝑖 = weighting factor𝑅𝑠𝑦𝑠 Rsys ​ = required system reliability Β  βš™οΈ AGREE Method (Advisory Group on Reliability of Electronic Equipment) The AGREE Method is a classical and widely used reliability allocation method, developed by the U.S. military’s Advisory Group on Reliability of Electronic Equipment (AGREE). It is mainly used for series systems, and it allocates reliability goals to each subsystem based on three main factors: βš–οΈ Basic Concept Each subsystem is given a weight based on: Number of parts (𝑁𝑖) Complexity / Design factor (𝐢𝑖​) Importance factor (𝐼𝑖) These are combined to get a weighting factor (𝑀𝑖), which is used to split the allowable failure rate of the whole system. 16

֎ Optimal Reliability Allocation Definition: Optimal reliability allocation is the process of assigning reliability targets to components such that the overall system reliability goal is achieved at minimum total cost (or weight, or size). It’s a design optimization problem. ֎ Goal Minimize total cost subject to the constraint: β‰₯ ​ while considering: Component cost vs reliability improvement Weight / size / power constraints Practical limits of achievable reliability Β  17