Module 4: Failure Investigation Flashcards
What are common pitfalls in failure investigations?
1) Jumping to conclusions
2) Not understanding the problem
3) Not understanding how the failed system is supposed to operate
4) Not considering all possible failure cases
5) Tearing system apart without a developed plan
6) Failure to follow through
7) Not asking for help
8) Thinking it is too easy to do
9) Destroying evidence due to lack of planning
What is NOT failure analysis?
1) “Give me your best guess”
2) Not identifying the root cause(s)
3) Reworking or repairing
4) Swapping parts: Remove and replace mentality
5) Ignoring the problem
6) We are going to change it later
7) Band-aid fixes: return to the supplier;scrap;another syste, supplier or inventory
8) Never happens
What are the principle stages of a failure analysis?
1) Get background information (documentation, eg procurement specs, materials list, processes lists, standards, parts list, drawings etc)
2) Catalog evidence
3) Non destructive testing, chemical analysis, mechanical testing, metallographic examination
4) Amass information to come to a conclusion
5) Make recommendations
What is the first step in failure analysis?
Get a good understanding of the conditions under which the part was active. The investigator must ask questions from those who work with, as well as those who maintain the equipment and visit the site whenever possible.
It is essential that these questions are asked ASAP after the failure
There are three reasons one would collect data through interviews:
1) Firsthand data (witnesses, participants, etc)
2) Background and circumstantial data (historical experiences, related events, situational insights etc)
3) Expert information (to elicit technical knowledge)
What is the second step in failure analysis?
Perform a visual exmaination, cataloguing and recording the physical evidence at the same time. This serves the functions of:
1) Familiarizing the investigators with the evidence
2) Creating a permanent record that can be referred to in light of new information
What should investigators be cautious about evidence?
1) Pieces should always be examined and recorded before any surface cleaning is undertaken
2) A good general rule is to be conservative when destroying evidence of any kind
What is the third step in failure analysis?
Devide on a course of action. On the basis of visual inspections and background information the investigator must outline a plan of action, which is the series of steps required to successfully complete the case. There are various means that an investigator can draw on to determine the cause of failure, which can be classified in the following categories:
- Macroscopic examination
- NDT
- Chemical analysis
- Microscopic examination
- Testing
What are some general rules for colleting samples?
1) Collect failed parts, nearby fragments and lubricant and fluid samples
2) Collect evidence beyond what is apparent at the time of the initial assessment
3) Collect undamaged samples of similar components for comparison to the damaged ones
4) Draw diagrams to indicate the position of parts and sample collection locations
5) Do not be afraid to take many photographs while photo documenting the scene
6) Take shots from every angle and always have a scalable object in the photo, preferably ra rules scale
7) Protect samples, particularly delicate items and fracture surfaces, from each other and from other sources of damage
8) Label the samples in order to indicate when and why it was collected, how it was oriented, who removed it and what were the relevant in site observations
What are steps undertaken in a laboratory analysis?
1) initial examination
2) Photodocumentation
3) NDE
4) Material verification
5) Fractographic examination
6) Microscopic analysis
7) Mechanical properties determination
8) Analysis of evidence
What are some questions one should ask?
- Should the complex part be an assembly of several parts rather than one?
- How was the component loaded, and was anisotropy considered?
- Is the material capable of being produced with the required properties in the form used?
- Can any available material meet the specifications?
- Did the strength requirements preclude toughness or corrosion resistance needs?
- Was the wear resistance adequate for the materials in contact?
- Were the seried properties compromised by the use of low-cost materials or processes?
- Did the materials and processing comply with the applicable codes and standards?
- Was the product made with unique materials and processes?
- How did the scrap value contribue to repair and maintenance decisions in service?
- Does the material prossess adequate durability in the service envrionment?
- Was the CoC of the materials compliant to the specifications?
- Are there other samples available of the same material batch? (reproduce the failure)
- Were proprietary or obsolute materials and processes employed?
- Were the manufacturing procsesses used to create the desired shape appropriate?
- Did the individual processing methods make sense?
- Should it have been preheated prior to heat treatment or welding?
- Did the fabricality requirements compromise the desired mechanical or physical properties?
- Were the manufacturing methods appropriate for the quantity produced?
- Were the operating conditions and maintenance as intended?
- Were the service conditions easy to anticipate?
- Were the operators certified for the performed manufacturing process?
- Were anomalies documented during the manufacturing process?
How to formulate conclusions?
- Has the failure sequence been established?
- If the failure involved cracking or fracture, have the initation sites been determined?
- Did cracks iniate at the surface or below the surface?
- Was cracking associated with a stress concentration
- How long was the crack present?
- What was the level of magnitude of the loads?
- What was the type of loading? static, cyclic or intermittent?
- Is the fracture surface consistent with the type of loading assumed in the design?
- What was the failure mechanism?
- Serive temperate at time of failure
- Did high/low temp. contribute? Were there temp excursions?
- Did wear contribute?
- Did corrosion contribue? what type of corrosion?
- Was proper material used?
- Was the cross section adequate? were the stresses high?
- Was the quality of the material acceptable in accordance with specifications?
- Was heat treatment properly done?
What should a failure analysis report contain?
A failure analysis report is the summation of all the individual tests and alayses performed during the course of an invesrtigations. Reports should be written with the target audience in mind.
Failure analysis reports contain as minimum:
1) An executive summary (stating the main conclusions)
2) An introduction that included background information on the subject of the report and restates the work requested and may serve as a written record for verbal work requests
3) A section detaling investigative procedures and results
4) Conclusions
What is the root cause method?
The principles of root cause analysis (RCA) may be applied to ensure that the root cause is understood and appropriate corrective actions may be identified.
These principles are rooted in the following method:
1) Identify the issue
- Describe the current situation
- Define the deficiency in terms of the symptoms (or indicators)
- Determine the impact of the dficiency on the component, product, system and customer
- Define a goal
- Collect data to provide a measurement of the deficiency
2) Determine root cause
- Design deficiencies
- Material defects
- Manufacturing/installation defects
- Service life anomalies
3) Reproduce the failure (not always possible but very useful)
4) Develop corrective actions
5) Validate and verify corrective actions
6) Standardize
What is the fish bone analysis?
Every bone is a potential root cause to the problem. Each bone has a list of causes, which may contribute to the problem. Some examples could be:
- Machine
- Method
- Material
- Man/mind power
- Measurement/medium
Discuss about developing, validating and standardizing corrective actions.
Develop corrective actions:
- List possible solutions to mitigate and prevent reccurence of the problem
- Generate alternatives
- Develop implementation plan
Validate and verify corrective actions
- Test corrective actions in pilot study. Measure effectiveness of change. Validate improvements. Verify that the problem is corrected and improves customer satisfaction.
Standardize
- Incorporate the corrective action into the standards documentation of the company, organization or industry to prevent recurrence in similar problems or systems
- Monitor changes to ensure effectiveness.