Chou fasman algorithm for protein structure prediction

33,616 views 21 slides Oct 09, 2014
Slide 1
Slide 1 of 21
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21

About This Presentation

The experimental methods used by biotechnologists to determine the structures of proteins demand sophisticated equipment and time.
A host of computational methods are developed to predict the location of secondary structure elements in proteins for complementing or creating insights into experiment...


Slide Content

Chou- Fasman Algorithm for Protein Structure Prediction By Roshan Karunarathna Gopika Ravichandra

Contents… Importance of the Structures of proteins Prediction of 2D Structures Chou- Fasman Algorithm How it works! Chou-Fasman Algorithm for Protein Prediction 2

What is chou-fasman algorithm? The experimental methods used by biotechnologists to determine the structures of proteins demand sophisticated equipment and time. A host of computational methods are developed to predict the location of secondary structure elements in proteins for complementing or creating insights into experimental results. Chou- Fasman algorithm is an empirical algorithm developed for the prediction of protein secondary structure Chou-Fasman Algorithm for Protein Prediction 3

Before we go….. Structures of proteins…… Why study of structures are important…. What is the need of an algorithm …. Chou-Fasman Algorithm for Protein Prediction 4

Chou-Fasman Algorithm for Protein Prediction 5

Secondary structure prediction In either case, amino acid propensities should be useful for predicting secondary structure Two classical methods that use previously determined propensities: Chou- Fasman Garnier - Osguthorpe -Robson Chou-Fasman Algorithm for Protein Prediction 6

Goal… Take primary structure (sequence) and, using rules derived from known structures, predict the secondary structure that is most likely to be adopted by each residue Major classes are a -helices, b -sheets and loops Chou-Fasman Algorithm for Protein Prediction 7

Structural Propensities Due to the size, shape and charge of its side chain, each amino acid may “fit” better in one type of secondary structure than another Classic example: The rigidity and side chain angle of proline cannot be accomodated in an  -helical structure Chou-Fasman Algorithm for Protein Prediction 8

Structural Propensities Two ways to view the significance of this preference (or propensity ) It may control or affect the folding of the protein in its immediate vicinity (amino acid determines structure) It may constitute selective pressure to use particular amino acids in regions that must have a particular structure (structure determines amino acid) Chou-Fasman Algorithm for Protein Prediction 9

Chou-Fasman method Uses table of conformational parameters (propensities) determined primarily from measurements of secondary structure by CD spectroscopy Table consists of one “likelihood” for each structure for each amino acid Chou-Fasman Algorithm for Protein Prediction 10

Chou-Fasman Algorithm for Protein Prediction 11 Chou- Fasman Algorithm • Conformational parameters for every amino acid (AA): P(a) = propensity in an alpha helix P(b) = propensity in a beta sheet P(turn) = propensity in a turn Based on observed propensities in proteins of known structure

Chou- Fasman propensities (partial table) Chou-Fasman Algorithm for Protein Prediction 12

Chou-Fasman method A prediction is made for each type of structure for each amino acid Can result in ambiguity if a region has high propensities for both helix and sheet (higher value usually chosen, with exceptions) Chou-Fasman Algorithm for Protein Prediction 13

Chou-Fasman method Calculation rules are somewhat ad hoc Example: Method for helix Search for nucleating region where 4 out of 6 a.a . have P  > 1.03 Extend until 4 consecutive a.a . have an average P  < 1.00 If region is at least 6 a.a . long, has an average P  > 1.03, and average P  > average P   consider region to be helix Chou-Fasman Algorithm for Protein Prediction 14

Scan the peptide and identify regions where 3 out of 5 contiguous residues have P(β)>100 . These residues nucleate β- strands. Extend these in both directions until a set of four contiguous residues have an average P(β)<100 . This ends β- strand. Chou-Fasman Algorithm for Protein Prediction 15

region containing overlapping α and β Any assignment are taken to be helical or β depending on if the average P(α) and P(β) for that region is largest. If this residues an α or β- region so that it becomes less than 5 residues, the α or β assignment for that region is removed. Chou-Fasman Algorithm for Protein Prediction 16

Chou-Fasman Algorithm for Protein Prediction 17 SPASEA SDGQSVSV P(a) P(b) S: 77 75 P: 55 55 A: 142 83 S: 77 SPAS EASDGQ FETTY P(a) P(b) E: 151 37 A: 142 83 S: 77 75 D: 101 54 G: 57 Q: 111 1) 4 o f 6, P (a) > 100 2) E x t e n d RI G H T un t i l 4 c o n t i g u o u s R e s i d u e s h ave P (a) < 100 3) C a lc u l ate S P (a) a n d S P ( b ). I s S P (a) > S P ( b )? ( D o N o t In cl u d e l a s t 4 i n su m ) Fin d potent i a l alph a heli x : MFCTYYGNNGE H I E LMM MFCTYYGNNGEHIELMM

Accuracy of Chou- Fasman predictions Sequences whose 3D structures are known are processed so that each residue is “assigned” to a given secondary structure class by looking at the backbone angles Three classes most often used ( helix=H, sheet=E, turn=C ) but sometimes use four classes ( helix, sheet, turn, loop ) Chou-Fasman Algorithm for Protein Prediction 18 Conclusion…..

Confusion matrix for Chou- Fasman method on 78 proteins Data from Z-Y Zhu, Protein Engineering 8:103-109, 1995 Average accuracy =54.4

Thank You!