Homology Modeling of Proteins
By: Dr. Mohan Kumar
Assistant Professor
Gyan Jyoti College of Pharmacy,
Hazaribag, Jharkhand
Homology Modeling
Homology modeling, also known as
comparative modeling, refers to constructing
an atomic-resolution model of the "target"
protein from its amino acid sequence and an
experimental three-dimensional structure of a
related homologous protein (the "template").
Homology modelling
Homology modelling is an important computational
technique, within structural biology, to determine the 3D
structure of proteins. It uses available high-resolution
protein structures to produce a model of a protein of
similar, but unknown, structure. Here we will discuss the
essential steps in the process and the circumstances in
which homology modelling is likely to lead to a useful
result.
Homology modelling plays a valuable role in drug design
Computational methods for Protein
Structure Prediction
Homology or Comparative Modeling
Fold Recognition or Threading Methods
Ab initio methods that utilize knowledge-based information
Ab initio methods without the help of knowledge-based information
Why do we need computational approaches?
The goal of research in the area of structural genomics is to provide the means to
characterize and identify the large number of protein sequences that are being
discovered.
Knowledge of the three-dimensional structure
helps in the rational design of site-directed mutations
can be of great importance for the design of drugs
greatly enhances our understanding of how proteins function and how they interact with each
other; for example, it can explain antigenic behaviour, DNA binding specificity, etc.
Structural information from X-ray crystallography or NMR
is obtained much more slowly than sequence information
requires techniques that involve elaborate technical procedures
is often unavailable, because many proteins fail to crystallize at all and/or cannot be obtained or dissolved in large enough
quantities for NMR measurements
The size of the protein is also a limiting factor for NMR.
With a good computational method, a structure can be predicted much faster.
Why is there this great effort to solve protein
structures?
Because they carry large amounts of information, and
influence drug discovery, as one example of an
application, at every stage in the design process.
HIV/AIDS drugs such as Agenerase and Viracept were
developed using the crystal structure of HIV protease.
Thus, no matter how obtained (experimentally,
computationally or using both approaches), 3D protein
structure is of undeniable importance.
Steps in homology modelling
Homology modelling seeks to predict the 3D structure of a protein
based on its sequence similarity to one or more proteins of known
structure.
The method relies on the observation that the structural
conformation of a protein is more highly conserved than its amino
acid sequence.
Homology modelling can be divided into four steps:
template identification,
alignment,
model building and refinement, and
validation
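The alignment step listed above can be illustrated with a minimal global alignment sketch (Needleman-Wunsch dynamic programming). The scoring values (match +1, mismatch -1, gap -2) and the function name `needleman_wunsch` are assumptions chosen for illustration; real tools typically use substitution matrices such as BLOSUM62 and more elaborate gap penalties.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-2):
    """Globally align sequences a and b; return the two gapped strings.

    Scoring values are illustrative assumptions, not a recommended scheme.
    """
    rows, cols = len(a) + 1, len(b) + 1
    # score[i][j] holds the best score for aligning a[:i] with b[:j].
    score = [[0] * cols for _ in range(rows)]
    for i in range(1, rows):
        score[i][0] = i * gap
    for j in range(1, cols):
        score[0][j] = j * gap

    def sub(i, j):
        # Substitution score for pairing a[i-1] with b[j-1].
        return match if a[i - 1] == b[j - 1] else mismatch

    for i in range(1, rows):
        for j in range(1, cols):
            score[i][j] = max(
                score[i - 1][j - 1] + sub(i, j),  # pair the two residues
                score[i - 1][j] + gap,            # gap in b
                score[i][j - 1] + gap,            # gap in a
            )

    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = len(a), len(b)
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub(i, j):
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


if __name__ == "__main__":
    # Toy sequences, invented for the example.
    target, template = needleman_wunsch("ACDEFG", "ACEFG")
    print(target)
    print(template)
```

The gapped strings returned here are the input that the later steps (identity calculation, model building) would work from.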
Template identification
Template identification is the critical first step.
It lays the foundation by identifying appropriate homologue(s) of
known protein structure, called template(s), which are sufficiently
similar to the target sequence to be modelled.
A simple search submits the target sequence to programs such as
BLAST or FASTA. However, these programs work well only for
alignment of sequences with high similarity. Methods such as
PSI-BLAST and ScanPS have recently increased the possibility of
detecting distant homologues.
The ideal template(s) have the highest percentage identity to the
target, the highest resolution, and structures with (or without)
the appropriate ligands and/or cofactors.
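The selection criteria above (highest identity first, then best resolution) can be sketched as a simple ranking. The class `TemplateHit`, the function names and the example hits are hypothetical, and percent identity is computed here over aligned, non-gap columns, which is only one of several conventions in use.

```python
from dataclasses import dataclass


@dataclass
class TemplateHit:
    name: str              # hypothetical template label
    aligned_target: str    # target row of the pairwise alignment
    aligned_template: str  # template row of the pairwise alignment
    resolution: float      # in angstroms; a lower value means higher resolution


def percent_identity(aligned_a, aligned_b):
    """Percent identical residues over columns where neither row has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    pairs = [(x, y) for x, y in zip(aligned_a, aligned_b) if x != "-" and y != "-"]
    if not pairs:
        return 0.0
    matches = sum(1 for x, y in pairs if x == y)
    return 100.0 * matches / len(pairs)


def rank_templates(hits):
    """Sort by identity (descending), breaking ties by resolution (ascending)."""
    return sorted(
        hits,
        key=lambda h: (-percent_identity(h.aligned_target, h.aligned_template),
                       h.resolution),
    )


if __name__ == "__main__":
    # Invented example data, for illustration only.
    hits = [
        TemplateHit("tmplA", "ACDEFGHIKL", "ACDEFGHIKL", 2.5),
        TemplateHit("tmplB", "ACDEFGHIKL", "ACDEFGHIKL", 1.8),
        TemplateHit("tmplC", "ACDEFGHIKL", "ACDQFGH-KL", 1.5),
    ]
    for hit in rank_templates(hits):
        print(hit.name, hit.resolution)
```

The presence or absence of ligands and cofactors is not captured by this ranking and would still need to be judged by hand.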
Model building and refinement
Although the theory behind building a protein homology model is
complicated, using available programs is relatively easy.
Several modelling programs are available, using different methods
to construct the 3D structures.
In segment matching methods, the target is divided into short
segments, and alignment is done over segments rather than over
the entire protein.
Satisfaction of spatial restraints is another method. It is
implemented in the popular program Modeller, which combines
spatial restraints with CHARMM energy terms that ensure valid
stereochemistry.
There are several stand-alone modelling programs available such
as WHAT IF.
Web servers such as SwissModel and the Rosetta server make it
even easier to generate a model.
Validation
After being built, the model needs to be validated.
One of the most thorough structure-checking programs is
WhatCheck.
Other options include Procheck and the VADAR server.
The best validation combines common sense, biological
knowledge and results from analytical tools. Some
models will need further refinement. There is a cycle
between building-validating-refining. Most refinement
involves adjusting the alignment.
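Stereochemical checkers such as Procheck examine, among other things, the backbone dihedral angles of a model (the Ramachandran plot). The sketch below shows only the underlying dihedral angle calculation from four atomic positions; the function name `dihedral` and the test coordinates are illustrative assumptions, and this is not the implementation used by any of the tools named above.

```python
import math


def _sub(a, b):
    return tuple(x - y for x, y in zip(a, b))


def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def _cross(a, b):
    return (
        a[1] * b[2] - a[2] * b[1],
        a[2] * b[0] - a[0] * b[2],
        a[0] * b[1] - a[1] * b[0],
    )


def dihedral(p0, p1, p2, p3):
    """Signed dihedral angle in degrees for four points (0 = cis, +/-180 = trans).

    For backbone phi use C(i-1), N(i), CA(i), C(i);
    for psi use N(i), CA(i), C(i), N(i+1).
    """
    b1 = _sub(p1, p0)
    b2 = _sub(p2, p1)
    b3 = _sub(p3, p2)
    n1 = _cross(b1, b2)  # normal of the plane through p0, p1, p2
    n2 = _cross(b2, b3)  # normal of the plane through p1, p2, p3
    b2_len = math.sqrt(_dot(b2, b2))
    if b2_len == 0:
        raise ValueError("central bond has zero length")
    x = _dot(n1, n2)
    y = _dot(b2, _cross(n1, n2)) / b2_len
    return math.degrees(math.atan2(y, x))


if __name__ == "__main__":
    # Toy coordinates chosen so that the expected angles are easy to see.
    print(dihedral((1, 0, 0), (0, 0, 0), (0, 0, 1), (1, 0, 1)))
    print(dihedral((1, 0, 0), (0, 0, 0), (0, 0, 1), (0, 1, 1)))
```

Computing phi and psi for every residue and comparing them with the allowed regions of the Ramachandran plot is one way to flag residues that may need further refinement.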
Advantages and limitations of homology modelling
Homology modelling is a relatively easy technique.
It takes much less time to learn, to do the calculations and obtain a result, than an
experiment.
It does not require expensive experimental facilities, just a standard desktop
computer.
In the absence of high-resolution experimental structures, therefore, homology
modelling can be of much value.
However, the quality and accuracy of the homology model depend on several factors.
The technique requires a high-resolution experimental protein structure as a
template, the accuracy of which directly affects the quality of the model. Even more
importantly, the quality of the model depends on the degree of sequence identity
between the template and protein to be modelled.
Alignment errors increase rapidly when the sequence identity is less than 30%.
Medium accuracy homology models have between about 30% and 50% sequence
identity to the template.
Such medium-accuracy models can facilitate structure-based prediction of target 'druggability', the design of
mutagenesis experiments and the construction of in vitro test assays.
Higher accuracy models are typically obtained when there is more than 50%
sequence identity.
Higher accuracy models can be used in the estimation of protein-ligand interactions, such as the
prediction of the preferred sites of metabolism of small molecules, as well as
structure-based drug design.
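The identity ranges quoted above can be written as a small lookup. This is only a sketch of the rule of thumb given in the text; the function name `expected_accuracy` and the category labels are hypothetical, and the boundaries are approximate rather than sharp cut-offs.

```python
def expected_accuracy(percent_identity):
    """Map target-template sequence identity (0-100) to a rough accuracy class.

    Thresholds follow the rule of thumb in the text:
    below 30% low, about 30-50% medium, above 50% higher.
    """
    if not 0 <= percent_identity <= 100:
        raise ValueError("percent identity must be between 0 and 100")
    if percent_identity < 30:
        return "low"      # alignment errors increase rapidly
    if percent_identity <= 50:
        return "medium"   # e.g. druggability prediction, mutagenesis design
    return "higher"       # e.g. protein-ligand interaction estimates


if __name__ == "__main__":
    for identity in (22, 41, 68):
        print(identity, expected_accuracy(identity))
```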
Homology modelling of membrane proteins requires particular care.
The available crystal structures are limited, and modelling methods are mainly
designed for water-soluble proteins.
One approach is to compare the results from different modelling methods.
Another limitation of homology modelling is the presence of loops and inserts, as they
cannot be modelled without template data;
however, one can still estimate length, location, and distance from the active site if
the protein is an enzyme.
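Although inserts cannot be modelled from the template, their location and length can be read directly from the target-template alignment, as noted above. A minimal sketch, assuming a pairwise alignment given as two equal-length gapped strings with `-` as the gap character; the function name `find_inserts` and the example sequences are hypothetical.

```python
def find_inserts(aligned_target, aligned_template):
    """Return (start, length) for each run of target residues facing template gaps.

    start is the 1-based position of the first inserted residue in the
    ungapped target sequence.
    """
    if len(aligned_target) != len(aligned_template):
        raise ValueError("aligned sequences must have equal length")
    inserts = []
    position = 0      # number of target residues seen so far
    run_start = None  # target position where the current insert began
    run_length = 0
    for t, s in zip(aligned_target, aligned_template):
        if t != "-":
            position += 1
        if t != "-" and s == "-":
            # Target residue with no template counterpart: part of an insert.
            if run_length == 0:
                run_start = position
            run_length += 1
        elif run_length:
            inserts.append((run_start, run_length))
            run_length = 0
    if run_length:
        inserts.append((run_start, run_length))
    return inserts


if __name__ == "__main__":
    # Invented alignment, for illustration only.
    print(find_inserts("ACDGGSEFKL", "ACD---EFKL"))
```

With residue positions in hand, the distance of each insert from known active-site residues could then be estimated on the template structure.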
Conclusion
We have given an overview of what homology modelling is all
about:
procedure,
applications,
advantages and limitations.
Homology modelling is entirely a computational process and much
easier to implement than the experimental path to structural
information about a protein, although it relies on suitable
experimental structures being already known.
Applications of homology modelling can range from design of the
next experiment in an ongoing biochemical investigation, to the
discovery of drugs with important disease control properties.
On the other hand, in circumstances where the homology model
may be of only limited accuracy, the results may require
experimental verification.
Challenges
To model proteins with lower similarity (e.g. < 30% sequence
identity)
To increase the accuracy of models and to make the process fully automated
Improvements may include simultaneous optimization
techniques in side chain modeling and loop modeling
Developing better optimizers and potential functions, which can
lead the model structure away from the template towards the correct
structure
Although comparative modelling needs significant improvement,
it is already a mature technique that can be used to address
many practical problems
Automated Web-Based Homology Modeling
SWISS-MODEL: http://www.expasy.org/swissmod/SWISS-MODEL.html
WHAT IF: http://www.cmbi.kun.nl/swift/servers/
The CPHModels Server: http://www.cbs.dtu.dk/services/CPHmodels/
3D Jigsaw: http://www.bmm.icnet.uk/~3djigsaw/
SDSC1: http://cl.sdsc.edu/hm.html
EsyPred3D: http://www.fundp.ac.be/urbm/bioinfo/esypred/
Comparative Modeling Server & Program
COMPOSER: http://www.tripos.com/sciTech/inSilicoDisc/bioInformatics/matchmaker.html
MODELER: http://salilab.org/modeler
InsightII: http://www.msi.com/
SYBYL: http://www.tripos.com/