University of Gondar Institute of Biotechnology
Department of Biotechnology
Introductions to Omics Course
By:-Tewodros Eshetie(BSc, in Biotechnology, MSc in Medical biotechnology) [email protected]/[email protected]
1
•Chapter 1: Introduction to Omics Science
2
What is omics ?
•Thecompletesequencingofhumangenomehasusheredinneweraofsystembiology
referredtoasomicstechnology.
•Theterm“omic”derivedfromthelattinsuffix“ome”meaningmassormany.Thus
OMICSinvolveamass(largenumber)ofmeasurementperendpoint.
•Omicsreferstovariousbranchesofsciencedealingwith“Omes”
3
•In biological context, suffix -omics is used to refer a study of large set of
biological molecule
•The broad analysis of large biological molecules was needed to be studied in detail
as the conclusion of human genome project (HGP) in 2001.
•Omesarethecompletecollectionofobjectsorthewholesystemunderstudy
➢Genome
➢Transcriptome
➢Proteome
➢Metabolome
•Omicsreferstothecollectivetechnologiesusedtoexploretheroles,
relationships,andactionsofvarioustypesofmoleculesthatmakeupthecellsof
anorganism
4
cont’d…
•Thetermgenome
✓Istheoldestandbestknownnameoftheomesfamily
✓Wasintroducedin1920bytheGermanbotanistHansWinklerfrom
•Gen(“gene”)+(chromos)om(chromosome)
•The first genome was completely sequenced by sanger in Cambridge, UK, in the
1970s.
•Genome is the most fundamental part of many omics and it is defined as an
organism's genetic material.
6
What is genomics?
•Thebranchofbiologicalsciencesthatdealswiththestructureand
informationencodedinthegenomesoforganism.
•Itsnewandexcitingsciencethatdealswiththediscoveryandnoting
ofallthesequencesintheentiregenomeofaparticularorganism
•Genomicsreallycameintoitsownwiththecompletionof
-humangenomeprojectwiththeaimtosupplyprecisemapping
andthecompletesequenceofhumangenome.
22
Cont’d…
•The topics of genomics range from genome mapping, sequencing and
functional genomic analysis to comparative genomic analysis.
•Itisthefirstoftheomicsbranchestohavebeendefinedintially
mainlyinvolvedinanalysingDNAsequencingprojectsdataaimedto
mapandsequencethesetofentiregenomes
•Thestudyofthestructure,functionandexpressionofallgenesinan
organism.
•genomicsisthereforethestudyofthegeneticmakeupoforganisms
23
•Why should we study genome?
•What is the difference between genome and transcriptome
proteome and metabolome?
•What makes genome different from the others?
26
Why should we study genome?
➢Eachandeveryoneisauniquecreation.howsimilarwearehow
differentweare
➢Todeterminethefunctionofgene
➢TodetermineStructureofgene
➢Tomakecomparision
➢Life’slittlebookofinstructionstodeterminewerewearefrom
27
Cont’d…
•The experimental techniques and tools to study the genome structure
and function include
•Gene mapping
•Sequence technique
•DNA microarrays
•PCR
•Probe
•Hybridization e.t.c
28
Cont’d…
➢Physicalmapping-usingallavailabletechniquesorinformationtodeterminethe
absolutepositionofgeneonchromosome.
•Genetic distance is measured by frequency of crossing over between loci on the
same chromosome
•One map unit= one centimorgan(CM)=1% recombination between them
31
❑Uses of Gene Mapping
•Identify genes responsible for diseases.
✓Heritable diseases
✓Cancer
•Identify genes responsible for traits.
✓Plants or Animals
✓Disease resistance
✓Meat or Milk Production
32
Cont’d…
❖History;
➢Several notable advancements in DNA sequencing were made during the 1970s.
➢Frederick Sanger developed rapid DNA sequencing methods at Cambridge, UK
and published a method for "DNA sequencing with chain terminating inhibitors" in
1977.
➢Walter Gilbert and Allan Maxam at Harvard also developed sequencing
methods, including one for "DNA sequencing by chemical degradation"
➢The first full DNA genome to be sequenced was that of bacteriophage φX174 in
1977.
35
Cont’d…
➢Several new methods for DNA sequencing were developed in the mid to late 1990s.
➢These techniques comprise the first of the "next-generation" sequencing methods.
➢In 1996, PålNyrénand his student Mostafa Ronaghi at the Royal Institute of Technology
in Stockholm published their method of pyrosequencing.
➢In 2000 Lynx Therapeutics published and marketed "Massively parallel signature
sequencing", or MPSS.
➢This method incorporated a parallelized, adapter/ligation mediated, bead-based sequencing
technology and served as the first commercially available "next-generation“ sequencing
method.
37
Why we need to sequence genome?
❑Whysequenceagenome?
•Tohavecompletegeneticblueprintoftheorganism
•Becausethesequenceofnucleotidesinthegenomeareimportant
informationtodetermine;
-Howdifferentregionsofthegenomecontrolthedevelopmentand
functionofanorganism.
•Toidentifydifferenceamongindividual
38
❑Use of sequencing
➢DNA sequencing used to determine the sequence of individual genes, larger genetic
regions, full chromosomes or entire genomes.
➢provide the order of nucleotides in DNA or RNA isolated from cells of animals,
plants, bacteria or virtually any other source of genetic information.
➢The resulting sequences may be used by researchers in molecular biology or genetics
to further scientific progress or may be used by medical personnel to make treatment
decisions or aid in genetic counseling.
39
2.Types of genomics
➢Genomediffersignificantlybetweenspeciesinsizetopologytypes
andfunction.
❖Genomicsisclassifiedas;
▪Functionalgenomics
▪Comparativegenomics
▪Structuralgenomics
40
microarray-based approaches
•Themostcommonlyusedglobalgeneexpressionprofilingmethodincurrent
genomicsresearchistheDNAmicroarray-basedapproach
•Amicroarray(orgenechip)isaslideattachedwithahigh-densityarrayof
immobilizedDNAoligomers(sometimescDNAs)representingtheentiregenome
ofthespeciesunderstudy
•Each oligomer is spotted on the slide and serves as a probe for binding to a
unique, complementary cDNA.
44
Cont’d…
•TheentirecDNApopulation,labeledwithfluorescentdyesorradioisotopes,is
allowedtohybridizewiththeoligoprobesonthechip
•Theamountoffluorescentorradiolabelsateachspotpositionreflectstheamount
ofcorrespondingmRNAinthecell.
•Using this analysis, patterns of global gene expression in a cell can be examined
•Sets of genes involved in the same regulatory or metabolic pathways can
potentially be identified.
45
General genomic comparisons
Organism Genome Size (Bases) Estimated Genes
Human (Homo sapiens) 3 billion 30,000
Laboratory mouse
(M. musculus)
2.6 billion 30,000
Thale cress (A. thaliana) 100 million 25,000
Roundworm (C. elegans) 97 million 19,000
Fruit fly (D. melanogaster) 137 million 13,000
Yeast (S. cerevisiae) 12.1 million 6,000
Bacterium (E. coli) 4.6 million 3,200
Human immunodeficiency virus
(HIV)
9700 9
49
3.2.Eukaryote's genome organization
➢Multicellularcell
➢TheDNAineukaryotesarelargerinsizeandpresentinnucleus,comparedwith
prokaryoticgenome
➢InadditiontoDNAcontainedinthenucleusandribosomes,eukaryoteshaveasmall
mitochondrialandchloroplastDNA
-because eukaryotes have mitochondria (but prokaryotes do not have)
➢Containmultiplelinearchromosomecalled,Diploidorganism
➢DNAisassociatedwithproteinhasnooperon
64
Cont’d…
•The Eukaryotes genome organization classified into 4 stages;
1.Nucleosome formation
➢eukaryotic DNA combines with basic protein molecule called histone ,to form a structure
known as nucleosome.
➢Its wrapped structure of 146bp of DNA
➢This structure contain H2A,H2B,H3 and H4 types of histone protein and also…
➢H1 is called linker histone protein
65
3.3.Difference between eukaryotic and prokaryotic genome
Prokaryotic genome Eukaryotic genome
They are small in size They are larger in size
Contain single circular chromosomeContain multiple linear chromosome
They are haploid They are diploid
DNA is not associated histone proteinDNA associated with histone protein
They are condensed in nucleoid of
cytoplasm
They are condensed in membrane bound
nucleus
More than 90% coding sequence Only 3% coding sequence
71
Cont’d…
•Genes that are on the same chromosome are said to be ‘linked’ and the
distance between these genes is called a ‘linkage distance’.
•The smaller the distance the more likely two genes will be inherited together.
•The same concept of studying how traitsare passed on was applied to develop
the first human genome map.
78
B. Fluorescent in situ hybridization(FISH)
➢ThisusesfluorescentprobestodetectthelocationofDNAsequencesonchromosomes.
➢First,theprobesareprepared.Theseareshortsequencesofsingle-strandedDNA,that
matchtheDNAsequencethatthescientistwantstofind.
➢Theprobesarethenlabelledwithfluorescentdyebeforebeingmixedwiththe
chromosomeDNA,sothatitcanbindtoacomplementarystrandofDNAonthe
chromosome.
84
Cont’d…
➢ThefluorescenttagallowsthescientisttoseethelocationoftheDNA
sequenceonthechromosome.
➢Thesiteofhybridizationisvisualizedunderafluorescencemicroscope.The
siteofthegeneappearsasabrightspot.
➢This technique provides the most direct method of visualizing genes on
chromosomes.
85
1.1.Goals Of HGP
➢To identify all the genes in human DNA.
➢To determine the sequences of the 3.2 billion base pairs.
➢To store the information in databases;
➢To develop a genetic linkage map of human genome.
➢To obtain a physical map of human genome.
➢To develop technology for the management of human genome information.
➢To know the function of genes.
104
Interesting fact or salient feature of human genome
➢The human genome is by far the most complex and largest genome.
➢The human genome contains over 3 billion nucleotide pairs and is
estimated to have about 30,000 genes .
➢Average gene consists of 3000 bases.
-But sizes of genes vary greatly, with the largest known human gene
encoding dystrophincontaining 2.5 million base pairs.
106
1.2.Issues Of Concern Ethical, Legal and Social issues of the Human
Genome Project
➢Fairnessintheuseofgeneticinformation.
➢Privacyandconfidentialityofgeneticinformation.
➢Psychologicalimpact,stigmatization,anddiscrimination
➢Reproductiveissues.
➢Clinicalissues.
➢Uncertaintiesassociatedwithgenetestsforsusceptibilitiesandcomplexconditions
109
1.3.Application Of HGP
➢Thesequencingofthehumangenomeholdsbenefitsformanyfields,frommolecular
medicinetohumanevolution.
➢Helpsinidentifyingdiseasecausinggene.
➢Identificationofmutationslinkedtodifferentformsofcancer.
➢ThesequenceoftheDNAisstoredindatabasesavailabletoanyoneontheInternet.
➢Benefittedtheadvancementofforensicscience.
110
1.4.Pros and cons of the Project
❖Pros
➢Successfully identifies where the genes of DNA are located in the body
➢Genetically modify foods
➢Make crops grow faster and more resistant to pesticides
➢Mapping can locate cancer cells and mental illnesses
❖Cons
➢15 years
➢$3-9 billion
➢Requires skill
➢Process is very difficult with lots of procedures
111
Cont’d…
➢The Plant Genome Program is one program with two parts: National
Research Initiative (NRI) and Plant Genome Database (PGD).
➢The PGD is now a real and functioning information and data resource for
agricultural and other plant science genome researchers, and it is in the
public domain.
➢In crop plants, the genome mapping and sequencing has been completed in
two plant species; Arabidopsis and Rice.
115
Cont’d…
➢Arabidopsis thaliana was the first plant to be completely sequenced.
➢It is an important model plant for identifying genes and determining their
function of other plant species because Arabidopsis
-found in all over the world
-easy to grow
-basic similarity with the other crop
-simple genetic makeup
116
1.Transcriptomics
➢Thetranscriptomeisthecompletesetoftranscriptsinacellandtheirquantity,foraspecific
developmentalstageorphysiologicalcondition.Or
➢ThetranscriptomeisthesetofallRNAmolecules,including
▪mRNA,
▪rRNA,
▪tRNA, and other non-coding RNA produced in one or a population of cells.
➢Itisacollectionofallthegenetranscriptspresentinagivencell.
122
Why the study of RNA is so important??
❖RNAprofilingprovidecluesaboutthe
✓Functionaldifferencesbetweentissuesandcelltype
✓Functionandinteractionbetweengenes
✓Generegulationandregulatorysequences
✓Identificationofcandidategenesforanyprocessordisease
✓Expressedsequencesorgenesofgenome
125
How much RNA molecules are found in the cell/organism?
➢Atypicalbacteriumcontains0.05-0.10pgofRNAmakingupabout6ofitstotalweight
➢AmammaliancellbeingmuchlargercontainmoreRNA20-30pginallbutthis
representsonly1%ofthecellasawhole
➢ThereforebestwaytounderstandthismuchofRNAcontentofacellistodivideitinto
categoriesandsubcategoriesdependingonfunction
➢AccordinglytheprimarydivisioniscodingRNAandnoncodingRNA
126
Cont’d…
•The most well –studied out come is post-transcriptional gene silencing,
-which occur when the guide strand pairs with a complementary sequence in a m
RNA molecule and induces cleavage
136
Cont’d…
137
Cont’d…
138
3.Function of RNA
3.1. RNA as Genetic information storage/transfer;
➢AlthoughRNAdoesnotserveasthehereditaryinformationinmostcells,RNAdoes
holdthisfunctionformanyvirusesthatdonotcontainDNA
➢Thus,RNAclearlydoeshavetheadditionalcapacitytoserveasgeneticinformation.
3.2.RNAasstructuralcomponentofcell
➢CompositionofbothprokaryotesandeukaryotesribosomescontainrRNA
asstructuralcomponent.
139
4.Methods for transcriptome or Gene expression analysis
•What is gene expression?
•Gene expression is the process by which the instructions in our DNA are
converted into a functional product, such as a protein.
•When the information stored in our DNA is converted into instructions for
making proteinsor other molecules, it is called gene expression.
•Gene expression is a tightly regulated process that allows a cell to respond
to its changing environment.
141
Cont’d…
➢It acts as both an on/off switch to control when proteins are made and also a
volume control that increases or decreases the amount of proteins made.
➢There are two key steps involved in making a protein, transcription and
translation.
❖Transcription
➢Transcription is when the DNA in a gene is copied to produce an RNA transcript
called messenger RNA(mRNA).
➢This is carried out by an enzyme called RNA polymerase which uses available
bases from the nucleus of the cell to form the mRNA.
142
Cont’d…
❖Translation
➢Translation occurs after the messenger RNA (mRNA) has carried the
transcribed ‘message’ from the DNA to protein-making factories in the cell,
called ribosomes.
➢The message carried by the mRNA is readby a carrier molecule called
transfer RNA (tRNA).
➢The mRNA is read three letters (a codon) at a time.
144
Cont’d…
➢Each codon specifies a particular amino acid. For example, the three bases ‘GGU’ code
for an amino acid called glycine.
➢As there are only 20 amino acids but 64 potential combinations of codon, more than one
codon can code for the same amino acid. For example, the codons ‘GGU’ and ‘GGC’
both code for glycine.
➢Each amino acid is attached specifically to its own tRNA molecule.
➢When the mRNA sequence is read, each tRNA molecule delivers its amino acid to the
ribosome and binds temporarily to the corresponding codon on the mRNA molecule.
145
Cont’d…
➢Once the tRNA is bound, it releases its amino acid and the adjacent amino acids all join
together into a long chain called a polypeptide.
➢This process continues until a protein is formed.
➢Proteins carry out most of the active functions of a cell.
146
Methods/Approach for transcriptome or Gene expression analysis
✓ExpressionSequenceTag(EST)
✓Serial/CapAnalysisofGeneExpression(SAGE/CAGE)
✓Microarray
✓RNA-Seq(RNASequancing)
147
Cont’d…
➢Expressed Sequence Tags (ESTs)
➢ESTs are short (200–500 nucleotides) DNA sequences that can be used to identify a gene
that is being expressed in a cell at a particular time.
❖The Procedure:
➢Isolate the messenger RNA (mRNA) from a particular tissue (e.g., liver)
➢Treat it with reverse transcriptase. Reverse transcriptase is a DNA polymerase that uses
RNA as its template. Thus it is able to make genetic information flow in the reverse (RNA -
>DNA) of its normal direction (DNA -> RNA).
➢This produces complementary DNA (cDNA). Note that cDNA differs from the normal gene
in lacking the intron sequences.
➢Sequence 200–500 nucleotides at both the 5′ and 3′ ends of each cDNA.
➢Examine the database of the organism's genome to find a matching sequence.
149
Steps involved in DNA microarray:
•The are four main steps involved in DNA microarray
1.Sample collection and Isolation of mRNA
2.Creation of labeled cDNA
3.Hybridization and washing
4.analysis
159
Cont’d…
1.Sample collection and isolating mRNA
•Asamplecanbeanycell/tissuethatwedesiretoconductourstudyon.
•Generally,2typesofsamplesarecollected,i.e.healthyandinfectedcells,for
comparingandobtainingtheresults.
•The extraction of RNA from a sample is performed by using a column or solvent like
phenol-chloroform.
•mRNA is isolated from the extracted RNA leaving behind rRNA and tRNA.
160
Cont’d…
➢As mRNA has a poly-A tail, column beads with poly-T tails are employed to bind mRNA.
➢Following the extraction, buffer is used to rinse the column in order to isolate mRNA from
the beads.
2.CreationoflabeledcDNA
➢Reverse transcription of mRNA yields cDNA.
➢Both the samples are then integrated with different fluorescent dyes for the production of
fluorescent cDNA strands which allows to differentiate the sample category of the cDNAs.
161
3.Hybridization and washing
•ThelabeledcDNAsfromboththesamplesareplacedontheDNA
microarraywhichpermitsthehybridizationofeachcDNAtoits
complementarystrand.
•Thentheyarethoroughlywashedtoremoveunpairedsequences.
162
❖RNA isolation:
➢The first step in the RNA sequencing is the isolation of total RNA, mRNA
or ncRNA for the experiment.
➢Isolating RNA is a tedious process in comparison with DNA isolation. The
RNA can be breakdown easily. Also, the chance of contamination is high in
RNA isolation.
➢Thus extra care must be taken while isolating RNA and expert hand in
required to do so.
➢High yield pure RNA is must require for RNA sequencing.
168
Cont’d…
➢Note: for isolating mRNA from the total RNA pool, one additional step is
required, using the oligo dT specific column in the purification or final
step of elution, total mRNA can be isolated from the rest of RNAs.
❖Reverse transcriptase PCR:
➢Another innovative set up for RNA sequencing is to do reverse transcriptase PCR in
which the RNA is reverse transcribed into DNA.
➢Using a special type of polymerase known as DNA reverse transcriptase, the cDNA is
synthesized from the mRNA.
169
Cont’d…
•In the next step, the sample of fragmented DNA is sent for sequencing.
•The fragmentation method is a key aspect of sequencing library
construction.
•Fragmentation may be achieved by chemical hydrolysis, nebulization,
sonication….
173
❖DNA sequencing:
➢Now the sample is sequenced in the high throughput NGS machine which
reads the sequence as well as quantifies the nucleic acid as well.
➢The chemistry behind sequencing the cDNA is depends on which platforms
we are using, although, the widely used method is the use of fluorescence
chemistry
➢In the sequencer, individual fragments are “read” separately and sequenced.
➢The final data are sent to the bioinformatics lab for post-sequencing analysis.
174
Types of RNA-seq technologies:
➢Roche454-thefirstNGSTechnologyforRNASeqwasintroducedbyRochein2004,
179
Cont’d…
➢TheIlluminaplatform-introducedin2006,providinghighlyaccuratesequencingwitha
lowerrorrate,evenwithinrepetitivesequenceregions.
➢SOLID-Sequencing by oligo ligation and detection (SOLID) technology, released in 2007,
uses a sequencing chemistry catalyzed by DNA ligase and produces an average read length
of 75bp.
180
What is proteome?
➢Proteomeistheentiresetofproteinsexpressedbyagenome,cell,tissueororganism.
➢Morespecifically,itisthesetofexpressedproteinsinagiventypeofcellsoranorganism
atagiventimeunderdefinedconditions.
➢Thetermisablendofproteinsandgenome
➢Differenttypesofcellsmakedifferentproteins,
-sotheproteomeofonecellwillbedifferentfromtheproteomeofanother
185
Cont’d..
➢The proteome is extremely dynamic, and minor alterations in the external or internal
environment can modify proteome function
➢By comparing protein sequences and structures, scientists can classify many proteins in an
organism’s proteome and deduce their functions by homology with proteins of known
function.
➢In the past protein determination was done by mRNA analysis, but this was found not to
correlate with protein content.
186
Cont’d…
➢Some times there is lack of correlation between mRNA and protein abundances. Why ???
➢One of the reasons for this is the existence of control mechanisms at the ribosomes, where
mRNA is translated to peptides.
➢Translational control allows the cell to select only certain mRNAs for translation and block
others.
➢The selection is often dependent on environmental conditions, so this mechanism allows
for physiological adaptation on the level of the proteome, even though the transcriptome
remains the same
187
Cont’d…
➢It is now known that mRNA is not always translated into protein, and the amount of
protein produced for a given amount of mRNA depends on the gene it is transcribed
from and on the current physiological state of the cell.
➢Another issue is post-translational modification or protein processing, processes that
can greatly affect the function of a protein.
➢Not only does the translation from mRNA cause differences, many proteins are also
subjected to a wide variety of chemical modifications after translation.
➢Many of these post-translational modifications are critical to the protein's function
188
❖What is protein ??
❖Discuss different types of protein based on their function and structure???
191
2.Protein
•Proteins are nitrogenous organic compounds of high molecular weight which
play a vital or prime role in living organisms. They are made up of a-amino
acids.
❖Amino Acids
•Amino acids are the monomers that make up proteins.
•Each amino acid has the same fundamental structure, which consists of a
central carbon atom, also known as the alpha (α) carbon, bonded to an amino
group (NH2), a carboxyl group (COOH), and to a hydrogen atom
192
➢The sequence and the number of amino acids ultimately determine the protein’s
shape, size, and function.
➢Each amino acid is attached to another amino acid by a covalent bond, known as a
peptide bond.
➢When two amino acids are covalently attached by a peptide bond, the carboxyl
group of one amino acid and the amino group of the incoming amino acid combine
and release a molecule of water.
➢Any reaction that combines two monomers in a reaction that generates H2O as one
of the products is known as a dehydration reaction.
195
Cont’d…
196
2.1.Types of protein
❖Protein can be grouped into several broad functional classes:
➢Structural proteins, which provide structural rigidity to the cell;
➢Transport proteins, which control the flow of materials across cellular
membranes;
➢Regulatory proteins, which act as sensors and switches to control protein
activity and gene function
➢Signaling proteins, including cell surface receptors and other proteins that
transmit external signals to the cell interior; and
➢Motor proteins, which cause motion.
197
Cont’d
198
2.2.Function of Protein
➢Proteins perform essential functions throughout the systems of the human body.
These long chains of amino acids are critically important for:
✓catalyzing chemical reactions
✓synthesizing and repairing DNA
✓transporting materials across the cell
✓receiving and sending chemical signals
✓responding to stimuli
✓providing structural support
199
Enzymes
➢Enzymes are proteins that catalyze biochemical reactions, which otherwise would not take
place.
➢These enzymes are essential for chemical processes like digestion and cellular metabolism.
Without enzymes, most physiological processes would proceed so slowly (or not at all) that
life could not exist.
➢Because form determines function, each enzyme is specific to its substrates.
➢The substrates are the reactants that undergo the chemical reaction catalyzed by the enzyme
200
Cont’d…
➢The location where substrates bind to or interact with the enzyme is known as
the active site, because that is the site where the chemistry occurs
➢There are two basic classes of enzymes:
➢Catabolic enzymes: enzymes that break down their substrate
➢Anabolic enzymes: enzymes that build more complex molecules from their
substrates
201
Cont’d…
➢Enzymes are also essential for biosynthesis: the process of making new, complex
molecules from the smaller subunits that are provided to or generated by the cell.
➢These biosynthetic enzymes include DNA Polymerase, which catalyzes the
synthesis of new strands of the genetic material before cell division;
➢fatty acid synthetase, which the synthesis of new fatty acids for fat or membrane
lipid formation;
➢and components of the ribosome, which catalyzes the formation of new
polypeptides from amino acid monomers.
203
2.3. Protein Structure
➢Proteins are the polymers of amino acids.
➢Individual amino acids (residues) are joined by peptide bonds to form the linear polypeptide
chain.
➢This linear polypeptide chain is folded into specific structural conformations or simply
‘structure’.
➢A protein can have up to four levels of structural conformations.
206
Cont’d…
A protein can have Four levels of structural organization:
•They are designated as:
1.Primary Structure
2.Secondary Structure
3.Tertiary Structure
4.Quaternary Structure
207
❖3.1 Molecular techniques
•I.DNA Microarrays or Gene Chips
•II. Northern/Southern Blotting
•III. Serial Analysis of Gene Expression (SAGE)
210
3.2.Separation techniques
❖Gel Electrophoresis
✓SDS-PAGE
✓2D Gel Electrophoresis
➢Gel electrophoresis: it is a technique used for the separation of DNA, RNA,
and protein molecules according to their size and electrical current using
electric current on a gel.
➢Therefore, gel electrophoresis is used for the separation of proteins on the
basis of their net negative charge and their size.
211
3.2.2.SDS-PAGE
➢SDS-PAGE(sodium dodecylsulphate-polyacrylamide gel electrophoresis)
➢The purpose of this method is to separate proteins according to their size,
and no other physical feature.
➢In order to understand how this works, we have to understand the two halves
of the name: SDSand PAGE
213
3.2.3 2D Gel electrophoresis
➢Twodimensionalgelelectrophoresisis(2DE)isaclassicalmethodtoseparateprotein
ontheirbaseofcharge(isoelectricfocusing,IEF)andoftheirsize(SDS-PAGE).
218
Cont’d…
219
Cont’d…
220
Cont’d…
221
3.3. Chromatography
➢Theseparationofcomponentsinamixturethatinvolvespassingthemixturedissolvedina
"mobilephase"throughastationaryphase,
-whichseparatestheanalytetobemeasuredfromothermoleculesinthemixture
basedondifferentialpartitioningbetweenthemobileandstationaryphases.Itincludes…
➢High performance liquid chromatography
➢Size, exclusion chromatography-separates proteins based on their molecular size
➢Ion exchange chromatography-separates proteins based on charge
➢Reverse phase chromatography
222
cont’d…
➢affinity chromatography employs reversible interactions between specific
affinity ligands and their target proteins
➢These methods can be used to purify and identify proteins of interest, as
well as to prepare proteins for further analysis
223
3.4. Protein Microarray(protein chip)
➢They are modeled after DNA microarrays, in 2000 at Harvard University
➢The success of DNA microarrays in large-scale genomic experiments
inspired researchers to develop similar technology to enable large-scale,
high-throughput proteomic experiments.
➢Protein chips enable researchers to quickly and easily survey the entire
proteome of a cell within an organism.
224
Types of protein Microarrays
225
3.4.1. Analytical microarrays
•Different types of ligands, including antibodies, antigens, DNA or RNA aptamers,
carbohydrates or small molecules, with high affinity and specificity, are spotted down onto a
derivatized surface.
•Protein samples from two biological states to be compared are separately labelled with red
or green fluorescent dyes, mixed, and incubated with the chips. Spots in red or green color
identify an excess of proteins from one state over the other.
•These types of microarrays can be used to monitor differential expression profiles and for
clinical diagnostics.
•Examples include profiling responses to environmental stress and healthy versus disease
tissues
226
3.4.2. Functional protein microarrays (target protein arrays)
➢Immobilisedpurified proteins are used to:
-identify protein-protein/DNA/RNA
-assay enzymatic activity.
➢They differ from analytical arrays in that they contain full-length functional
proteins.
228
Cont’d…
➢Functional protein microarrays differ from analytical arrays in that
functional protein arrays are composed of arrays containing full-length
functional proteins or protein domains.
➢These protein chips are used to study the biochemical activities of an
entire proteome in a single experiment.
➢They are used to study numerous protein interactions, such as protein-
protein, protein-DNA, protein-RNA, protein-phospholipid, and protein-
small molecule interactions
229
3.5.Protein Identification techniques
I. Edman sequencing
➢Edman Degradation is a method of sequencing amino acids in a peptide. Here,
the amino-terminal residue is labeled and cleaved from the peptide without
disrupting the peptide bonds between other amino acid residues
II. Mass spectroscopy
➢is an analytical technique that measures the mass-to-charge ratio of charged particles
for determining masses of particles and the elemental composition of a sample of
molecules
232
3.6.Protein Structure techniques
I.NMR(Nuclear magnetic resonance spectroscopy)
II.X-ray crystallography
233
NMR(Nuclear magnetic resonance spectroscopy)
•It is based on the measurement of absorption of electromagnetic radiation in the
form of radio-frequency
•It is a technique which exploits the magnetic properties of certain nuclei to study
physical, chemical, and biological properties of matter
•Compared to MS, larger amounts of sample are needed, but non destructive
•Protein folding studies can be done by monitoring NMR spectra
•NMR is very efficient in mapping interactions with other molecules
•The upper weight limit for NMR structure determination is -30kDa
234
X-ray crystallography
•is a technique for examining the structure of a regular crystal.
•It only works on substances that can be crystallized.
•An x-ray source is aimed at the crystal, and the diffraction pattern that is created
as the x-rays strike the crystal are studied.
•Above 30kDa, X-ray is the only technique to solve the structure of proteins
Limitations
•Crystallization required, potential crystal packing influence the structure
•Can not be used with very flexible molecules
235
3.7.Types of Proteomics
1. Expression proteomics
➢Analysis of protein expression at larger scale
➢Quantitative determination protein expression in different cell, tissue and organ
➢Helps to identify target proteins in a particular sample, and those proteins
differentially expressed in related samples such as diseased vs. healthy tissue.
➢If a protein is found only in a diseased sample then it can be useful for drug
target or diagnostic marker
➢Proteins with the same or similar expression profiles may also be functionally
related.
236
2. Structural proteomics
–Analysis of protein structures at large-scale
–It compares protein structures and helps to identify functions of newly
discovered genes
–Helps to understand where drugs bind to proteins and also show where
proteins interact with each other
–This understanding is achieved using different technologies such as X-ray
crystallography and NMR spectroscopy.
237
1.Introduction: what is metabolomics?
➢Metabolomicsisanewlyemergingfieldofresearchconcernedwiththehigh-throughput
identificationandquantificationofthesmallmoleculemetabolitesinthemetabolome.
OR
➢Metabolomicsisthe"systematicstudyoftheuniquechemicalfingerprintsthat
specificcellularprocessesleavebehind"
-specifically,thestudyoftheirsmall-moleculemetaboliteprofiles.
241
Why is Metabolomics Important?
1.Because it deals with Small Molecules
➢>95%ofalldiagnosticclinicalassaystestforsmallmolecules
➢89%ofallknowndrugsaresmallmolecules
➢50%ofalldrugsarederivedfrompre-existingmetabolites
➢30%ofidentifiedgeneticdisordersinvolvediseasesofsmallmoleculemetabolism
➢Smallmoleculesserveascofactorsandsignalingmoleculesto1000’sofproteins
244
Con…
❖Components of HPLC;
➢Column-
✓stainless still can with stand pressure upto50MP
✓The length of the column 5-25cm
✓4-5mm in diameter
✓Flow rate 1-3ml/m
➢Stationary phase –
✓adsorbentmaterialverysmallparticlesizewhichischemicallymodifiedsilica,
divinylbenzeneetc.
254
Con…
➢Mobile phase –
✓a mixture of solvent can be used as mobile phase
✓Mobile phase solvent can be polar non polar depends on the sample
➢Detector
✓attachedtotheendofthecolumn
✓Detectinformationfromthestationaryphaseanddelivertocomputer
through
✓DifferenttypesofdetectorUVdetector,IRdetector…
255