Human_genome_project---------------.pptx

sunifeb12 15 views 29 slides Jul 18, 2024
Slide 1
Slide 1 of 29
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22
Slide 23
23
Slide 24
24
Slide 25
25
Slide 26
26
Slide 27
27
Slide 28
28
Slide 29
29

About This Presentation

Human Genome Project


Slide Content

Human Genome Project Seminal achievement. Scientific milestone. Scientific implications. Social implications.

HGP: Background International Human Genome Sequencing Consortium: Proposed 1985, endorsed in 1988. 20 governmental groups. “Public project.” Craig Venter & Celera Genomics: Founded 1998. Sequence in 3 years. Technology: automation, computers. Had access to public project’s data. Race ends in tie Feb. 2001: both publish in Science and Nature.

International Human Genome Sequencing Consortium Collaboration would be open to centers from any nation 20 centers from 6 countries - US, UK, China, France, Germany, Japan Rapid and unrestricted data release - Assembled sequences >2 kb were deposited within 24 hours of assembly

Draft Sequences, 2001 International Human Genome Sequencing Consortium (‘public project’) Initial Sequencing and Analysis of the Human Genome. Nature 409:860-921, 2001 Celera Genomics – Venter JC et al. (‘private project’) The Sequence of the Human Genome. Science 291:1304-1351, 2001.

International Human Genome Sequencing Consortium Approach was conservative and methodical. Had to wait for technology. First produced a clone-based physical map of the genome that would serve as a scaffold for the later sequence data: Broke genome into chunks of DNA whose position on chromosome was known from maps, clone into bacteria using BACs. Digest BAC-inserted clonal chunks of DNA into small fragments. Sequence small fragments. Stitch together BAC clones to assemble sequence. Assemble genome sequence from BAC clone sequences, using clone-based physical map.

Celera Approach using "shotgun sequencing" (no organized map). Shreds genome randomly into small fragments with no idea of where they are physically located. Clones and sequences fragments. Uses computer to stitch together genome by matching overlapping ends of sequenced fragments.

Timeline Genome sequencing driven by technology. 1985: 500 base pairs per day by hand. 1985-86: PCR and automated DNA sequencing. 1992: BACs. 2000: 1000 bases per second.

Challenges Data were generated in labs all over the world Organism is diploid, extremely large genome Large proportion of the human genome consists of repetitive and duplicated sequences Cloning bias (under-representation of some region of the genome)

Waiting for Technology Eyes on the human genome. While waiting for technology other genomes were sequenced.

Current Status Human genome ~3.2 Gb. “Rough draft” sequence of the human genome. Have sequenced 90% of the 2.5 Gb of gene-rich (euchromatic) DNA. What is considered finished? Fewer than 1 base in 10,000 is incorrectly assigned. More than 95% of the euchromatic regions are assigned. Each gap is smaller than 150 kb.

Major Findings of the Draft Sequence

Number of Genes Number of genes only ~ 35,000 <2% of genome encode genes Fruit fly has 13,000 genes Mustard weed has 26,000 Proteome is complex 1 gene codes up to ~ 1000 proteins Alternative splicing Variation in gene regulation Post transcription modification Hundreds of genes appear to have come from bacteria

Number of Genes Estimated from: Comparisons with other genomes Comparisons with identified genes (protein motifs, pseudogenes) Extrapolations from chr 21 and 22 Presence of CpG islands Presence of initiator, promoter or enhancer / silencer sequences Evidence of alternative splicing Known expressed sequence tags

Categorization of genes 23.2% Expression, replication, maintenance 21.1% Signal transduction 17.5% Biochemical functions of the cell 38.2% Other

Access to Information All public project data on the Internet. NCBI Website: www.ncbi.nlm.nih.gov . Human genome database. Sequence and mapping tools.

Database Search Example The genome database has many tools to locate a gene of interest or search for potential traits of the gene. Example–chromosomal map search result for the "breast cancer–causing gene" BRCA2:

Early Statistics Only 28% is transcribed into RNA. Only 1.1%-1.4% of genome actually encodes protein (=5% of transcribed RNA). Surprises: More junk DNA. Fewer genes.

Junk DNA No apparent direct biological function. Long stretches of repeated sequence. Hot area of investigation. Human genome has far more repeat DNA than any other sequenced organism (over half). Parasitic elements–45% of this repeat DNA is from selfish, parasitic DNA: Transposable elements. May play role in evolution.

Gene Count Many fewer genes than expected (half): Only 35,000-45,000 genes vs. previously predicted 100,000. Only twice the amount of a nematode or a fruit fly. Does not correlate to twice as complex. Alternative splicing: Invertebrate genes are more innovative in their assembly of genes. Protein domains are mixed more creatively and in larger numbers by invertebrates. Genes elusive.

Genetic Variation The International Single Nucleotide Polymorphism (SNP) Map. Compiled 1.4 million SNPs (single-base pair differences between individuals). Investigate: Disease resistance. Response to therapeutics. Evolution. Natural selection. Individual traits.

Gene Variation Example Mutations in "breast cancer gene” BRCA2. Chromosomal location and beginning sequence with one of the mapped variations.

Future Directions Fill gaps (refinement). Bioinformatics. Sequence additional genomes. For comparison. Upcoming: mouse, fish, dogs, kangaroo, chimpanzee (most valuable). Proteomics. Gene and Protein Chips (Microarrays).
Tags