An integrated publicly accessible bioinformatics resource to support genomic/proteomic research and scientific discovery.
Established in 1984, by the National Biomedical Research Foundation (NBRF) Georgetown University Medial Center, Washington D.C., USA.
It is the source of annotated protein datab...
An integrated publicly accessible bioinformatics resource to support genomic/proteomic research and scientific discovery.
Established in 1984, by the National Biomedical Research Foundation (NBRF) Georgetown University Medial Center, Washington D.C., USA.
It is the source of annotated protein databases and analysis tools for the researchers.
Serve as primary resource for the exploration of protein information.
Accessible by text search for entry and list retrieval, and also BLAST search and peptide match.
Features of PIR
Comprehensive,Non-redundant,Annotateddatabase
containproteinsequencesofprokaryotes,eukaryotes,
viruses,phages,archaea.
Dataiswellorganized.Entriesclassifiedintoprotein
familyandsuper-family.
ProteinSequenceDatabase(PSD)cross-referencesto
othergenomicandproteomicpublicdatabases
Updatedweeklyandfullreleasearepublished
quarterly.
Providecrossreferencebetweenitsowndatabases.
Resources of PIR
TheresourcesofPIRcanbebroadlyclassifiedintotwo
categories:
1.Dataretrievalsystems
2.Databases
Data Retrieval in PIR
Data Retrieval in PIR consist of search engines of three types.
Interactive text-based
search engine
Standard Sequence
similarity search engines
Advanced Search
Engines
Boolean queries of
text fields
Peptide match
Pattern match
BLAST
FASTA
Pair-wise alignment
Multiple alignment
0 (false)
1 (true)
Combine sequence
similarity and
annotation searches
Evaluation of gene-
family relationship
Databases of PIR
UniProt-Universal Protein Resource
PIR+
EBI (European Bioinformatics Institute)
SIB (Swiss Institute of Bioinformatics)
UniProt
United Protein Database
Central resource of Protein Sequence & Function
UniProt-Universal Protein Resource
The UniProtdatabase consist of the following three database:
1.UniProtKnowledgebase (UniProtKB)
2.UniProtReference Cluster (UniRef)
3.UniProtArchive (UniParc)
UniProtKnowledgebase (UniProtKB)
•Centraldatabaseofproteinsequenceswithannotationandfunctionalinformation.
•Providesinglerecordforallproteinproductsderivedfromacertaingenefroma
certainspecies.
•Givedetailsofaccessionnumber,alternativesplicing,proteolyticcleavage,post-
translationalmodificationstoeachfromofderivedprotein.
2 Parts
Contain Manually Annotated Records Contain Computationally Analyzed Records
UniProt/Swiss-Prot UniProt/TrEMBL
Which have to be manually annotated
iProClass-Integrated Protein
Knowledgebase
•Providescomprehensivedescriptionofaproteinfamily,functionand
structureforUniProtproteinsequences,andserveasaframeworkfor
dataintegrationinadistributednetworkingenvironment.
•Containnon-redundantproteinsequencesfromPIR-PSD,Swiss-Prot,
TrEMBL.
iProClass
Family relationships
Structural
classifications
Functional
classifications
Global level
(superfamily, family)
Local level
(domain, motif, site)
Types of Protein sequence reports
iProClass
2 Types
1st Types 2nd Types
Cover information on
Structure
Function
Family
Genetics
Disease
Ontology
Taxonomy
Literature
With reference to
relevant molecular
databases
Super-family report with
Length
Taxonomy
Keyword statistics
Complete member listing
PIRSF-Protein Family Classification
System
•PIRextendeditssuper-familyconceptanddevelopedtheSuper-
FamilyClassificationsystem.
•Tofacilitatethesensiblepropagationandstandardizationofprotein
annotationandsystematicdetectionofannotationerrors.
•Consistsoftwodatasets:Preliminaryclustersandcuratedfamilies.
•Curatedfamiliesincludefamilyname,proteinmembership,parent-
childrelationship,domainarchitecture,optionaldescriptionand
bibliography.