SlideShare a Scribd company logo
BLAST
(Basic Local Alignment Search Tool)
• Developed by Steven Altschul and Samuel
Karlin in 1990.
• Compares nucleotide/aminoacid sequences
• Is a heuristic method.
• Is a fast but approximate method of alignment.
• Locates local alignments/short matches called
words
Uses of BLAST:
• Search a database for sequences similar
to an input sequence.
• Identify previously characterized
sequences.
• Find phylogenetically related sequences.
• Identify possible functions based on
similarities to known sequences.
Types of BLAST:
Types of BLAST:
 blastp: compares a protein sequence against a protein
sequence database.
 blastn: compares a nucleotide sequence against a
nucleotide sequence database.
 blastx: compares a six frame translation of a
nucleotide sequence against a protein database
 tblastn: compares a protein sequence against a six
frame translation of a nucleotide database
 tblastx: compares a six frame translation of a
nucleotide sequence against a six frame translation of
a nucleotide database.
How BLAST works
•Blast searches begin with a query sequence that
will be matched against sequence databases
specified by the user.
•Begins by breaking down the query sequence
into a series of short overlapping “words”
•Default word size for BLAST N is 28 nucleotides
•Default word size for BLAST P is 3 amino acids
•Results obtained depend on the scoring matrix
used.
•BLOSUM 62 matrix is the default scoring matrix
for BLASTP
The basic strategy used by
the BLAST algorithms
The BLASTP algorithm
• Query sequence is broken into all possible 3-letter
words using a moving window
• Numerical score is calculated for each word by adding
up the values for the amino acids from the BLOSUM62
matrix
• Words with a score of 12 or more are collected into the
initial BLASTP search set.
• The search set is broadened by adding synonyms that
differ from the words at one position.
• Only synonyms with scores above a threshold value are
added to the search set. NCBI BLASTP uses a default
threshold of 10 for synonyms
Contd….
 Using this search set, BLAST scans a database
and identifies word hits/matches that score
above the threshold.
 These short matches serve as seeds. The BLAST
algorithm attempts to extend the match in the
immediate sequence neighborhood
 BLAST keeps a running raw score, using scoring
matrices, as it extends the matches. Each new
amino acid either increases or decreases the raw
score
 Penalties are assigned for mismatches and for
gaps between the two alignments.
• In the NCBI default settings, a gap brings an initial
penalty of 11, which increases by 1 for each missing
amino acid.
• Once the score falls below a set level, the
alignment ceases and blast stops trying to extend
the alignment.
• An extended sequence alignment that was initially
seeded by a word hit is produced -called an hsp, or
high-scoring segment pair
Contd….
Contd….
 All HSPs that have a cumulative score above the
threshold score are reported in BLAST results.
 Raw scores are then converted into bit scores by
correcting for the scoring matrix used
The Blast output
 Includes a table with the bit scores (S) for each alignment and its E-
value, or “expect score”
 the score (S) is a measure of the quality of an alignment (calculated as
the sum of substitution and gap scores for each aligned residue)
 E-value (E), or expectation value is a measure of the significance of the
alignment. The E-value is the number of different alignments, with
scores equivalent to or better than S, that are expected to occur in a
database search by chance.
 The lower the E-value, the more significant the alignment result.
 Alignments with the highest bit scores and lowest E-values are listed at
the top of the table.
How a BLAST result looks
The query sequence - numbered red bar at the top of the figure. Database hits are
shown aligned to the query, below the red bar. Of the aligned sequences, the most
similar are shown closest to the query. In this case, there are three high scoring
database matches that align to most of the query sequence. The next twelve bars
represent lower-scoring matches that align to two regions of the query, from about
residues 3–60 and residues 220–500. The cross-hatched parts of the these bars
indicate that the two regions of similarity are on the same protein, but that this
intervening region does not match. The remaining bars show lower-scoring
alignments. Mousing over the bars displays the definition line for that sequence to
be shown in the window above the graphic.
One-line descriptions in the BLAST report
Each line is composed of four fields: (a) the gi number, database designation, accession
number, and locus name for the matched sequence, separated by vertical bars (appendix 1);
(b) a brief textual description of the sequence, the definition. This usually includes
information on the organism from which the sequence was derived, the type of sequence
(e.G., mRNA or DNA), and some information about function or phenotype. The definition
line is often truncated in the one-line descriptions to keep the display compact; (c) the
alignment score in bits. Higher scoring hits are found at the top of the list; and (d) the e-
value, which provides an estimate of statistical significance. For the first hit in the list, the
gi number is 116365, the database designation is sp (for SWISS-PROT), the accession
number is P26374, the locus name is RAE2_HUMAN, the definition line is rab proteins, the
A pairwise sequence alignment from a BLAST report
The alignment is preceded by the sequence identifier, the full definition line, and the length of the matched
sequence, in amino acids. Next comes the bit score (the raw score is in parentheses) and then the E-value. The
following line contains information on the number of identical residues in this alignment (Identities), the
number of conservative substitutions (Positives), and if applicable, the number of gaps in the alignment.
Finally, the actual alignment is shown, with the query on top, and the database match is labeled as Sbjct,
below. The numbers at left and right refer to the position in the amino acid sequence. One or more dashes (–)
within a sequence indicate insertions or deletions. Amino acid residues in the query sequence that have been
masked because of low complexity are replaced by Xs (see, for example, the fourth and last blocks). The line
between the two sequences indicates the similarities between the sequences. If the query and the subject

More Related Content

Similar to BLAST : features, types,algorithm, working etc.

Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
ALLIENU
 
Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)
AnkitTiwari354
 
BLAST AND FASTA.pptx
BLAST AND FASTA.pptxBLAST AND FASTA.pptx
BLAST AND FASTA.pptx
PiyushBehgal1
 
BLAST(Basic Local Alignment Tool)
BLAST(Basic Local Alignment Tool)BLAST(Basic Local Alignment Tool)
BLAST(Basic Local Alignment Tool)
Sobia
 
Lecture 5.pptx
Lecture 5.pptxLecture 5.pptx
Lecture 5.pptx
ericndunek
 
Blast fasta
Blast fastaBlast fasta
Blast fastayaghava
 
FASTA
FASTAFASTA
BLAST AND FASTA.pptx12345789999987544321234
BLAST AND FASTA.pptx12345789999987544321234BLAST AND FASTA.pptx12345789999987544321234
BLAST AND FASTA.pptx12345789999987544321234
alizain9604
 
Blast gp assignment
Blast  gp assignmentBlast  gp assignment
Blast gp assignment
barathvaj
 
Seq alignment
Seq alignment Seq alignment
Seq alignment
Nagendrasahu6
 
Blast Algorithm
Blast AlgorithmBlast Algorithm
Presentation for blast algorithm bio-informatice
Presentation for blast algorithm bio-informaticePresentation for blast algorithm bio-informatice
Presentation for blast algorithm bio-informatice
zahid6
 
BLAST
BLASTBLAST
Sequence similarity tools.pptx
Sequence similarity tools.pptxSequence similarity tools.pptx
Sequence similarity tools.pptx
PagudalaSangeetha
 
Database Searching
Database SearchingDatabase Searching
Database Searching
Meghaj Mallick
 
BLAST
BLASTBLAST
BLAST
rishabhaks
 
sequence alignment and matrices for makaut
sequence alignment and matrices for makautsequence alignment and matrices for makaut
sequence alignment and matrices for makaut
Soumyajitdey27
 

Similar to BLAST : features, types,algorithm, working etc. (20)

Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
 
Blasta
BlastaBlasta
Blasta
 
Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)
 
BLAST AND FASTA.pptx
BLAST AND FASTA.pptxBLAST AND FASTA.pptx
BLAST AND FASTA.pptx
 
BLAST(Basic Local Alignment Tool)
BLAST(Basic Local Alignment Tool)BLAST(Basic Local Alignment Tool)
BLAST(Basic Local Alignment Tool)
 
Lecture 5.pptx
Lecture 5.pptxLecture 5.pptx
Lecture 5.pptx
 
Blast fasta
Blast fastaBlast fasta
Blast fasta
 
FASTA
FASTAFASTA
FASTA
 
BLAST AND FASTA.pptx12345789999987544321234
BLAST AND FASTA.pptx12345789999987544321234BLAST AND FASTA.pptx12345789999987544321234
BLAST AND FASTA.pptx12345789999987544321234
 
Blast gp assignment
Blast  gp assignmentBlast  gp assignment
Blast gp assignment
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Seq alignment
Seq alignment Seq alignment
Seq alignment
 
Blast 2013 1
Blast 2013 1Blast 2013 1
Blast 2013 1
 
Blast Algorithm
Blast AlgorithmBlast Algorithm
Blast Algorithm
 
Presentation for blast algorithm bio-informatice
Presentation for blast algorithm bio-informaticePresentation for blast algorithm bio-informatice
Presentation for blast algorithm bio-informatice
 
BLAST
BLASTBLAST
BLAST
 
Sequence similarity tools.pptx
Sequence similarity tools.pptxSequence similarity tools.pptx
Sequence similarity tools.pptx
 
Database Searching
Database SearchingDatabase Searching
Database Searching
 
BLAST
BLASTBLAST
BLAST
 
sequence alignment and matrices for makaut
sequence alignment and matrices for makautsequence alignment and matrices for makaut
sequence alignment and matrices for makaut
 

More from Cherry

Large scale production of streptomycin.pptx
Large scale production of streptomycin.pptxLarge scale production of streptomycin.pptx
Large scale production of streptomycin.pptx
Cherry
 
INDUSTRIAL PRODUCTION OF ETHANOL.....pptx
INDUSTRIAL PRODUCTION OF ETHANOL.....pptxINDUSTRIAL PRODUCTION OF ETHANOL.....pptx
INDUSTRIAL PRODUCTION OF ETHANOL.....pptx
Cherry
 
AMYLASE..............................pptx
AMYLASE..............................pptxAMYLASE..............................pptx
AMYLASE..............................pptx
Cherry
 
Penicillin...........................pptx
Penicillin...........................pptxPenicillin...........................pptx
Penicillin...........................pptx
Cherry
 
RETROGRESSIVE CHANGES, CONCEPT OF CLIMAX COMMUNITIES AND RESILIENCE OF COMMU...
RETROGRESSIVE CHANGES, CONCEPT OF  CLIMAX COMMUNITIES AND RESILIENCE OF COMMU...RETROGRESSIVE CHANGES, CONCEPT OF  CLIMAX COMMUNITIES AND RESILIENCE OF COMMU...
RETROGRESSIVE CHANGES, CONCEPT OF CLIMAX COMMUNITIES AND RESILIENCE OF COMMU...
Cherry
 
COMMUNITY DYNAMICS CHARACTERISTICS- CYCLIC AND NON-CYCLIC REPLACEMENT CHANGES...
COMMUNITY DYNAMICS CHARACTERISTICS- CYCLIC AND NON-CYCLIC REPLACEMENT CHANGES...COMMUNITY DYNAMICS CHARACTERISTICS- CYCLIC AND NON-CYCLIC REPLACEMENT CHANGES...
COMMUNITY DYNAMICS CHARACTERISTICS- CYCLIC AND NON-CYCLIC REPLACEMENT CHANGES...
Cherry
 
Remote sensing.......................pptx
Remote sensing.......................pptxRemote sensing.......................pptx
Remote sensing.......................pptx
Cherry
 
METHODS OF TRANSCRIPTOME ANALYSIS....pptx
METHODS OF TRANSCRIPTOME ANALYSIS....pptxMETHODS OF TRANSCRIPTOME ANALYSIS....pptx
METHODS OF TRANSCRIPTOME ANALYSIS....pptx
Cherry
 
AIZOACEAE............................pptx
AIZOACEAE............................pptxAIZOACEAE............................pptx
AIZOACEAE............................pptx
Cherry
 
Cryoprervation techniques.............pptx
Cryoprervation techniques.............pptxCryoprervation techniques.............pptx
Cryoprervation techniques.............pptx
Cherry
 
APPLICATIONS OF GM ANIMALS...........pptx
APPLICATIONS OF GM ANIMALS...........pptxAPPLICATIONS OF GM ANIMALS...........pptx
APPLICATIONS OF GM ANIMALS...........pptx
Cherry
 
Tropical coastal ecosystems...........pptx
Tropical coastal ecosystems...........pptxTropical coastal ecosystems...........pptx
Tropical coastal ecosystems...........pptx
Cherry
 
Phytogeography........................pptx
Phytogeography........................pptxPhytogeography........................pptx
Phytogeography........................pptx
Cherry
 
Structural annotation................pptx
Structural annotation................pptxStructural annotation................pptx
Structural annotation................pptx
Cherry
 
Adventitious shoot regeneration.....pptx
Adventitious shoot regeneration.....pptxAdventitious shoot regeneration.....pptx
Adventitious shoot regeneration.....pptx
Cherry
 
Tissue engineering......................pptx
Tissue engineering......................pptxTissue engineering......................pptx
Tissue engineering......................pptx
Cherry
 
Triploidy ...............................pptx
Triploidy ...............................pptxTriploidy ...............................pptx
Triploidy ...............................pptx
Cherry
 
SYNTHETIC SEED PRODUCTION.............pptx
SYNTHETIC SEED PRODUCTION.............pptxSYNTHETIC SEED PRODUCTION.............pptx
SYNTHETIC SEED PRODUCTION.............pptx
Cherry
 
Reporter genes.......................pptx
Reporter genes.......................pptxReporter genes.......................pptx
Reporter genes.......................pptx
Cherry
 
Somaclonal Variation.....................pptx
Somaclonal Variation.....................pptxSomaclonal Variation.....................pptx
Somaclonal Variation.....................pptx
Cherry
 

More from Cherry (20)

Large scale production of streptomycin.pptx
Large scale production of streptomycin.pptxLarge scale production of streptomycin.pptx
Large scale production of streptomycin.pptx
 
INDUSTRIAL PRODUCTION OF ETHANOL.....pptx
INDUSTRIAL PRODUCTION OF ETHANOL.....pptxINDUSTRIAL PRODUCTION OF ETHANOL.....pptx
INDUSTRIAL PRODUCTION OF ETHANOL.....pptx
 
AMYLASE..............................pptx
AMYLASE..............................pptxAMYLASE..............................pptx
AMYLASE..............................pptx
 
Penicillin...........................pptx
Penicillin...........................pptxPenicillin...........................pptx
Penicillin...........................pptx
 
RETROGRESSIVE CHANGES, CONCEPT OF CLIMAX COMMUNITIES AND RESILIENCE OF COMMU...
RETROGRESSIVE CHANGES, CONCEPT OF  CLIMAX COMMUNITIES AND RESILIENCE OF COMMU...RETROGRESSIVE CHANGES, CONCEPT OF  CLIMAX COMMUNITIES AND RESILIENCE OF COMMU...
RETROGRESSIVE CHANGES, CONCEPT OF CLIMAX COMMUNITIES AND RESILIENCE OF COMMU...
 
COMMUNITY DYNAMICS CHARACTERISTICS- CYCLIC AND NON-CYCLIC REPLACEMENT CHANGES...
COMMUNITY DYNAMICS CHARACTERISTICS- CYCLIC AND NON-CYCLIC REPLACEMENT CHANGES...COMMUNITY DYNAMICS CHARACTERISTICS- CYCLIC AND NON-CYCLIC REPLACEMENT CHANGES...
COMMUNITY DYNAMICS CHARACTERISTICS- CYCLIC AND NON-CYCLIC REPLACEMENT CHANGES...
 
Remote sensing.......................pptx
Remote sensing.......................pptxRemote sensing.......................pptx
Remote sensing.......................pptx
 
METHODS OF TRANSCRIPTOME ANALYSIS....pptx
METHODS OF TRANSCRIPTOME ANALYSIS....pptxMETHODS OF TRANSCRIPTOME ANALYSIS....pptx
METHODS OF TRANSCRIPTOME ANALYSIS....pptx
 
AIZOACEAE............................pptx
AIZOACEAE............................pptxAIZOACEAE............................pptx
AIZOACEAE............................pptx
 
Cryoprervation techniques.............pptx
Cryoprervation techniques.............pptxCryoprervation techniques.............pptx
Cryoprervation techniques.............pptx
 
APPLICATIONS OF GM ANIMALS...........pptx
APPLICATIONS OF GM ANIMALS...........pptxAPPLICATIONS OF GM ANIMALS...........pptx
APPLICATIONS OF GM ANIMALS...........pptx
 
Tropical coastal ecosystems...........pptx
Tropical coastal ecosystems...........pptxTropical coastal ecosystems...........pptx
Tropical coastal ecosystems...........pptx
 
Phytogeography........................pptx
Phytogeography........................pptxPhytogeography........................pptx
Phytogeography........................pptx
 
Structural annotation................pptx
Structural annotation................pptxStructural annotation................pptx
Structural annotation................pptx
 
Adventitious shoot regeneration.....pptx
Adventitious shoot regeneration.....pptxAdventitious shoot regeneration.....pptx
Adventitious shoot regeneration.....pptx
 
Tissue engineering......................pptx
Tissue engineering......................pptxTissue engineering......................pptx
Tissue engineering......................pptx
 
Triploidy ...............................pptx
Triploidy ...............................pptxTriploidy ...............................pptx
Triploidy ...............................pptx
 
SYNTHETIC SEED PRODUCTION.............pptx
SYNTHETIC SEED PRODUCTION.............pptxSYNTHETIC SEED PRODUCTION.............pptx
SYNTHETIC SEED PRODUCTION.............pptx
 
Reporter genes.......................pptx
Reporter genes.......................pptxReporter genes.......................pptx
Reporter genes.......................pptx
 
Somaclonal Variation.....................pptx
Somaclonal Variation.....................pptxSomaclonal Variation.....................pptx
Somaclonal Variation.....................pptx
 

Recently uploaded

GBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram StainingGBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram Staining
Areesha Ahmad
 
Hemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptxHemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptx
muralinath2
 
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
ChetanK57
 
Cancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate PathwayCancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate Pathway
AADYARAJPANDEY1
 
EY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptxEY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptx
AlguinaldoKong
 
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
Sérgio Sacani
 
Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...
Sérgio Sacani
 
Anemia_ different types_causes_ conditions
Anemia_ different types_causes_ conditionsAnemia_ different types_causes_ conditions
Anemia_ different types_causes_ conditions
muralinath2
 
Structural Classification Of Protein (SCOP)
Structural Classification Of Protein  (SCOP)Structural Classification Of Protein  (SCOP)
Structural Classification Of Protein (SCOP)
aishnasrivastava
 
platelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptxplatelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptx
muralinath2
 
Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
Columbia Weather Systems
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Sérgio Sacani
 
justice-and-fairness-ethics with example
justice-and-fairness-ethics with examplejustice-and-fairness-ethics with example
justice-and-fairness-ethics with example
azzyixes
 
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
Health Advances
 
NuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final versionNuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final version
pablovgd
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
SAMIR PANDA
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
muralinath2
 
filosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptxfilosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptx
IvanMallco1
 
Predicting property prices with machine learning algorithms.pdf
Predicting property prices with machine learning algorithms.pdfPredicting property prices with machine learning algorithms.pdf
Predicting property prices with machine learning algorithms.pdf
binhminhvu04
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
AlaminAfendy1
 

Recently uploaded (20)

GBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram StainingGBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram Staining
 
Hemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptxHemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptx
 
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
 
Cancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate PathwayCancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate Pathway
 
EY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptxEY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptx
 
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
 
Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...
 
Anemia_ different types_causes_ conditions
Anemia_ different types_causes_ conditionsAnemia_ different types_causes_ conditions
Anemia_ different types_causes_ conditions
 
Structural Classification Of Protein (SCOP)
Structural Classification Of Protein  (SCOP)Structural Classification Of Protein  (SCOP)
Structural Classification Of Protein (SCOP)
 
platelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptxplatelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptx
 
Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
 
justice-and-fairness-ethics with example
justice-and-fairness-ethics with examplejustice-and-fairness-ethics with example
justice-and-fairness-ethics with example
 
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
 
NuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final versionNuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final version
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
 
filosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptxfilosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptx
 
Predicting property prices with machine learning algorithms.pdf
Predicting property prices with machine learning algorithms.pdfPredicting property prices with machine learning algorithms.pdf
Predicting property prices with machine learning algorithms.pdf
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
 

BLAST : features, types,algorithm, working etc.

  • 1. BLAST (Basic Local Alignment Search Tool) • Developed by Steven Altschul and Samuel Karlin in 1990. • Compares nucleotide/aminoacid sequences • Is a heuristic method. • Is a fast but approximate method of alignment. • Locates local alignments/short matches called words
  • 2. Uses of BLAST: • Search a database for sequences similar to an input sequence. • Identify previously characterized sequences. • Find phylogenetically related sequences. • Identify possible functions based on similarities to known sequences.
  • 4. Types of BLAST:  blastp: compares a protein sequence against a protein sequence database.  blastn: compares a nucleotide sequence against a nucleotide sequence database.  blastx: compares a six frame translation of a nucleotide sequence against a protein database  tblastn: compares a protein sequence against a six frame translation of a nucleotide database  tblastx: compares a six frame translation of a nucleotide sequence against a six frame translation of a nucleotide database.
  • 5. How BLAST works •Blast searches begin with a query sequence that will be matched against sequence databases specified by the user. •Begins by breaking down the query sequence into a series of short overlapping “words” •Default word size for BLAST N is 28 nucleotides •Default word size for BLAST P is 3 amino acids •Results obtained depend on the scoring matrix used. •BLOSUM 62 matrix is the default scoring matrix for BLASTP
  • 6. The basic strategy used by the BLAST algorithms
  • 7. The BLASTP algorithm • Query sequence is broken into all possible 3-letter words using a moving window • Numerical score is calculated for each word by adding up the values for the amino acids from the BLOSUM62 matrix • Words with a score of 12 or more are collected into the initial BLASTP search set. • The search set is broadened by adding synonyms that differ from the words at one position. • Only synonyms with scores above a threshold value are added to the search set. NCBI BLASTP uses a default threshold of 10 for synonyms
  • 8.
  • 9. Contd….  Using this search set, BLAST scans a database and identifies word hits/matches that score above the threshold.  These short matches serve as seeds. The BLAST algorithm attempts to extend the match in the immediate sequence neighborhood  BLAST keeps a running raw score, using scoring matrices, as it extends the matches. Each new amino acid either increases or decreases the raw score  Penalties are assigned for mismatches and for gaps between the two alignments.
  • 10. • In the NCBI default settings, a gap brings an initial penalty of 11, which increases by 1 for each missing amino acid. • Once the score falls below a set level, the alignment ceases and blast stops trying to extend the alignment. • An extended sequence alignment that was initially seeded by a word hit is produced -called an hsp, or high-scoring segment pair Contd….
  • 11. Contd….  All HSPs that have a cumulative score above the threshold score are reported in BLAST results.  Raw scores are then converted into bit scores by correcting for the scoring matrix used
  • 12.
  • 13. The Blast output  Includes a table with the bit scores (S) for each alignment and its E- value, or “expect score”  the score (S) is a measure of the quality of an alignment (calculated as the sum of substitution and gap scores for each aligned residue)  E-value (E), or expectation value is a measure of the significance of the alignment. The E-value is the number of different alignments, with scores equivalent to or better than S, that are expected to occur in a database search by chance.  The lower the E-value, the more significant the alignment result.  Alignments with the highest bit scores and lowest E-values are listed at the top of the table.
  • 14. How a BLAST result looks
  • 15. The query sequence - numbered red bar at the top of the figure. Database hits are shown aligned to the query, below the red bar. Of the aligned sequences, the most similar are shown closest to the query. In this case, there are three high scoring database matches that align to most of the query sequence. The next twelve bars represent lower-scoring matches that align to two regions of the query, from about residues 3–60 and residues 220–500. The cross-hatched parts of the these bars indicate that the two regions of similarity are on the same protein, but that this intervening region does not match. The remaining bars show lower-scoring alignments. Mousing over the bars displays the definition line for that sequence to be shown in the window above the graphic.
  • 16. One-line descriptions in the BLAST report Each line is composed of four fields: (a) the gi number, database designation, accession number, and locus name for the matched sequence, separated by vertical bars (appendix 1); (b) a brief textual description of the sequence, the definition. This usually includes information on the organism from which the sequence was derived, the type of sequence (e.G., mRNA or DNA), and some information about function or phenotype. The definition line is often truncated in the one-line descriptions to keep the display compact; (c) the alignment score in bits. Higher scoring hits are found at the top of the list; and (d) the e- value, which provides an estimate of statistical significance. For the first hit in the list, the gi number is 116365, the database designation is sp (for SWISS-PROT), the accession number is P26374, the locus name is RAE2_HUMAN, the definition line is rab proteins, the
  • 17. A pairwise sequence alignment from a BLAST report The alignment is preceded by the sequence identifier, the full definition line, and the length of the matched sequence, in amino acids. Next comes the bit score (the raw score is in parentheses) and then the E-value. The following line contains information on the number of identical residues in this alignment (Identities), the number of conservative substitutions (Positives), and if applicable, the number of gaps in the alignment. Finally, the actual alignment is shown, with the query on top, and the database match is labeled as Sbjct, below. The numbers at left and right refer to the position in the amino acid sequence. One or more dashes (–) within a sequence indicate insertions or deletions. Amino acid residues in the query sequence that have been masked because of low complexity are replaced by Xs (see, for example, the fourth and last blocks). The line between the two sequences indicates the similarities between the sequences. If the query and the subject