author_facet Altschul, Stephen F.
Wootton, John C.
Gertz, E. Michael
Agarwala, Richa
Morgulis, Aleksandr
Schäffer, Alejandro A.
Yu, Yi‐Kuo
Altschul, Stephen F.
Wootton, John C.
Gertz, E. Michael
Agarwala, Richa
Morgulis, Aleksandr
Schäffer, Alejandro A.
Yu, Yi‐Kuo
author Altschul, Stephen F.
Wootton, John C.
Gertz, E. Michael
Agarwala, Richa
Morgulis, Aleksandr
Schäffer, Alejandro A.
Yu, Yi‐Kuo
spellingShingle Altschul, Stephen F.
Wootton, John C.
Gertz, E. Michael
Agarwala, Richa
Morgulis, Aleksandr
Schäffer, Alejandro A.
Yu, Yi‐Kuo
The FEBS Journal
Protein database searches using compositionally adjusted substitution matrices
Cell Biology
Molecular Biology
Biochemistry
author_sort altschul, stephen f.
spelling Altschul, Stephen F. Wootton, John C. Gertz, E. Michael Agarwala, Richa Morgulis, Aleksandr Schäffer, Alejandro A. Yu, Yi‐Kuo 1742-464X 1742-4658 Wiley Cell Biology Molecular Biology Biochemistry http://dx.doi.org/10.1111/j.1742-4658.2005.04945.x <jats:p>Almost all protein database search methods use amino acid substitution matrices for scoring, optimizing, and assessing the statistical significance of sequence alignments. Much care and effort has therefore gone into constructing substitution matrices, and the quality of search results can depend strongly upon the choice of the proper matrix. A long‐standing problem has been the comparison of sequences with biased amino acid compositions, for which standard substitution matrices are not optimal. To address this problem, we have recently developed a general procedure for transforming a standard matrix into one appropriate for the comparison of two sequences with arbitrary, and possibly differing compositions. Such adjusted matrices yield, on average, improved alignments and alignment scores when applied to the comparison of proteins with markedly biased compositions.</jats:p><jats:p>Here we review the application of compositionally adjusted matrices and consider whether they may also be applied fruitfully to general purpose protein sequence database searches, in which related sequence pairs do not necessarily have strong compositional biases. Although it is not advisable to apply compositional adjustment indiscriminately, we describe several simple criteria under which invoking such adjustment is on average beneficial. In a typical database search, at least one of these criteria is satisfied by over half the related sequence pairs. Compositional substitution matrix adjustment is now available in NCBI's protein–protein version of <jats:sc>blast</jats:sc>.</jats:p> Protein database searches using compositionally adjusted substitution matrices The FEBS Journal
doi_str_mv 10.1111/j.1742-4658.2005.04945.x
facet_avail Online
Free
finc_class_facet Chemie und Pharmazie
Biologie
format ElectronicArticle
fullrecord blob:ai-49-aHR0cDovL2R4LmRvaS5vcmcvMTAuMTExMS9qLjE3NDItNDY1OC4yMDA1LjA0OTQ1Lng
id ai-49-aHR0cDovL2R4LmRvaS5vcmcvMTAuMTExMS9qLjE3NDItNDY1OC4yMDA1LjA0OTQ1Lng
institution DE-D275
DE-Bn3
DE-Brt1
DE-Zwi2
DE-D161
DE-Gla1
DE-Zi4
DE-15
DE-Pl11
DE-Rs1
DE-105
DE-14
DE-Ch1
DE-L229
imprint Wiley, 2005
imprint_str_mv Wiley, 2005
issn 1742-464X
1742-4658
issn_str_mv 1742-464X
1742-4658
language English
mega_collection Wiley (CrossRef)
match_str altschul2005proteindatabasesearchesusingcompositionallyadjustedsubstitutionmatrices
publishDateSort 2005
publisher Wiley
recordtype ai
record_format ai
series The FEBS Journal
source_id 49
title Protein database searches using compositionally adjusted substitution matrices
title_unstemmed Protein database searches using compositionally adjusted substitution matrices
title_full Protein database searches using compositionally adjusted substitution matrices
title_fullStr Protein database searches using compositionally adjusted substitution matrices
title_full_unstemmed Protein database searches using compositionally adjusted substitution matrices
title_short Protein database searches using compositionally adjusted substitution matrices
title_sort protein database searches using compositionally adjusted substitution matrices
topic Cell Biology
Molecular Biology
Biochemistry
url http://dx.doi.org/10.1111/j.1742-4658.2005.04945.x
publishDate 2005
physical 5101-5109
description <jats:p>Almost all protein database search methods use amino acid substitution matrices for scoring, optimizing, and assessing the statistical significance of sequence alignments. Much care and effort has therefore gone into constructing substitution matrices, and the quality of search results can depend strongly upon the choice of the proper matrix. A long‐standing problem has been the comparison of sequences with biased amino acid compositions, for which standard substitution matrices are not optimal. To address this problem, we have recently developed a general procedure for transforming a standard matrix into one appropriate for the comparison of two sequences with arbitrary, and possibly differing compositions. Such adjusted matrices yield, on average, improved alignments and alignment scores when applied to the comparison of proteins with markedly biased compositions.</jats:p><jats:p>Here we review the application of compositionally adjusted matrices and consider whether they may also be applied fruitfully to general purpose protein sequence database searches, in which related sequence pairs do not necessarily have strong compositional biases. Although it is not advisable to apply compositional adjustment indiscriminately, we describe several simple criteria under which invoking such adjustment is on average beneficial. In a typical database search, at least one of these criteria is satisfied by over half the related sequence pairs. Compositional substitution matrix adjustment is now available in NCBI's protein–protein version of <jats:sc>blast</jats:sc>.</jats:p>
container_issue 20
container_start_page 5101
container_title The FEBS Journal
container_volume 272
format_de105 Article, E-Article
format_de14 Article, E-Article
format_de15 Article, E-Article
format_de520 Article, E-Article
format_de540 Article, E-Article
format_dech1 Article, E-Article
format_ded117 Article, E-Article
format_degla1 E-Article
format_del152 Buch
format_del189 Article, E-Article
format_dezi4 Article
format_dezwi2 Article, E-Article
format_finc Article, E-Article
format_nrw Article, E-Article
_version_ 1792346889695264774
geogr_code not assigned
last_indexed 2024-03-01T17:46:34.072Z
geogr_code_person not assigned
openURL url_ver=Z39.88-2004&ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fvufind.svn.sourceforge.net%3Agenerator&rft.title=Protein+database+searches+using+compositionally+adjusted+substitution+matrices&rft.date=2005-10-01&genre=article&issn=1742-4658&volume=272&issue=20&spage=5101&epage=5109&pages=5101-5109&jtitle=The+FEBS+Journal&atitle=Protein+database+searches+using+compositionally+adjusted+substitution+matrices&aulast=Yu&aufirst=Yi%E2%80%90Kuo&rft_id=info%3Adoi%2F10.1111%2Fj.1742-4658.2005.04945.x&rft.language%5B0%5D=eng
SOLR
_version_ 1792346889695264774
author Altschul, Stephen F., Wootton, John C., Gertz, E. Michael, Agarwala, Richa, Morgulis, Aleksandr, Schäffer, Alejandro A., Yu, Yi‐Kuo
author_facet Altschul, Stephen F., Wootton, John C., Gertz, E. Michael, Agarwala, Richa, Morgulis, Aleksandr, Schäffer, Alejandro A., Yu, Yi‐Kuo, Altschul, Stephen F., Wootton, John C., Gertz, E. Michael, Agarwala, Richa, Morgulis, Aleksandr, Schäffer, Alejandro A., Yu, Yi‐Kuo
author_sort altschul, stephen f.
container_issue 20
container_start_page 5101
container_title The FEBS Journal
container_volume 272
description <jats:p>Almost all protein database search methods use amino acid substitution matrices for scoring, optimizing, and assessing the statistical significance of sequence alignments. Much care and effort has therefore gone into constructing substitution matrices, and the quality of search results can depend strongly upon the choice of the proper matrix. A long‐standing problem has been the comparison of sequences with biased amino acid compositions, for which standard substitution matrices are not optimal. To address this problem, we have recently developed a general procedure for transforming a standard matrix into one appropriate for the comparison of two sequences with arbitrary, and possibly differing compositions. Such adjusted matrices yield, on average, improved alignments and alignment scores when applied to the comparison of proteins with markedly biased compositions.</jats:p><jats:p>Here we review the application of compositionally adjusted matrices and consider whether they may also be applied fruitfully to general purpose protein sequence database searches, in which related sequence pairs do not necessarily have strong compositional biases. Although it is not advisable to apply compositional adjustment indiscriminately, we describe several simple criteria under which invoking such adjustment is on average beneficial. In a typical database search, at least one of these criteria is satisfied by over half the related sequence pairs. Compositional substitution matrix adjustment is now available in NCBI's protein–protein version of <jats:sc>blast</jats:sc>.</jats:p>
doi_str_mv 10.1111/j.1742-4658.2005.04945.x
facet_avail Online, Free
finc_class_facet Chemie und Pharmazie, Biologie
format ElectronicArticle
format_de105 Article, E-Article
format_de14 Article, E-Article
format_de15 Article, E-Article
format_de520 Article, E-Article
format_de540 Article, E-Article
format_dech1 Article, E-Article
format_ded117 Article, E-Article
format_degla1 E-Article
format_del152 Buch
format_del189 Article, E-Article
format_dezi4 Article
format_dezwi2 Article, E-Article
format_finc Article, E-Article
format_nrw Article, E-Article
geogr_code not assigned
geogr_code_person not assigned
id ai-49-aHR0cDovL2R4LmRvaS5vcmcvMTAuMTExMS9qLjE3NDItNDY1OC4yMDA1LjA0OTQ1Lng
imprint Wiley, 2005
imprint_str_mv Wiley, 2005
institution DE-D275, DE-Bn3, DE-Brt1, DE-Zwi2, DE-D161, DE-Gla1, DE-Zi4, DE-15, DE-Pl11, DE-Rs1, DE-105, DE-14, DE-Ch1, DE-L229
issn 1742-464X, 1742-4658
issn_str_mv 1742-464X, 1742-4658
language English
last_indexed 2024-03-01T17:46:34.072Z
match_str altschul2005proteindatabasesearchesusingcompositionallyadjustedsubstitutionmatrices
mega_collection Wiley (CrossRef)
physical 5101-5109
publishDate 2005
publishDateSort 2005
publisher Wiley
record_format ai
recordtype ai
series The FEBS Journal
source_id 49
spelling Altschul, Stephen F. Wootton, John C. Gertz, E. Michael Agarwala, Richa Morgulis, Aleksandr Schäffer, Alejandro A. Yu, Yi‐Kuo 1742-464X 1742-4658 Wiley Cell Biology Molecular Biology Biochemistry http://dx.doi.org/10.1111/j.1742-4658.2005.04945.x <jats:p>Almost all protein database search methods use amino acid substitution matrices for scoring, optimizing, and assessing the statistical significance of sequence alignments. Much care and effort has therefore gone into constructing substitution matrices, and the quality of search results can depend strongly upon the choice of the proper matrix. A long‐standing problem has been the comparison of sequences with biased amino acid compositions, for which standard substitution matrices are not optimal. To address this problem, we have recently developed a general procedure for transforming a standard matrix into one appropriate for the comparison of two sequences with arbitrary, and possibly differing compositions. Such adjusted matrices yield, on average, improved alignments and alignment scores when applied to the comparison of proteins with markedly biased compositions.</jats:p><jats:p>Here we review the application of compositionally adjusted matrices and consider whether they may also be applied fruitfully to general purpose protein sequence database searches, in which related sequence pairs do not necessarily have strong compositional biases. Although it is not advisable to apply compositional adjustment indiscriminately, we describe several simple criteria under which invoking such adjustment is on average beneficial. In a typical database search, at least one of these criteria is satisfied by over half the related sequence pairs. Compositional substitution matrix adjustment is now available in NCBI's protein–protein version of <jats:sc>blast</jats:sc>.</jats:p> Protein database searches using compositionally adjusted substitution matrices The FEBS Journal
spellingShingle Altschul, Stephen F., Wootton, John C., Gertz, E. Michael, Agarwala, Richa, Morgulis, Aleksandr, Schäffer, Alejandro A., Yu, Yi‐Kuo, The FEBS Journal, Protein database searches using compositionally adjusted substitution matrices, Cell Biology, Molecular Biology, Biochemistry
title Protein database searches using compositionally adjusted substitution matrices
title_full Protein database searches using compositionally adjusted substitution matrices
title_fullStr Protein database searches using compositionally adjusted substitution matrices
title_full_unstemmed Protein database searches using compositionally adjusted substitution matrices
title_short Protein database searches using compositionally adjusted substitution matrices
title_sort protein database searches using compositionally adjusted substitution matrices
title_unstemmed Protein database searches using compositionally adjusted substitution matrices
topic Cell Biology, Molecular Biology, Biochemistry
url http://dx.doi.org/10.1111/j.1742-4658.2005.04945.x