Protein database searches using compositionally adjusted substitution matrices
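Journal title: | The FEBS Journal |
---|---|
Authors and corporate bodies: | Altschul, Stephen F.; Wootton, John C.; Gertz, E. Michael; Agarwala, Richa; Morgulis, Aleksandr; Schäffer, Alejandro A.; Yu, Yi‐Kuo |
In: | The FEBS Journal, 272, 2005, 20, pp. 5101-5109 |
Format: | E-Article |
Language: | English |
Published: | Wiley |
Subjects: | Cell Biology, Molecular Biology, Biochemistry |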
author_facet |
Altschul, Stephen F.; Wootton, John C.; Gertz, E. Michael; Agarwala, Richa; Morgulis, Aleksandr; Schäffer, Alejandro A.; Yu, Yi‐Kuo |
---|---|
author |
Altschul, Stephen F.; Wootton, John C.; Gertz, E. Michael; Agarwala, Richa; Morgulis, Aleksandr; Schäffer, Alejandro A.; Yu, Yi‐Kuo |
spellingShingle |
Altschul, Stephen F. Wootton, John C. Gertz, E. Michael Agarwala, Richa Morgulis, Aleksandr Schäffer, Alejandro A. Yu, Yi‐Kuo The FEBS Journal Protein database searches using compositionally adjusted substitution matrices Cell Biology Molecular Biology Biochemistry |
author_sort |
altschul, stephen f. |
spelling |
Altschul, Stephen F. Wootton, John C. Gertz, E. Michael Agarwala, Richa Morgulis, Aleksandr Schäffer, Alejandro A. Yu, Yi‐Kuo 1742-464X 1742-4658 Wiley Cell Biology Molecular Biology Biochemistry http://dx.doi.org/10.1111/j.1742-4658.2005.04945.x Almost all protein database search methods use amino acid substitution matrices for scoring, optimizing, and assessing the statistical significance of sequence alignments. Much care and effort has therefore gone into constructing substitution matrices, and the quality of search results can depend strongly upon the choice of the proper matrix. A long‐standing problem has been the comparison of sequences with biased amino acid compositions, for which standard substitution matrices are not optimal. To address this problem, we have recently developed a general procedure for transforming a standard matrix into one appropriate for the comparison of two sequences with arbitrary, and possibly differing compositions. Such adjusted matrices yield, on average, improved alignments and alignment scores when applied to the comparison of proteins with markedly biased compositions. Here we review the application of compositionally adjusted matrices and consider whether they may also be applied fruitfully to general purpose protein sequence database searches, in which related sequence pairs do not necessarily have strong compositional biases. Although it is not advisable to apply compositional adjustment indiscriminately, we describe several simple criteria under which invoking such adjustment is on average beneficial. In a typical database search, at least one of these criteria is satisfied by over half the related sequence pairs. Compositional substitution matrix adjustment is now available in NCBI's protein–protein version of BLAST. Protein database searches using compositionally adjusted substitution matrices The FEBS Journal |
doi_str_mv |
10.1111/j.1742-4658.2005.04945.x |
facet_avail |
Online Free |
finc_class_facet |
Chemistry and Pharmacy, Biology |
format |
ElectronicArticle |
fullrecord |
blob:ai-49-aHR0cDovL2R4LmRvaS5vcmcvMTAuMTExMS9qLjE3NDItNDY1OC4yMDA1LjA0OTQ1Lng |
id |
ai-49-aHR0cDovL2R4LmRvaS5vcmcvMTAuMTExMS9qLjE3NDItNDY1OC4yMDA1LjA0OTQ1Lng |
institution |
DE-D275 DE-Bn3 DE-Brt1 DE-Zwi2 DE-D161 DE-Gla1 DE-Zi4 DE-15 DE-Pl11 DE-Rs1 DE-105 DE-14 DE-Ch1 DE-L229 |
imprint |
Wiley, 2005 |
imprint_str_mv |
Wiley, 2005 |
issn |
1742-464X 1742-4658 |
issn_str_mv |
1742-464X 1742-4658 |
language |
English |
mega_collection |
Wiley (CrossRef) |
match_str |
altschul2005proteindatabasesearchesusingcompositionallyadjustedsubstitutionmatrices |
publishDateSort |
2005 |
publisher |
Wiley |
recordtype |
ai |
record_format |
ai |
series |
The FEBS Journal |
source_id |
49 |
title |
Protein database searches using compositionally adjusted substitution matrices |
title_unstemmed |
Protein database searches using compositionally adjusted substitution matrices |
title_full |
Protein database searches using compositionally adjusted substitution matrices |
title_fullStr |
Protein database searches using compositionally adjusted substitution matrices |
title_full_unstemmed |
Protein database searches using compositionally adjusted substitution matrices |
title_short |
Protein database searches using compositionally adjusted substitution matrices |
title_sort |
protein database searches using compositionally adjusted substitution matrices |
topic |
Cell Biology Molecular Biology Biochemistry |
url |
http://dx.doi.org/10.1111/j.1742-4658.2005.04945.x |
publishDate |
2005 |
physical |
5101-5109 |
description |
Almost all protein database search methods use amino acid substitution matrices for scoring, optimizing, and assessing the statistical significance of sequence alignments. Much care and effort has therefore gone into constructing substitution matrices, and the quality of search results can depend strongly upon the choice of the proper matrix. A long‐standing problem has been the comparison of sequences with biased amino acid compositions, for which standard substitution matrices are not optimal. To address this problem, we have recently developed a general procedure for transforming a standard matrix into one appropriate for the comparison of two sequences with arbitrary, and possibly differing compositions. Such adjusted matrices yield, on average, improved alignments and alignment scores when applied to the comparison of proteins with markedly biased compositions. Here we review the application of compositionally adjusted matrices and consider whether they may also be applied fruitfully to general purpose protein sequence database searches, in which related sequence pairs do not necessarily have strong compositional biases. Although it is not advisable to apply compositional adjustment indiscriminately, we describe several simple criteria under which invoking such adjustment is on average beneficial. In a typical database search, at least one of these criteria is satisfied by over half the related sequence pairs. Compositional substitution matrix adjustment is now available in NCBI's protein–protein version of BLAST. |
container_issue |
20 |
container_start_page |
5101 |
container_title |
The FEBS Journal |
container_volume |
272 |
format_de105 |
Article, E-Article |
format_de14 |
Article, E-Article |
format_de15 |
Article, E-Article |
format_de520 |
Article, E-Article |
format_de540 |
Article, E-Article |
format_dech1 |
Article, E-Article |
format_ded117 |
Article, E-Article |
format_degla1 |
E-Article |
format_del152 |
Book |
format_del189 |
Article, E-Article |
format_dezi4 |
Article |
format_dezwi2 |
Article, E-Article |
format_finc |
Article, E-Article |
format_nrw |
Article, E-Article |
_version_ |
1792346889695264774 |
geogr_code |
not assigned |
last_indexed |
2024-03-01T17:46:34.072Z |
geogr_code_person |
not assigned |
openURL |
url_ver=Z39.88-2004&ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fvufind.svn.sourceforge.net%3Agenerator&rft.title=Protein+database+searches+using+compositionally+adjusted+substitution+matrices&rft.date=2005-10-01&genre=article&issn=1742-4658&volume=272&issue=20&spage=5101&epage=5109&pages=5101-5109&jtitle=The+FEBS+Journal&atitle=Protein+database+searches+using+compositionally+adjusted+substitution+matrices&aulast=Yu&aufirst=Yi%E2%80%90Kuo&rft_id=info%3Adoi%2F10.1111%2Fj.1742-4658.2005.04945.x&rft.language%5B0%5D=eng |
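
The openURL value above is an ordinary URL-encoded query string, so its citation fields can be read back with a standard query-string parser. The following is a minimal sketch in Python, assuming only the standard library; the helper name `parse_openurl` is hypothetical, and the string is a shortened subset of the keys in the record, not the full value.

```python
from urllib.parse import parse_qs

# A subset of the key=value pairs from the record's openURL field
# (shortened for illustration; the full value carries more keys).
OPENURL = (
    "url_ver=Z39.88-2004&ctx_ver=Z39.88-2004"
    "&rft.date=2005-10-01&genre=article&issn=1742-4658"
    "&volume=272&issue=20&spage=5101&epage=5109"
    "&jtitle=The+FEBS+Journal"
    "&aulast=Yu&aufirst=Yi%E2%80%90Kuo"
    "&rft_id=info%3Adoi%2F10.1111%2Fj.1742-4658.2005.04945.x"
)


def parse_openurl(query):
    """Hypothetical helper: decode the query string into a flat dict.

    parse_qs returns a list of values per key; this keeps only the first,
    which is enough here because no key in the sample is repeated.
    """
    return {key: values[0] for key, values in parse_qs(query).items()}


fields = parse_openurl(OPENURL)

# The DOI is carried in rft_id behind an "info:doi/" prefix.
prefix = "info:doi/"
rft_id = fields["rft_id"]
doi = rft_id[len(prefix):] if rft_id.startswith(prefix) else rft_id

print(fields["jtitle"], fields["volume"], fields["issue"], doi)
```

The decoded `doi` matches the record's doi_str_mv field, and `jtitle`, `volume`, `issue`, `spage`, and `epage` correspond to the container_title, container_volume, container_issue, and physical fields.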