author_facet Morgulis, Aleksandr
Agarwala, Richa
Morgulis, Aleksandr
Agarwala, Richa
author Morgulis, Aleksandr
Agarwala, Richa
spellingShingle Morgulis, Aleksandr
Agarwala, Richa
GigaScience
SRPRISM (Single Read Paired Read Indel Substitution Minimizer): an efficient aligner for assemblies with explicit guarantees
Computer Science Applications
Health Informatics
author_sort morgulis, aleksandr
spelling Morgulis, Aleksandr Agarwala, Richa 2047-217X Oxford University Press (OUP) Computer Science Applications Health Informatics http://dx.doi.org/10.1093/gigascience/giaa023 <jats:title>Abstract</jats:title> <jats:sec> <jats:title>Background</jats:title> <jats:p>Alignment of sequence reads generated by next-generation sequencing is an integral part of most pipelines analyzing next-generation sequencing data. A number of tools designed to quickly align a large volume of sequences are already available. However, most existing tools lack explicit guarantees about their output. They also do not support searching genome assemblies, such as the human genome assembly GRCh38, that include primary and alternate sequences and placement information for alternate sequences to primary sequences in the assembly.</jats:p> </jats:sec> <jats:sec> <jats:title>Findings</jats:title> <jats:p>This paper describes SRPRISM (Single Read Paired Read Indel Substitution Minimizer), an alignment tool for aligning reads without splices. SRPRISM has features not available in most tools, such as (i) support for searching genome assemblies with alternate sequences, (ii) partial alignment of reads with a specified region of reads to be included in the alignment, (iii) choice of ranking schemes for alignments, and (iv) explicit criteria for search sensitivity. We compare the performance of SRPRISM to GEM, Kart, STAR, BWA-MEM, Bowtie2, Hobbes, and Yara using benchmark sets for paired and single reads of lengths 100 and 250 bp generated using DWGSIM. SRPRISM found the best results for most benchmark sets with error rate of up to ∼2.5% and GEM performed best for higher error rates. SRPRISM was also more sensitive than other tools even when sensitivity was reduced to improve run time performance.</jats:p> </jats:sec> <jats:sec> <jats:title>Conclusions</jats:title> <jats:p>We present SRPRISM as a flexible read mapping tool that provides explicit guarantees on results.</jats:p> </jats:sec> SRPRISM (Single Read Paired Read Indel Substitution Minimizer): an efficient aligner for assemblies with explicit guarantees GigaScience
doi_str_mv 10.1093/gigascience/giaa023
facet_avail Online
Free
finc_class_facet Informatik
Medizin
format ElectronicArticle
fullrecord blob:ai-49-aHR0cDovL2R4LmRvaS5vcmcvMTAuMTA5My9naWdhc2NpZW5jZS9naWFhMDIz
id ai-49-aHR0cDovL2R4LmRvaS5vcmcvMTAuMTA5My9naWdhc2NpZW5jZS9naWFhMDIz
institution DE-D275
DE-Bn3
DE-Brt1
DE-Zwi2
DE-D161
DE-Gla1
DE-Zi4
DE-15
DE-Pl11
DE-Rs1
DE-105
DE-14
DE-Ch1
DE-L229
imprint Oxford University Press (OUP), 2020
imprint_str_mv Oxford University Press (OUP), 2020
issn 2047-217X
issn_str_mv 2047-217X
language English
mega_collection Oxford University Press (OUP) (CrossRef)
match_str morgulis2020srprismsinglereadpairedreadindelsubstitutionminimizeranefficientalignerforassemblieswithexplicitguarantees
publishDateSort 2020
publisher Oxford University Press (OUP)
recordtype ai
record_format ai
series GigaScience
source_id 49
title SRPRISM (Single Read Paired Read Indel Substitution Minimizer): an efficient aligner for assemblies with explicit guarantees
title_unstemmed SRPRISM (Single Read Paired Read Indel Substitution Minimizer): an efficient aligner for assemblies with explicit guarantees
title_full SRPRISM (Single Read Paired Read Indel Substitution Minimizer): an efficient aligner for assemblies with explicit guarantees
title_fullStr SRPRISM (Single Read Paired Read Indel Substitution Minimizer): an efficient aligner for assemblies with explicit guarantees
title_full_unstemmed SRPRISM (Single Read Paired Read Indel Substitution Minimizer): an efficient aligner for assemblies with explicit guarantees
title_short SRPRISM (Single Read Paired Read Indel Substitution Minimizer): an efficient aligner for assemblies with explicit guarantees
title_sort srprism (single read paired read indel substitution minimizer): an efficient aligner for assemblies with explicit guarantees
topic Computer Science Applications
Health Informatics
url http://dx.doi.org/10.1093/gigascience/giaa023
publishDate 2020
physical
description <jats:title>Abstract</jats:title> <jats:sec> <jats:title>Background</jats:title> <jats:p>Alignment of sequence reads generated by next-generation sequencing is an integral part of most pipelines analyzing next-generation sequencing data. A number of tools designed to quickly align a large volume of sequences are already available. However, most existing tools lack explicit guarantees about their output. They also do not support searching genome assemblies, such as the human genome assembly GRCh38, that include primary and alternate sequences and placement information for alternate sequences to primary sequences in the assembly.</jats:p> </jats:sec> <jats:sec> <jats:title>Findings</jats:title> <jats:p>This paper describes SRPRISM (Single Read Paired Read Indel Substitution Minimizer), an alignment tool for aligning reads without splices. SRPRISM has features not available in most tools, such as (i) support for searching genome assemblies with alternate sequences, (ii) partial alignment of reads with a specified region of reads to be included in the alignment, (iii) choice of ranking schemes for alignments, and (iv) explicit criteria for search sensitivity. We compare the performance of SRPRISM to GEM, Kart, STAR, BWA-MEM, Bowtie2, Hobbes, and Yara using benchmark sets for paired and single reads of lengths 100 and 250 bp generated using DWGSIM. SRPRISM found the best results for most benchmark sets with error rate of up to ∼2.5% and GEM performed best for higher error rates. SRPRISM was also more sensitive than other tools even when sensitivity was reduced to improve run time performance.</jats:p> </jats:sec> <jats:sec> <jats:title>Conclusions</jats:title> <jats:p>We present SRPRISM as a flexible read mapping tool that provides explicit guarantees on results.</jats:p> </jats:sec>
container_issue 4
container_start_page 0
container_title GigaScience
container_volume 9
format_de105 Article, E-Article
format_de14 Article, E-Article
format_de15 Article, E-Article
format_de520 Article, E-Article
format_de540 Article, E-Article
format_dech1 Article, E-Article
format_ded117 Article, E-Article
format_degla1 E-Article
format_del152 Buch
format_del189 Article, E-Article
format_dezi4 Article
format_dezwi2 Article, E-Article
format_finc Article, E-Article
format_nrw Article, E-Article
_version_ 1792343358445715469
geogr_code not assigned
last_indexed 2024-03-01T16:50:25.735Z
geogr_code_person not assigned
openURL url_ver=Z39.88-2004&ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fvufind.svn.sourceforge.net%3Agenerator&rft.title=SRPRISM+%28Single+Read+Paired+Read+Indel+Substitution+Minimizer%29%3A+an+efficient+aligner+for+assemblies+with+explicit+guarantees&rft.date=2020-04-01&genre=article&issn=2047-217X&volume=9&issue=4&jtitle=GigaScience&atitle=SRPRISM+%28Single+Read+Paired+Read+Indel+Substitution+Minimizer%29%3A+an+efficient+aligner+for+assemblies+with+explicit+guarantees&aulast=Agarwala&aufirst=Richa&rft_id=info%3Adoi%2F10.1093%2Fgigascience%2Fgiaa023&rft.language%5B0%5D=eng
SOLR
_version_ 1792343358445715469
author Morgulis, Aleksandr, Agarwala, Richa
author_facet Morgulis, Aleksandr, Agarwala, Richa, Morgulis, Aleksandr, Agarwala, Richa
author_sort morgulis, aleksandr
container_issue 4
container_start_page 0
container_title GigaScience
container_volume 9
description <jats:title>Abstract</jats:title> <jats:sec> <jats:title>Background</jats:title> <jats:p>Alignment of sequence reads generated by next-generation sequencing is an integral part of most pipelines analyzing next-generation sequencing data. A number of tools designed to quickly align a large volume of sequences are already available. However, most existing tools lack explicit guarantees about their output. They also do not support searching genome assemblies, such as the human genome assembly GRCh38, that include primary and alternate sequences and placement information for alternate sequences to primary sequences in the assembly.</jats:p> </jats:sec> <jats:sec> <jats:title>Findings</jats:title> <jats:p>This paper describes SRPRISM (Single Read Paired Read Indel Substitution Minimizer), an alignment tool for aligning reads without splices. SRPRISM has features not available in most tools, such as (i) support for searching genome assemblies with alternate sequences, (ii) partial alignment of reads with a specified region of reads to be included in the alignment, (iii) choice of ranking schemes for alignments, and (iv) explicit criteria for search sensitivity. We compare the performance of SRPRISM to GEM, Kart, STAR, BWA-MEM, Bowtie2, Hobbes, and Yara using benchmark sets for paired and single reads of lengths 100 and 250 bp generated using DWGSIM. SRPRISM found the best results for most benchmark sets with error rate of up to ∼2.5% and GEM performed best for higher error rates. SRPRISM was also more sensitive than other tools even when sensitivity was reduced to improve run time performance.</jats:p> </jats:sec> <jats:sec> <jats:title>Conclusions</jats:title> <jats:p>We present SRPRISM as a flexible read mapping tool that provides explicit guarantees on results.</jats:p> </jats:sec>
doi_str_mv 10.1093/gigascience/giaa023
facet_avail Online, Free
finc_class_facet Informatik, Medizin
format ElectronicArticle
format_de105 Article, E-Article
format_de14 Article, E-Article
format_de15 Article, E-Article
format_de520 Article, E-Article
format_de540 Article, E-Article
format_dech1 Article, E-Article
format_ded117 Article, E-Article
format_degla1 E-Article
format_del152 Buch
format_del189 Article, E-Article
format_dezi4 Article
format_dezwi2 Article, E-Article
format_finc Article, E-Article
format_nrw Article, E-Article
geogr_code not assigned
geogr_code_person not assigned
id ai-49-aHR0cDovL2R4LmRvaS5vcmcvMTAuMTA5My9naWdhc2NpZW5jZS9naWFhMDIz
imprint Oxford University Press (OUP), 2020
imprint_str_mv Oxford University Press (OUP), 2020
institution DE-D275, DE-Bn3, DE-Brt1, DE-Zwi2, DE-D161, DE-Gla1, DE-Zi4, DE-15, DE-Pl11, DE-Rs1, DE-105, DE-14, DE-Ch1, DE-L229
issn 2047-217X
issn_str_mv 2047-217X
language English
last_indexed 2024-03-01T16:50:25.735Z
match_str morgulis2020srprismsinglereadpairedreadindelsubstitutionminimizeranefficientalignerforassemblieswithexplicitguarantees
mega_collection Oxford University Press (OUP) (CrossRef)
physical
publishDate 2020
publishDateSort 2020
publisher Oxford University Press (OUP)
record_format ai
recordtype ai
series GigaScience
source_id 49
spelling Morgulis, Aleksandr Agarwala, Richa 2047-217X Oxford University Press (OUP) Computer Science Applications Health Informatics http://dx.doi.org/10.1093/gigascience/giaa023 <jats:title>Abstract</jats:title> <jats:sec> <jats:title>Background</jats:title> <jats:p>Alignment of sequence reads generated by next-generation sequencing is an integral part of most pipelines analyzing next-generation sequencing data. A number of tools designed to quickly align a large volume of sequences are already available. However, most existing tools lack explicit guarantees about their output. They also do not support searching genome assemblies, such as the human genome assembly GRCh38, that include primary and alternate sequences and placement information for alternate sequences to primary sequences in the assembly.</jats:p> </jats:sec> <jats:sec> <jats:title>Findings</jats:title> <jats:p>This paper describes SRPRISM (Single Read Paired Read Indel Substitution Minimizer), an alignment tool for aligning reads without splices. SRPRISM has features not available in most tools, such as (i) support for searching genome assemblies with alternate sequences, (ii) partial alignment of reads with a specified region of reads to be included in the alignment, (iii) choice of ranking schemes for alignments, and (iv) explicit criteria for search sensitivity. We compare the performance of SRPRISM to GEM, Kart, STAR, BWA-MEM, Bowtie2, Hobbes, and Yara using benchmark sets for paired and single reads of lengths 100 and 250 bp generated using DWGSIM. SRPRISM found the best results for most benchmark sets with error rate of up to ∼2.5% and GEM performed best for higher error rates. SRPRISM was also more sensitive than other tools even when sensitivity was reduced to improve run time performance.</jats:p> </jats:sec> <jats:sec> <jats:title>Conclusions</jats:title> <jats:p>We present SRPRISM as a flexible read mapping tool that provides explicit guarantees on results.</jats:p> </jats:sec> SRPRISM (Single Read Paired Read Indel Substitution Minimizer): an efficient aligner for assemblies with explicit guarantees GigaScience
spellingShingle Morgulis, Aleksandr, Agarwala, Richa, GigaScience, SRPRISM (Single Read Paired Read Indel Substitution Minimizer): an efficient aligner for assemblies with explicit guarantees, Computer Science Applications, Health Informatics
title SRPRISM (Single Read Paired Read Indel Substitution Minimizer): an efficient aligner for assemblies with explicit guarantees
title_full SRPRISM (Single Read Paired Read Indel Substitution Minimizer): an efficient aligner for assemblies with explicit guarantees
title_fullStr SRPRISM (Single Read Paired Read Indel Substitution Minimizer): an efficient aligner for assemblies with explicit guarantees
title_full_unstemmed SRPRISM (Single Read Paired Read Indel Substitution Minimizer): an efficient aligner for assemblies with explicit guarantees
title_short SRPRISM (Single Read Paired Read Indel Substitution Minimizer): an efficient aligner for assemblies with explicit guarantees
title_sort srprism (single read paired read indel substitution minimizer): an efficient aligner for assemblies with explicit guarantees
title_unstemmed SRPRISM (Single Read Paired Read Indel Substitution Minimizer): an efficient aligner for assemblies with explicit guarantees
topic Computer Science Applications, Health Informatics
url http://dx.doi.org/10.1093/gigascience/giaa023