A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model

Gespeichert in:

Bibliographische Detailangaben
Zeitschriftentitel:	Biology Open
Personen und Körperschaften:	Orgeur, Mickael, Martens, Marvin, Börno, Stefan T., Timmermann, Bernd, Duprez, Delphine, Stricker, Sigmar
In:	Biology Open, 2017
Format:	E-Article
Sprache:	Englisch
veröffentlicht:	The Company of Biologists
Schlagwörter:	General Agricultural and Biological Sciences General Biochemistry, Genetics and Molecular Biology

author_facet	Orgeur, Mickael Martens, Marvin Börno, Stefan T. Timmermann, Bernd Duprez, Delphine Stricker, Sigmar Orgeur, Mickael Martens, Marvin Börno, Stefan T. Timmermann, Bernd Duprez, Delphine Stricker, Sigmar
author	Orgeur, Mickael Martens, Marvin Börno, Stefan T. Timmermann, Bernd Duprez, Delphine Stricker, Sigmar
spellingShingle	Orgeur, Mickael Martens, Marvin Börno, Stefan T. Timmermann, Bernd Duprez, Delphine Stricker, Sigmar Biology Open A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model General Agricultural and Biological Sciences General Biochemistry, Genetics and Molecular Biology
author_sort	orgeur, mickael
spelling	Orgeur, Mickael Martens, Marvin Börno, Stefan T. Timmermann, Bernd Duprez, Delphine Stricker, Sigmar 2046-6390 The Company of Biologists General Agricultural and Biological Sciences General Biochemistry, Genetics and Molecular Biology http://dx.doi.org/10.1242/bio.028498 <jats:p>The sequence of the chicken genome, like several other draft genome sequences, is presently not fully covered. Gaps, contigs assigned with low confidence and uncharacterized chromosomes result in gene fragmentation and imprecise gene annotation. Transcript abundance estimation from RNA sequencing (RNA-seq) data relies on read quality, library complexity and expression normalization. In addition, the quality of the genome sequence used to map sequencing reads and the gene annotation that defines gene features must also be taken into account. Partially covered genome sequence causes the loss of sequencing reads from the mapping step, while an inaccurate definition of gene features induces imprecise read counts from the assignment step. Both steps can significantly bias interpretation of RNA-seq data. Here, we describe a dual transcript-discovery approach combining a genome-guided gene prediction and a de novo transcriptome assembly. This dual approach enabled us to increase the assignment rate of RNA-seq data by nearly 20% as compared to when using only the chicken reference annotation, contributing therefore to a more accurate estimation of transcript abundance. More generally, this strategy could be applied to any organism with partial genome sequence and/or lacking a manually-curated reference annotation in order to improve the accuracy of gene expression studies.</jats:p> A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model Biology Open
doi_str_mv	10.1242/bio.028498
facet_avail	Online Free
format	ElectronicArticle
fullrecord	blob:ai-49-aHR0cDovL2R4LmRvaS5vcmcvMTAuMTI0Mi9iaW8uMDI4NDk4
id	ai-49-aHR0cDovL2R4LmRvaS5vcmcvMTAuMTI0Mi9iaW8uMDI4NDk4
institution	DE-D275 DE-Bn3 DE-Brt1 DE-D161 DE-Zwi2 DE-Gla1 DE-Zi4 DE-15 DE-Pl11 DE-Rs1 DE-105 DE-14 DE-Ch1 DE-L229
imprint	The Company of Biologists, 2017
imprint_str_mv	The Company of Biologists, 2017
issn	2046-6390
issn_str_mv	2046-6390
language	English
mega_collection	The Company of Biologists (CrossRef)
match_str	orgeur2017adualtranscriptdiscoveryapproachtoimprovethedelimitationofgenefeaturesfromrnaseqdatainthechickenmodel
publishDateSort	2017
publisher	The Company of Biologists
recordtype	ai
record_format	ai
series	Biology Open
source_id	49
title	A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model
title_unstemmed	A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model
title_full	A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model
title_fullStr	A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model
title_full_unstemmed	A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model
title_short	A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model
title_sort	a dual transcript-discovery approach to improve the delimitation of gene features from rna-seq data in the chicken model
topic	General Agricultural and Biological Sciences General Biochemistry, Genetics and Molecular Biology
url	http://dx.doi.org/10.1242/bio.028498
publishDate	2017
physical
description	<jats:p>The sequence of the chicken genome, like several other draft genome sequences, is presently not fully covered. Gaps, contigs assigned with low confidence and uncharacterized chromosomes result in gene fragmentation and imprecise gene annotation. Transcript abundance estimation from RNA sequencing (RNA-seq) data relies on read quality, library complexity and expression normalization. In addition, the quality of the genome sequence used to map sequencing reads and the gene annotation that defines gene features must also be taken into account. Partially covered genome sequence causes the loss of sequencing reads from the mapping step, while an inaccurate definition of gene features induces imprecise read counts from the assignment step. Both steps can significantly bias interpretation of RNA-seq data. Here, we describe a dual transcript-discovery approach combining a genome-guided gene prediction and a de novo transcriptome assembly. This dual approach enabled us to increase the assignment rate of RNA-seq data by nearly 20% as compared to when using only the chicken reference annotation, contributing therefore to a more accurate estimation of transcript abundance. More generally, this strategy could be applied to any organism with partial genome sequence and/or lacking a manually-curated reference annotation in order to improve the accuracy of gene expression studies.</jats:p>
container_start_page	0
container_title	Biology Open
format_de105	Article, E-Article
format_de14	Article, E-Article
format_de15	Article, E-Article
format_de520	Article, E-Article
format_de540	Article, E-Article
format_dech1	Article, E-Article
format_ded117	Article, E-Article
format_degla1	E-Article
format_del152	Buch
format_del189	Article, E-Article
format_dezi4	Article
format_dezwi2	Article, E-Article
format_finc	Article, E-Article
format_nrw	Article, E-Article
_version_	1792347617848459264
geogr_code	not assigned
last_indexed	2024-03-01T17:57:27.634Z
geogr_code_person	not assigned
openURL	url_ver=Z39.88-2004&ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fvufind.svn.sourceforge.net%3Agenerator&rft.title=A+dual+transcript-discovery+approach+to+improve+the+delimitation+of+gene+features+from+RNA-seq+data+in+the+chicken+model&rft.date=2017-01-01&genre=article&issn=2046-6390&jtitle=Biology+Open&atitle=A+dual+transcript-discovery+approach+to+improve+the+delimitation+of+gene+features+from+RNA-seq+data+in+the+chicken+model&aulast=Stricker&aufirst=Sigmar&rft_id=info%3Adoi%2F10.1242%2Fbio.028498&rft.language%5B0%5D=eng
SOLR
_version_	1792347617848459264
author	Orgeur, Mickael, Martens, Marvin, Börno, Stefan T., Timmermann, Bernd, Duprez, Delphine, Stricker, Sigmar
author_facet	Orgeur, Mickael, Martens, Marvin, Börno, Stefan T., Timmermann, Bernd, Duprez, Delphine, Stricker, Sigmar, Orgeur, Mickael, Martens, Marvin, Börno, Stefan T., Timmermann, Bernd, Duprez, Delphine, Stricker, Sigmar
author_sort	orgeur, mickael
container_start_page	0
container_title	Biology Open
description	<jats:p>The sequence of the chicken genome, like several other draft genome sequences, is presently not fully covered. Gaps, contigs assigned with low confidence and uncharacterized chromosomes result in gene fragmentation and imprecise gene annotation. Transcript abundance estimation from RNA sequencing (RNA-seq) data relies on read quality, library complexity and expression normalization. In addition, the quality of the genome sequence used to map sequencing reads and the gene annotation that defines gene features must also be taken into account. Partially covered genome sequence causes the loss of sequencing reads from the mapping step, while an inaccurate definition of gene features induces imprecise read counts from the assignment step. Both steps can significantly bias interpretation of RNA-seq data. Here, we describe a dual transcript-discovery approach combining a genome-guided gene prediction and a de novo transcriptome assembly. This dual approach enabled us to increase the assignment rate of RNA-seq data by nearly 20% as compared to when using only the chicken reference annotation, contributing therefore to a more accurate estimation of transcript abundance. More generally, this strategy could be applied to any organism with partial genome sequence and/or lacking a manually-curated reference annotation in order to improve the accuracy of gene expression studies.</jats:p>
doi_str_mv	10.1242/bio.028498
facet_avail	Online, Free
format	ElectronicArticle
format_de105	Article, E-Article
format_de14	Article, E-Article
format_de15	Article, E-Article
format_de520	Article, E-Article
format_de540	Article, E-Article
format_dech1	Article, E-Article
format_ded117	Article, E-Article
format_degla1	E-Article
format_del152	Buch
format_del189	Article, E-Article
format_dezi4	Article
format_dezwi2	Article, E-Article
format_finc	Article, E-Article
format_nrw	Article, E-Article
geogr_code	not assigned
geogr_code_person	not assigned
id	ai-49-aHR0cDovL2R4LmRvaS5vcmcvMTAuMTI0Mi9iaW8uMDI4NDk4
imprint	The Company of Biologists, 2017
imprint_str_mv	The Company of Biologists, 2017
institution	DE-D275, DE-Bn3, DE-Brt1, DE-D161, DE-Zwi2, DE-Gla1, DE-Zi4, DE-15, DE-Pl11, DE-Rs1, DE-105, DE-14, DE-Ch1, DE-L229
issn	2046-6390
issn_str_mv	2046-6390
language	English
last_indexed	2024-03-01T17:57:27.634Z
match_str	orgeur2017adualtranscriptdiscoveryapproachtoimprovethedelimitationofgenefeaturesfromrnaseqdatainthechickenmodel
mega_collection	The Company of Biologists (CrossRef)
physical
publishDate	2017
publishDateSort	2017
publisher	The Company of Biologists
record_format	ai
recordtype	ai
series	Biology Open
source_id	49
spelling	Orgeur, Mickael Martens, Marvin Börno, Stefan T. Timmermann, Bernd Duprez, Delphine Stricker, Sigmar 2046-6390 The Company of Biologists General Agricultural and Biological Sciences General Biochemistry, Genetics and Molecular Biology http://dx.doi.org/10.1242/bio.028498 <jats:p>The sequence of the chicken genome, like several other draft genome sequences, is presently not fully covered. Gaps, contigs assigned with low confidence and uncharacterized chromosomes result in gene fragmentation and imprecise gene annotation. Transcript abundance estimation from RNA sequencing (RNA-seq) data relies on read quality, library complexity and expression normalization. In addition, the quality of the genome sequence used to map sequencing reads and the gene annotation that defines gene features must also be taken into account. Partially covered genome sequence causes the loss of sequencing reads from the mapping step, while an inaccurate definition of gene features induces imprecise read counts from the assignment step. Both steps can significantly bias interpretation of RNA-seq data. Here, we describe a dual transcript-discovery approach combining a genome-guided gene prediction and a de novo transcriptome assembly. This dual approach enabled us to increase the assignment rate of RNA-seq data by nearly 20% as compared to when using only the chicken reference annotation, contributing therefore to a more accurate estimation of transcript abundance. More generally, this strategy could be applied to any organism with partial genome sequence and/or lacking a manually-curated reference annotation in order to improve the accuracy of gene expression studies.</jats:p> A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model Biology Open
spellingShingle	Orgeur, Mickael, Martens, Marvin, Börno, Stefan T., Timmermann, Bernd, Duprez, Delphine, Stricker, Sigmar, Biology Open, A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model, General Agricultural and Biological Sciences, General Biochemistry, Genetics and Molecular Biology
title	A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model
title_full	A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model
title_fullStr	A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model
title_full_unstemmed	A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model
title_short	A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model
title_sort	a dual transcript-discovery approach to improve the delimitation of gene features from rna-seq data in the chicken model
title_unstemmed	A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model
topic	General Agricultural and Biological Sciences, General Biochemistry, Genetics and Molecular Biology
url	http://dx.doi.org/10.1242/bio.028498