Eintrag weiter verarbeiten
A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model
Gespeichert in:
Zeitschriftentitel: | Biology Open |
---|---|
Personen und Körperschaften: | , , , , , |
In: | Biology Open, 2017 |
Format: | E-Article |
Sprache: | Englisch |
veröffentlicht: |
The Company of Biologists
|
Schlagwörter: |
author_facet |
Orgeur, Mickael Martens, Marvin Börno, Stefan T. Timmermann, Bernd Duprez, Delphine Stricker, Sigmar Orgeur, Mickael Martens, Marvin Börno, Stefan T. Timmermann, Bernd Duprez, Delphine Stricker, Sigmar |
---|---|
author |
Orgeur, Mickael Martens, Marvin Börno, Stefan T. Timmermann, Bernd Duprez, Delphine Stricker, Sigmar |
spellingShingle |
Orgeur, Mickael Martens, Marvin Börno, Stefan T. Timmermann, Bernd Duprez, Delphine Stricker, Sigmar Biology Open A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model General Agricultural and Biological Sciences General Biochemistry, Genetics and Molecular Biology |
author_sort |
orgeur, mickael |
spelling |
Orgeur, Mickael Martens, Marvin Börno, Stefan T. Timmermann, Bernd Duprez, Delphine Stricker, Sigmar 2046-6390 The Company of Biologists General Agricultural and Biological Sciences General Biochemistry, Genetics and Molecular Biology http://dx.doi.org/10.1242/bio.028498 <jats:p>The sequence of the chicken genome, like several other draft genome sequences, is presently not fully covered. Gaps, contigs assigned with low confidence and uncharacterized chromosomes result in gene fragmentation and imprecise gene annotation. Transcript abundance estimation from RNA sequencing (RNA-seq) data relies on read quality, library complexity and expression normalization. In addition, the quality of the genome sequence used to map sequencing reads and the gene annotation that defines gene features must also be taken into account. Partially covered genome sequence causes the loss of sequencing reads from the mapping step, while an inaccurate definition of gene features induces imprecise read counts from the assignment step. Both steps can significantly bias interpretation of RNA-seq data. Here, we describe a dual transcript-discovery approach combining a genome-guided gene prediction and a de novo transcriptome assembly. This dual approach enabled us to increase the assignment rate of RNA-seq data by nearly 20% as compared to when using only the chicken reference annotation, contributing therefore to a more accurate estimation of transcript abundance. More generally, this strategy could be applied to any organism with partial genome sequence and/or lacking a manually-curated reference annotation in order to improve the accuracy of gene expression studies.</jats:p> A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model Biology Open |
doi_str_mv |
10.1242/bio.028498 |
facet_avail |
Online Free |
format |
ElectronicArticle |
fullrecord |
blob:ai-49-aHR0cDovL2R4LmRvaS5vcmcvMTAuMTI0Mi9iaW8uMDI4NDk4 |
id |
ai-49-aHR0cDovL2R4LmRvaS5vcmcvMTAuMTI0Mi9iaW8uMDI4NDk4 |
institution |
DE-D275 DE-Bn3 DE-Brt1 DE-D161 DE-Zwi2 DE-Gla1 DE-Zi4 DE-15 DE-Pl11 DE-Rs1 DE-105 DE-14 DE-Ch1 DE-L229 |
imprint |
The Company of Biologists, 2017 |
imprint_str_mv |
The Company of Biologists, 2017 |
issn |
2046-6390 |
issn_str_mv |
2046-6390 |
language |
English |
mega_collection |
The Company of Biologists (CrossRef) |
match_str |
orgeur2017adualtranscriptdiscoveryapproachtoimprovethedelimitationofgenefeaturesfromrnaseqdatainthechickenmodel |
publishDateSort |
2017 |
publisher |
The Company of Biologists |
recordtype |
ai |
record_format |
ai |
series |
Biology Open |
source_id |
49 |
title |
A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model |
title_unstemmed |
A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model |
title_full |
A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model |
title_fullStr |
A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model |
title_full_unstemmed |
A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model |
title_short |
A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model |
title_sort |
a dual transcript-discovery approach to improve the delimitation of gene features from rna-seq data in the chicken model |
topic |
General Agricultural and Biological Sciences General Biochemistry, Genetics and Molecular Biology |
url |
http://dx.doi.org/10.1242/bio.028498 |
publishDate |
2017 |
physical |
|
description |
<jats:p>The sequence of the chicken genome, like several other draft genome sequences, is presently not fully covered. Gaps, contigs assigned with low confidence and uncharacterized chromosomes result in gene fragmentation and imprecise gene annotation. Transcript abundance estimation from RNA sequencing (RNA-seq) data relies on read quality, library complexity and expression normalization. In addition, the quality of the genome sequence used to map sequencing reads and the gene annotation that defines gene features must also be taken into account. Partially covered genome sequence causes the loss of sequencing reads from the mapping step, while an inaccurate definition of gene features induces imprecise read counts from the assignment step. Both steps can significantly bias interpretation of RNA-seq data. Here, we describe a dual transcript-discovery approach combining a genome-guided gene prediction and a de novo transcriptome assembly. This dual approach enabled us to increase the assignment rate of RNA-seq data by nearly 20% as compared to when using only the chicken reference annotation, contributing therefore to a more accurate estimation of transcript abundance. More generally, this strategy could be applied to any organism with partial genome sequence and/or lacking a manually-curated reference annotation in order to improve the accuracy of gene expression studies.</jats:p> |
container_start_page |
0 |
container_title |
Biology Open |
format_de105 |
Article, E-Article |
format_de14 |
Article, E-Article |
format_de15 |
Article, E-Article |
format_de520 |
Article, E-Article |
format_de540 |
Article, E-Article |
format_dech1 |
Article, E-Article |
format_ded117 |
Article, E-Article |
format_degla1 |
E-Article |
format_del152 |
Buch |
format_del189 |
Article, E-Article |
format_dezi4 |
Article |
format_dezwi2 |
Article, E-Article |
format_finc |
Article, E-Article |
format_nrw |
Article, E-Article |
_version_ |
1792347617848459264 |
geogr_code |
not assigned |
last_indexed |
2024-03-01T17:57:27.634Z |
geogr_code_person |
not assigned |
openURL |
url_ver=Z39.88-2004&ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fvufind.svn.sourceforge.net%3Agenerator&rft.title=A+dual+transcript-discovery+approach+to+improve+the+delimitation+of+gene+features+from+RNA-seq+data+in+the+chicken+model&rft.date=2017-01-01&genre=article&issn=2046-6390&jtitle=Biology+Open&atitle=A+dual+transcript-discovery+approach+to+improve+the+delimitation+of+gene+features+from+RNA-seq+data+in+the+chicken+model&aulast=Stricker&aufirst=Sigmar&rft_id=info%3Adoi%2F10.1242%2Fbio.028498&rft.language%5B0%5D=eng |
SOLR | |
_version_ | 1792347617848459264 |
author | Orgeur, Mickael, Martens, Marvin, Börno, Stefan T., Timmermann, Bernd, Duprez, Delphine, Stricker, Sigmar |
author_facet | Orgeur, Mickael, Martens, Marvin, Börno, Stefan T., Timmermann, Bernd, Duprez, Delphine, Stricker, Sigmar, Orgeur, Mickael, Martens, Marvin, Börno, Stefan T., Timmermann, Bernd, Duprez, Delphine, Stricker, Sigmar |
author_sort | orgeur, mickael |
container_start_page | 0 |
container_title | Biology Open |
description | <jats:p>The sequence of the chicken genome, like several other draft genome sequences, is presently not fully covered. Gaps, contigs assigned with low confidence and uncharacterized chromosomes result in gene fragmentation and imprecise gene annotation. Transcript abundance estimation from RNA sequencing (RNA-seq) data relies on read quality, library complexity and expression normalization. In addition, the quality of the genome sequence used to map sequencing reads and the gene annotation that defines gene features must also be taken into account. Partially covered genome sequence causes the loss of sequencing reads from the mapping step, while an inaccurate definition of gene features induces imprecise read counts from the assignment step. Both steps can significantly bias interpretation of RNA-seq data. Here, we describe a dual transcript-discovery approach combining a genome-guided gene prediction and a de novo transcriptome assembly. This dual approach enabled us to increase the assignment rate of RNA-seq data by nearly 20% as compared to when using only the chicken reference annotation, contributing therefore to a more accurate estimation of transcript abundance. More generally, this strategy could be applied to any organism with partial genome sequence and/or lacking a manually-curated reference annotation in order to improve the accuracy of gene expression studies.</jats:p> |
doi_str_mv | 10.1242/bio.028498 |
facet_avail | Online, Free |
format | ElectronicArticle |
format_de105 | Article, E-Article |
format_de14 | Article, E-Article |
format_de15 | Article, E-Article |
format_de520 | Article, E-Article |
format_de540 | Article, E-Article |
format_dech1 | Article, E-Article |
format_ded117 | Article, E-Article |
format_degla1 | E-Article |
format_del152 | Buch |
format_del189 | Article, E-Article |
format_dezi4 | Article |
format_dezwi2 | Article, E-Article |
format_finc | Article, E-Article |
format_nrw | Article, E-Article |
geogr_code | not assigned |
geogr_code_person | not assigned |
id | ai-49-aHR0cDovL2R4LmRvaS5vcmcvMTAuMTI0Mi9iaW8uMDI4NDk4 |
imprint | The Company of Biologists, 2017 |
imprint_str_mv | The Company of Biologists, 2017 |
institution | DE-D275, DE-Bn3, DE-Brt1, DE-D161, DE-Zwi2, DE-Gla1, DE-Zi4, DE-15, DE-Pl11, DE-Rs1, DE-105, DE-14, DE-Ch1, DE-L229 |
issn | 2046-6390 |
issn_str_mv | 2046-6390 |
language | English |
last_indexed | 2024-03-01T17:57:27.634Z |
match_str | orgeur2017adualtranscriptdiscoveryapproachtoimprovethedelimitationofgenefeaturesfromrnaseqdatainthechickenmodel |
mega_collection | The Company of Biologists (CrossRef) |
physical | |
publishDate | 2017 |
publishDateSort | 2017 |
publisher | The Company of Biologists |
record_format | ai |
recordtype | ai |
series | Biology Open |
source_id | 49 |
spelling | Orgeur, Mickael Martens, Marvin Börno, Stefan T. Timmermann, Bernd Duprez, Delphine Stricker, Sigmar 2046-6390 The Company of Biologists General Agricultural and Biological Sciences General Biochemistry, Genetics and Molecular Biology http://dx.doi.org/10.1242/bio.028498 <jats:p>The sequence of the chicken genome, like several other draft genome sequences, is presently not fully covered. Gaps, contigs assigned with low confidence and uncharacterized chromosomes result in gene fragmentation and imprecise gene annotation. Transcript abundance estimation from RNA sequencing (RNA-seq) data relies on read quality, library complexity and expression normalization. In addition, the quality of the genome sequence used to map sequencing reads and the gene annotation that defines gene features must also be taken into account. Partially covered genome sequence causes the loss of sequencing reads from the mapping step, while an inaccurate definition of gene features induces imprecise read counts from the assignment step. Both steps can significantly bias interpretation of RNA-seq data. Here, we describe a dual transcript-discovery approach combining a genome-guided gene prediction and a de novo transcriptome assembly. This dual approach enabled us to increase the assignment rate of RNA-seq data by nearly 20% as compared to when using only the chicken reference annotation, contributing therefore to a more accurate estimation of transcript abundance. More generally, this strategy could be applied to any organism with partial genome sequence and/or lacking a manually-curated reference annotation in order to improve the accuracy of gene expression studies.</jats:p> A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model Biology Open |
spellingShingle | Orgeur, Mickael, Martens, Marvin, Börno, Stefan T., Timmermann, Bernd, Duprez, Delphine, Stricker, Sigmar, Biology Open, A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model, General Agricultural and Biological Sciences, General Biochemistry, Genetics and Molecular Biology |
title | A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model |
title_full | A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model |
title_fullStr | A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model |
title_full_unstemmed | A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model |
title_short | A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model |
title_sort | a dual transcript-discovery approach to improve the delimitation of gene features from rna-seq data in the chicken model |
title_unstemmed | A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model |
topic | General Agricultural and Biological Sciences, General Biochemistry, Genetics and Molecular Biology |
url | http://dx.doi.org/10.1242/bio.028498 |