Dynamic transcriptome profiling dataset of vaccinia virus obtained from long-read sequencing techniques

Dóra Tombácz, István Prazsák, Attila Szucs, B. Dénes, Michael Snyder, Z. Boldogkői

Research output: Contribution to journalArticle

1 Citation (Scopus)

Abstract

Background: Poxviruses are large DNA viruses that infect humans and animals. Vaccinia virus (VACV) has been applied as a live vaccine for immunization against smallpox, which was eradicated by 1980 as a result of worldwide vaccination. VACV is the prototype of poxviruses in the investigation of the molecular pathogenesis of the virus. Short-read sequencing methods have revolutionized transcriptomics; however, they are not efficient in distinguishing between the RNA isoforms and transcript overlaps. Long-read sequencing (LRS) is much better suited to solve these problems and also allow direct RNA sequencing. Despite the scientific relevance of VACV, no LRS data have been generated for the viral transcriptome to date. Findings: For the deep characterization of the VACV RNA profile, various LRS platforms and library preparation approaches were applied. The raw reads were mapped to the VACV reference genome and also to the host (Chlorocebus sabaeus) genome. In this study, we applied the Pacific Biosciences RSII and Sequel platforms, which altogether resulted in 937,531 mapped reads of inserts (1.42 Gb), while we obtained 2,160,348 aligned reads (1.75 Gb) from the different library preparation methods using the MinION device from Oxford Nanopore Technologies. Conclusions: By applying cutting-edge technologies, we were able to generate a large dataset that can serve as a valuable resource for the investigation of the dynamic VACV transcriptome, the virus-host interactions, and RNA base modifications. These data can provide useful information for novel gene annotations in the VACV genome. Our dataset can also be used to analyze the currently available LRS platforms, library preparation methods, and bioinformatics pipelines.

Original languageEnglish
JournalGigaScience
Volume7
Issue number12
DOIs
Publication statusPublished - Dec 1 2018

Fingerprint

Vaccinia virus
Gene Expression Profiling
Viruses
RNA
Libraries
Poxviridae
Genes
Genome
Transcriptome
Nanopores
RNA Isoforms
Technology
RNA Sequence Analysis
Molecular Sequence Annotation
Smallpox
DNA Viruses
Datasets
Computational Biology
Immunization
Vaccines

ASJC Scopus subject areas

  • Health Informatics
  • Computer Science Applications

Cite this

Dynamic transcriptome profiling dataset of vaccinia virus obtained from long-read sequencing techniques. / Tombácz, Dóra; Prazsák, István; Szucs, Attila; Dénes, B.; Snyder, Michael; Boldogkői, Z.

In: GigaScience, Vol. 7, No. 12, 01.12.2018.

Research output: Contribution to journalArticle

Tombácz, Dóra ; Prazsák, István ; Szucs, Attila ; Dénes, B. ; Snyder, Michael ; Boldogkői, Z. / Dynamic transcriptome profiling dataset of vaccinia virus obtained from long-read sequencing techniques. In: GigaScience. 2018 ; Vol. 7, No. 12.
@article{7256f552506a451da996f0b1304dd522,
title = "Dynamic transcriptome profiling dataset of vaccinia virus obtained from long-read sequencing techniques",
abstract = "Background: Poxviruses are large DNA viruses that infect humans and animals. Vaccinia virus (VACV) has been applied as a live vaccine for immunization against smallpox, which was eradicated by 1980 as a result of worldwide vaccination. VACV is the prototype of poxviruses in the investigation of the molecular pathogenesis of the virus. Short-read sequencing methods have revolutionized transcriptomics; however, they are not efficient in distinguishing between the RNA isoforms and transcript overlaps. Long-read sequencing (LRS) is much better suited to solve these problems and also allow direct RNA sequencing. Despite the scientific relevance of VACV, no LRS data have been generated for the viral transcriptome to date. Findings: For the deep characterization of the VACV RNA profile, various LRS platforms and library preparation approaches were applied. The raw reads were mapped to the VACV reference genome and also to the host (Chlorocebus sabaeus) genome. In this study, we applied the Pacific Biosciences RSII and Sequel platforms, which altogether resulted in 937,531 mapped reads of inserts (1.42 Gb), while we obtained 2,160,348 aligned reads (1.75 Gb) from the different library preparation methods using the MinION device from Oxford Nanopore Technologies. Conclusions: By applying cutting-edge technologies, we were able to generate a large dataset that can serve as a valuable resource for the investigation of the dynamic VACV transcriptome, the virus-host interactions, and RNA base modifications. These data can provide useful information for novel gene annotations in the VACV genome. Our dataset can also be used to analyze the currently available LRS platforms, library preparation methods, and bioinformatics pipelines.",
author = "D{\'o}ra Tomb{\'a}cz and Istv{\'a}n Prazs{\'a}k and Attila Szucs and B. D{\'e}nes and Michael Snyder and Z. Boldogkői",
year = "2018",
month = "12",
day = "1",
doi = "10.1093/gigascience/giy139",
language = "English",
volume = "7",
journal = "GigaScience",
issn = "2047-217X",
publisher = "BioMed Central",
number = "12",

}

TY - JOUR

T1 - Dynamic transcriptome profiling dataset of vaccinia virus obtained from long-read sequencing techniques

AU - Tombácz, Dóra

AU - Prazsák, István

AU - Szucs, Attila

AU - Dénes, B.

AU - Snyder, Michael

AU - Boldogkői, Z.

PY - 2018/12/1

Y1 - 2018/12/1

N2 - Background: Poxviruses are large DNA viruses that infect humans and animals. Vaccinia virus (VACV) has been applied as a live vaccine for immunization against smallpox, which was eradicated by 1980 as a result of worldwide vaccination. VACV is the prototype of poxviruses in the investigation of the molecular pathogenesis of the virus. Short-read sequencing methods have revolutionized transcriptomics; however, they are not efficient in distinguishing between the RNA isoforms and transcript overlaps. Long-read sequencing (LRS) is much better suited to solve these problems and also allow direct RNA sequencing. Despite the scientific relevance of VACV, no LRS data have been generated for the viral transcriptome to date. Findings: For the deep characterization of the VACV RNA profile, various LRS platforms and library preparation approaches were applied. The raw reads were mapped to the VACV reference genome and also to the host (Chlorocebus sabaeus) genome. In this study, we applied the Pacific Biosciences RSII and Sequel platforms, which altogether resulted in 937,531 mapped reads of inserts (1.42 Gb), while we obtained 2,160,348 aligned reads (1.75 Gb) from the different library preparation methods using the MinION device from Oxford Nanopore Technologies. Conclusions: By applying cutting-edge technologies, we were able to generate a large dataset that can serve as a valuable resource for the investigation of the dynamic VACV transcriptome, the virus-host interactions, and RNA base modifications. These data can provide useful information for novel gene annotations in the VACV genome. Our dataset can also be used to analyze the currently available LRS platforms, library preparation methods, and bioinformatics pipelines.

AB - Background: Poxviruses are large DNA viruses that infect humans and animals. Vaccinia virus (VACV) has been applied as a live vaccine for immunization against smallpox, which was eradicated by 1980 as a result of worldwide vaccination. VACV is the prototype of poxviruses in the investigation of the molecular pathogenesis of the virus. Short-read sequencing methods have revolutionized transcriptomics; however, they are not efficient in distinguishing between the RNA isoforms and transcript overlaps. Long-read sequencing (LRS) is much better suited to solve these problems and also allow direct RNA sequencing. Despite the scientific relevance of VACV, no LRS data have been generated for the viral transcriptome to date. Findings: For the deep characterization of the VACV RNA profile, various LRS platforms and library preparation approaches were applied. The raw reads were mapped to the VACV reference genome and also to the host (Chlorocebus sabaeus) genome. In this study, we applied the Pacific Biosciences RSII and Sequel platforms, which altogether resulted in 937,531 mapped reads of inserts (1.42 Gb), while we obtained 2,160,348 aligned reads (1.75 Gb) from the different library preparation methods using the MinION device from Oxford Nanopore Technologies. Conclusions: By applying cutting-edge technologies, we were able to generate a large dataset that can serve as a valuable resource for the investigation of the dynamic VACV transcriptome, the virus-host interactions, and RNA base modifications. These data can provide useful information for novel gene annotations in the VACV genome. Our dataset can also be used to analyze the currently available LRS platforms, library preparation methods, and bioinformatics pipelines.

UR - http://www.scopus.com/inward/record.url?scp=85058609345&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85058609345&partnerID=8YFLogxK

U2 - 10.1093/gigascience/giy139

DO - 10.1093/gigascience/giy139

M3 - Article

VL - 7

JO - GigaScience

JF - GigaScience

SN - 2047-217X

IS - 12

ER -