-
Notifications
You must be signed in to change notification settings - Fork 2
Background
The broad-scale availability of virus sequence data is transforming the way that viruses are studied and monitored. With billions of bases generated from sequencing technologies, there is unprecedented potential for advancing knowledge in the field. Sequence data support critical functions, including:
- Surveillance, Control, and Reporting: Facilitating the monitoring of viral diseases and outbreaks.
- Vaccine Development and Diagnostics: Supporting the design and evaluation of vaccines and diagnostic tools.
- Understanding Pathogenesis and Transmissibility: Identifying factors influencing how viruses spread and cause disease.
Despite these advancements, it remains challenging for researchers to make productive use of this data. The complexity of the underlying data, the rapid accumulation of new genome sequences, and the fast pace of technological advances create significant hurdles.
Virus databases are pivotal in enabling researchers to examine viral properties and epidemic patterns in innovative ways. These databases combine genomic data with other critical information, such as epidemiological data, to provide a more complete view of viral ecology and evolution. Sequencing data is particularly valuable as it not only captures the viral genetic code but also links closely to the evolutionary history of the virus, facilitating the tracking of viral lineages and mutation rates.
The development of sequence databases has a rich history, beginning with early examples of highly sequenced viruses such as Influenza A virus and HIV-1. These viruses served as proving grounds for comparative and phylogenetic approaches, showcasing the potential of genomic analysis in understanding viral behavior.
-
Influenza A Virus: Initially focused on epidemiological studies, the database provided insights into the rate and direction of the virus's spread, as well as its pathways of transmission. Over time, the focus expanded to include efforts to design vaccines, demonstrating the critical role of genomic data in public health responses.
-
HIV-1: The establishment of databases like HIVdb at Stanford and the Los Alamos HIV database marked significant milestones in the use of genomic data to track and respond to the HIV epidemic. These databases facilitated the analysis of drug resistance and the identification of new therapeutic targets.
Despite the advancements, many species-focused databases have been developed but often lack maintenance, leading to significant gaps in resources for viruses such as measles and respiratory syncytial virus (RSV). As sequencing technologies continue to advance, the need for well-maintained databases will only grow.
Given the unique challenges posed by viruses, there is a pressing need for specialized systems like GLUE. Viruses exhibit greater diversity and higher mutation rates than other organisms, making it essential to have tailored approaches for their study. The rapid capacity for evolution among viruses not only complicates their management but also presents opportunities for real-time tracking of viral epidemics.
The complexity of viral genomes, particularly RNA virus genomes, serves as a reminder of the intricate ways these entities manipulate host cells for replication. To realize the full value of virus genome sequencing, sequence data should be processed within 'sequence-oriented resources' like GLUE. These scalable software systems encapsulate domain knowledge relevant to various analysis objectives, enabling researchers to work collaboratively across related fields of study.
GLUE by Robert J. Gifford Lab.
For questions, issues, or feedback, please open an issue on the GitHub repository.
- Project Data Model
- Schema Extensions
- Modules
- Alignments
- Variations
- Scripting Layer
- Freemarker Templates
- Example GLUE Project
- Command Line Interpreter
- Build Your Own Project
- Querying the GLUE Database
- Working With Deep Sequencing Data
- Invoking GLUE as a Unix Command
- Known Issues and Fixes
- Overview
- Hepatitis Viruses
- Arboviruses
- Respiratory Viruses
- Animal Viruses
- Spillover Viruses
- Virus Diversity
- Retroviruses
- Paleovirology
- Transposons
- Host Genes