Journal Article FZJ-2023-01932

DivBrowse—interactive visualization and exploratory data analysis of variant call matrices

2023
Oxford University Press, Oxford

GigaScience 12, giad025 (2023) [10.1093/gigascience/giad025]

Please use a persistent id in citations: doi:10.1093/gigascience/giad025

Abstract: Background: The sequencing of whole genomes is becoming increasingly affordable. In this context, large-scale sequencing projects are generating ever larger datasets of species-specific genomic diversity. As a consequence, more and more genomic data need to be made easily accessible and analyzable to the scientific community. Findings: We present DivBrowse, a web application for interactive visualization and exploratory analysis of genomic diversity data stored in Variant Call Format (VCF) files of any size. By seamlessly combining BLAST as an entry point together with interactive data analysis features such as principal component analysis in one graphical user interface, DivBrowse provides a novel and unique set of exploratory data analysis capabilities for genomic biodiversity datasets. The capability to integrate DivBrowse into existing web applications supports interoperability between different web applications. Built-in interactive computation of principal component analysis allows users to perform ad hoc analysis of the population structure based on specific genetic elements such as genes and exons. Data interoperability is supported by the ability to export genomic diversity data in VCF and General Feature Format 3 files. Conclusion: DivBrowse offers a novel approach for interactive visualization and analysis of genomic diversity data and optionally also gene annotation data by including features like interactive calculation of variant frequencies and principal component analysis. The use of established standard file formats for data input supports interoperability and seamless deployment of application instances based on the data output of established bioinformatics pipelines.

Contributing Institute(s):
  1. Bioinformatik (IBG-4)
Research Program(s):
  1. 2171 - Biological and environmental resources for sustainable use (POF4-217)
  2. de.NBI (deNBI)
  3. DFG project 491250510 - Open-Access-Publikationskosten / 2022-2024 / Leibniz-Institut für Pflanzengenetik und Kulturpflanzenforschung (IPK), Gatersleben (491250510)

Appears in the scientific report 2023
Database coverage:
Medline ; Creative Commons Attribution CC BY 4.0 ; DOAJ ; OpenAccess ; BIOSIS Previews ; Clarivate Analytics Master Journal List ; Current Contents - Agriculture, Biology and Environmental Sciences ; Current Contents - Life Sciences ; DOAJ Seal ; Ebsco Academic Search ; IF >= 5 ; JCR ; PubMed Central ; SCOPUS ; Web of Science Core Collection ; Zoological Record

The record appears in these collections:
Document types > Articles > Journal Article
Institute Collections > IBG > IBG-4
Workflow collections > Public records
Publications database
Open Access

 Record created 2023-04-24, last modified 2024-02-26

