Journal Article FZJ-2023-01932

DivBrowse—interactive visualization and exploratory data analysis of variant call matrices

2023
Oxford University Press, Oxford

GigaScience 12, giad025 (2023) [10.1093/gigascience/giad025]

Please use a persistent id in citations: doi:10.1093/gigascience/giad025

Abstract:

Background: The sequencing of whole genomes is becoming increasingly affordable. In this context, large-scale sequencing projects are generating ever larger datasets of species-specific genomic diversity. As a consequence, more and more genomic data need to be made easily accessible and analyzable to the scientific community.

Findings: We present DivBrowse, a web application for interactive visualization and exploratory analysis of genomic diversity data stored in Variant Call Format (VCF) files of any size. By seamlessly combining BLAST as an entry point together with interactive data analysis features such as principal component analysis in one graphical user interface, DivBrowse provides a novel and unique set of exploratory data analysis capabilities for genomic biodiversity datasets. The capability to integrate DivBrowse into existing web applications supports interoperability between different web applications. Built-in interactive computation of principal component analysis allows users to perform ad hoc analysis of the population structure based on specific genetic elements such as genes and exons. Data interoperability is supported by the ability to export genomic diversity data in VCF and General Feature Format 3 files.

Conclusion: DivBrowse offers a novel approach for interactive visualization and analysis of genomic diversity data and optionally also gene annotation data by including features like interactive calculation of variant frequencies and principal component analysis. The use of established standard file formats for data input supports interoperability and seamless deployment of application instances based on the data output of established bioinformatics pipelines.

Contributing Institute(s):
  1. Bioinformatik (IBG-4)
Research Program(s):
  1. 2171 - Biological and environmental resources for sustainable use (POF4-217) (POF4-217)
  2. de.NBI (deNBI) (deNBI)
  3. DFG project 491250510 - Open-Access-Publikationskosten / 2022-2024 / Leibniz-Institut für Pflanzengenetik und Kulturpflanzenforschung (IPK), Gatersleben (491250510) (491250510)

Appears in the scientific report 2023
Database coverage:
Medline ; Creative Commons Attribution CC BY 4.0 ; DOAJ ; OpenAccess ; BIOSIS Previews ; Clarivate Analytics Master Journal List ; Current Contents - Agriculture, Biology and Environmental Sciences ; Current Contents - Life Sciences ; DOAJ Seal ; Ebsco Academic Search ; IF >= 5 ; JCR ; PubMed Central ; SCOPUS ; Web of Science Core Collection ; Zoological Record

The record appears in these collections:
Document types > Articles > Journal articles
Institute collections > IBG > IBG-4
Workflow collections > Public records
Publications database
Open Access

Record created 2023-04-24, last modified 2024-02-26

