The increasing amount of unparalleled heterogeneous genomic data generated today necessitates a robust platform for dealing with the practical issues of genomic data interoperability, structure, standardization, security, quality, pre-processing, governance development, long-term support, management of exponential growth of genomic applications and their datasets of enormous size and diversity. Addressing most of these living challenges and practical issues to the genomic big data club, here, we present a new scientific platform i.e. MAV-seq, towards automated management and processing of Next Generation Sequencing (NGS) data.
MAV-seq (Management, Analysis, Visualization of Sequence data) is an interactive, user friendly, cross platform, secure, encrypted, automated, customized, centralized, multi-roles based database application for the management of sample repertoires and automation of the data pre-processing of epigenomic and transcriptomic data. MAV-seq is a robust solution, which integrates different organizational units for efficient data sharing and communication. It supports, (i) management of research, experiment, sample and NGS metadata; (ii) controlled access to the centralized and distributed storage and high performance computing (HPC) resources for NGS data processing; (iii) interactive, automated and standardized quality checking, pre-processing and analysis of NGS data with visualization and report generation of obtained results; and (iv) partial integration with existing LIMS
MAV-seq is a secure database management system which deals with the security threats including privilege abuse, weak authentication, weak system configuration, backup, front and back end system vulnerabilities. It applies different data encryption algorithms to encode data and provides controlled system’s access to the users based on their roles and privileges.
Review the MAV-seq multi-role based user interface.
MAV-seq integrates various genomic data quality check and pre-processing pipelines (e.g. mRNA-seq, WGS, WES, ATAC-seq, ChIP-seq, PDX-mRNA-seq, PDX-WGS etc.) with user-friendly graphical interface to enable biologist with no programming experience to process their NGS datasets. With this platform, we aim to simplify management and storage of NGS datasets including the standardization and automation of quality control and basic processing steps. MAV-seq is very simple and easy to learn platform, which does not require bioinformatics and programming abilities.