Working with Diversity Mouse Population Data

Analysis Publications

Analysis publications

  • Logan RW et al
    High-precision genetic mapping of behavioral traits in the diversity outbred mouse population
    Genes Brain Behav. 2013 Jun;12(4):424-37. doi: 10.1111/gbb.12029. Epub 2013 Mar 20.
    PMID: 23433259

  • Svenson KL et al
    High-resolution genetic mapping using the Mouse Diversity outbred population
    Genetics. 2012 Feb;190(2):437-47. doi: 10.1534/genetics.111.132597
    PMID: 22345611

  • Broman K
    Haplotype Probabilities in Advanced Intercross Populations
    G3 (Bethesda). 2012 Feb; 2(2): 199–202.
    PMID: 3284327

  • Broman K
    Genotype Probabilities at Intermediate Generations in the Construction of Recombinant Inbred Lines
    Genetics. 2012 Feb; 190(2): 403–412.
    PMID: 3276635

Genotypes and Phenotypes

What do I do with my genotype / phenotype data?

  • Diversity Outbred Database (DODB), citation: - DODB provides a location for investigators to submit their DO experiments including associated genotype array data (MUGA platforms), clinical phenotype data, associated publication(s) and other types of supporting data such as: RNA expression, eQTL, proteomics, metabolomics, etc...  The download interface acts as the primary entry point for researchers who want to interrogate existing DO studies that have been submitted.  There is also an application programming interface (API) to allow data analysts to pull study information from languages like R or Python.  DODB allows searching genotype and phenotype data for DO experiments by Publication, Investigator, Project, associated data types submitted with projects, and provides the ability to download full studies or subsets of studies in zipped CSV or R/QTL2 formats.  While a user can download subsets of a study's samples, genotypes or phenotypes; other supporting data such as expression, proteomic and metabalomic data are provided as whole project file downloads.  As the DODB query interface evolves to be the Diversity platform's entry point for interrogating data, we will begin to allow a user to launch some analysis pipelines, and have visualization endpoints allowing a user to explore the analyzed results (QTLViewer being an example endpoint).
  • Mouse Phenome Database  - A highly curated phenotype data repository and analysis platform for experimental mouse data.  This will be the primary repository for data from BXD, CC and DO phenotyping experiments.  Part of our current initiative is to fully integrate MPD and DODB, so that data submissions for the two web applications are collected through one intake and curation platform, ensuring that data is stored once, is consistent and well curated between the two specialized applications.  We are also planning integration between the two interfaces so that a user is taken to the tool that best meets their investigative needs. The data intake and curation platform that is currently under development will allow an investigator to submit their data online, and do some initial self curation before submitting the data to our team of professional curators.
  • QTL Viewer, source code:, example instance for DO Islet: (associated with paper: Keller, et al. Genetic Drivers of Pancreatic Islet Function, PMID 29567659) -  QTL Viewer is an interactive web-based analysis tool that will allow users to replicate the analyses reported for a study (in the example link, the afore mentioned paper). It includes the ability to search various subsets of data from a study such as phenotypes or  expression data and then visualize data with profile, correlation, LOD, effect, mediation and SNP association plots.
  • HaploQA - - A web application for performing haplotype analysis of genotype calls from the “MUGA” platform genotyping arrays. The application was developed at the Jackson Laboratory to facilitate genetic quality assurance of mice using genotype data derived from these platforms.  The tool allows the community to examine data sets which have been publicly released by viewing Karoytype plots generated by haplotype reconstructions.  An individual can also contact the team to register and receive an account, which will allow them to upload their own data (MegaMUGA or GigaMUGA genotypes), have haplotype reconstructions run, and examine their data using a private account, and if they choose, share that data publicly.  An individual can also set up their own private instance, with source code available here:
  • GeneNetwork, - GeneNetwork is a web based genetics platform, formally known as WebQTL. This is considered to be the primary web platform for searching and interrogating data from BXD lines.
  • R/qtl2 - An R package for QTL analysis for high-dimensional data and complex crosses.  This is a reimplementation of the QTL analysis software R/qtl, to better handle these types of data.  We consider this to be the de facto package for command-line analysis of DO data.

For questions associated with these tools, send email to

Mouse Human Comparisons

How can I compare my results with human data?

  • GeneWeaver - - A system for the integration and analysis of heterogeneous functional genomics data. A powerful tool for mapping mouse data to other species and for discovery of gene → trait interactions. 
  • Synteny Browser -  - Conserved synteny describes a condition in which common ancestry is reflected as similar genome feature content and order along a chromosome in different species. The JAX Synteny Browser allows users to search for and selectively display genome features within syntenic blocks according to the biological and functional annotations associated with the features. The most common use cases for this tool involve uploading genomes of two or more species and explore the relationships of conserved synteny between the two at different levels of depth and detail. Based on a selected reference (aka source) and comparison (aka target or destination) species, users have the ability to search for conserved features by name, function, or phenotype in the reference genome and can investigate the corresponding matched features within the comparison genome.
  • The Human - Mouse: Disease Connection - The Human - Mouse: Disease Connection (HMDC) is designed to facilitate the identification of published and potential mouse models of human disease, the discovery of candidate genes and the investigation of phenotypic similarity between mouse models and human patients.
  • The Monarch Initiative - - An effort to integrate, align, and re-distribute cross-species gene, genotype, variant, disease, and phenotype data, provide a portal for exploration of phenotype-based similarity, facilitate identification of animal models of human disease through phenotypic similarity, enable quantitative comparison of cross-species phenotypes, develop embeddable widgets for data exploration, influence genotype and phenotype reporting standards, and improve ontologies to better curate genotype-phenotype data.
For questions associated with these tools, send email to

Other Analyses

Tools for building and testing hypotheses

  • g2gtools - creates custom genomes by incorporating (phased) SNPs and indels into reference genome, extracts regions of interest, e.g., exons or transcripts, from custom genomes, and converts coordinates of files (bam, gtf, bed) between two genomes.
  • EMASE - An expectation maximization algorithm for allele specific expression. Primary author K. Choi of the Jackson Laboratory. Published in Bioinformatics by Raghupathy/Choi, et al. Hierarchical analysis of RNA-seq reads improves the accuracy of allele-specific expression, 2018 PMID: 29444201.
  • gbrsGenotype-free genome reconstruction and ASE quantification.  Example use case deducing potential sample mixups by comparison of GigaMUGA haplotype reconstructions to haplotypes deduced from islet RNA-seq-based genotype-by-sequencing method  (mentioned in Chick/Munger et al. Defining the consequences of genetic variation on a proteome–wide scale, 2016 PMID: 5292866).
  • Systems Genetics tool, - Webtool makes use of the multilayered datasets from the BXD mouse population to expedite in silico gene function prediction through a series of integrative and complimentary systems analytical approaches. 

Tools for mediation analysis

  • QTL Viewer - mentioned above
  • Intermediate - An R package for eQTL/pQTL mediation analysis (mentioned in Chick/Munger et al. Defining the consequences of genetic variation on a proteome–wide scale, 2016 PMID: 5292866)
For questions associated with these tools, send email to