The Collaborative Cross (CC) is a multiparent recombinant inbred strain mouse panel derived from eight founder inbred strains. A distinct advantage of recombinant inbred panels is that detailed characterization of their genomes does not need to be performed by each user. Until now, the CC genomes were haplotype reconstructions based on dense genotyping of the most recent common ancestors (MRCAs) of each strain, followed by imputation from the genome sequence of the corresponding founder inbred strain. The MRCA resource had the advantage that it captured segregating regions in strains that were not fully inbred, but it had limited resolution in the transition regions between founder haplotypes and resulted in uncertainty about founder assignment in regions of limited diversity. Here we report the whole genome sequence of 69 CC strains, generated by paired-end short reads at 30X coverage of a single male per strain. Sequencing results in a substantial improvement in the fine structure and completeness of the genomes of the CC. Both MRCAs and sequenced samples have a significant reduction in the genome-wide haplotype frequencies of two of the wild-derived strains, CAST/EiJ and PWK/PhJ. In addition, analysis of the evolution of the patterns of heterozygosity indicates that selection against three wild-derived founder strains played a significant role in shaping the genomes of the CC. The sequencing resource provides the first description of tens of thousands of new genetic variants introduced by genetic drift in the CC genomes. The CC strains represent an extreme example of the principle that genetic drift is expected to have maximum impact in populations with a small effective size and a high level of inbreeding. We estimate that new SNP mutations are accumulating in each CC strain at a rate of 2.4 per Gb per generation. The majority of these mutations are novel compared to currently sequenced laboratory stocks and wild mice, and some are predicted to alter gene function.
Overall, genetic drift has increased the number of variants segregating among CC strains by more than 2%. Approximately one third of the CC inbred strains have acquired large deletions (>10 kb), many of which overlap known coding genes and functional elements. In conclusion, we provide a critical resource to users of the CC, increase threefold the number of mouse inbred strain genomes available publicly, and provide a striking example of the effect of genetic drift on common resources.
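To give a sense of scale for the reported rate of 2.4 new SNPs per Gb per generation, the arithmetic can be sketched as follows. This is an illustrative back-of-the-envelope calculation, not the authors' estimation method: the genome size of about 2.7 Gb and the generation count are assumptions introduced here, and the function name `expected_new_snps` is hypothetical.

```python
def expected_new_snps(rate_per_gb_per_gen: float,
                      genome_size_gb: float,
                      generations: int) -> float:
    """Expected count of new SNPs under a constant accumulation rate.

    Simple linear model: rate x genome size x generations.
    """
    if rate_per_gb_per_gen < 0 or genome_size_gb < 0 or generations < 0:
        raise ValueError("all inputs must be non-negative")
    return rate_per_gb_per_gen * genome_size_gb * generations


RATE = 2.4         # new SNPs per Gb per generation (from the abstract)
GENOME_GB = 2.7    # ASSUMPTION: approximate mouse genome size in Gb
GENERATIONS = 20   # ASSUMPTION: hypothetical number of inbreeding generations

per_generation = expected_new_snps(RATE, GENOME_GB, 1)
total = expected_new_snps(RATE, GENOME_GB, GENERATIONS)

print(f"New SNPs per generation per strain: {per_generation:.2f}")
print(f"New SNPs after {GENERATIONS} generations: {total:.1f}")
```

Under these assumptions each strain would gain roughly six to seven new SNPs per generation. The linear model treats the reported rate as constant across generations, so the totals are only an order-of-magnitude guide.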
whole genome sequence, drift, selection, genetic variants