Background: As part of an ongoing efforts to characterize genetic diversity in the nuclear and cytoplasmic genomes of D. melanogaster, the Haddrill and Bergman Labs have collaborated to sequence the complete genomes of 20 D. melanogaster isofemale strains collected by Penny Haddrill in Montpellier, France in August 2010. These 20 genomes represent a random sample of the full collection described by Haddrill and Vespoor (2011), which also describes microsatellite variation data for these strains.
Following on from the very generous early release of D. melanogaster genomes by major resequencing efforts in Drosophila, we have decided to follow suit and release these genomes prior to publication to maximise their utility by the wider research community and prevent unnecessary duplication of effort. One major aim for our sequencing of a reasonably high number of strains from this European population is to provide a complementary dataset to help interpret the larger samples of North American and (predominantly) African strains from the Drosophila Genetic Reference Panel and Drosophila Population Genomics Project, respectively. For more on the philosophy behind why we have made the decision to release these data early, please see this blog post on genomic data release by individual labs in the next-generation sequencing era.
Methods: Genomic DNA was prepared by Penny Haddrill for each isofemale line by pooling fifty females, snap freezing them in liquid nitrogen, extracting DNA using a standard phenol-chloroform extraction protocol with ethanol and ammonium acetate precipitation. 500 bp short-insert libraries were constructed and 91 bp paired-end reads were generated using an Illumina HiSeq 2000 to an estimated coverage of ~50x per strain by BGI-Hong Kong. Basic QC on reads was performed by BGI and mapping to the Wolbachia genome following the protocol in Richardson et al. (submitted) confirmed the same infection status for as determined by PCR in Haddrill and Vespoor (2011).
Conditions for use: The Haddrill and Bergman labs intend to use these data to study patterns of genetic diversity in the nuclear and cytoplasmic genomes, to estimate the ratio of diversity on the X chromosome relative to the autosomes, to detect signatures of both positive and negative selection in the nuclear and cytoplasmic genomes, and investigate the impact of variation in recombination rate around the genome.
We have decidede to release these genomic data under a Creative Commons CC-BY license, which requires only that you credit the originators of the work as specified below. However, we hope that users of these data respect the established model of genomic data release under the Ft. Lauderdale agreement that is traditionally honored for major sequencing centers. Until the paper describing these genomes is published, please cite these data as:
- Haddrill, P. and C.M. Bergman (2012) 20 Drosophila melanogaster genomes from Montpellier, France. http://bergmanlab.smith.man.ac.uk/?p=1685
We also ask that downloads of this data be conducted in serial to allow normal functioning of live web services running on this server (see below).
The data: Gzipped Illumina fastq files for forward and reverse paired reads can be downloaded at the following locations. A script to dowload all files in serial can be found below the table.
|
Strain
|
Forward (*_1.fq.gz)
|
Reverse (*_2.fq.gz)
|
|---|---|---|
| FR23 | FR23_1.fq.gz | FR23_2.fq.gz |
| FR24 | FR24_1.fq.gz | FR24_2.fq.gz |
| FR25 | FR25_1.fq.gz | FR25_2.fq.gz |
| FR26 | FR26_1.fq.gz | FR26_2.fq.gz |
| FR28 | FR28_1.fq.gz | FR28_2.fq.gz |
| FR29 | FR29_1.fq.gz | FR29_2.fq.gz |
| FR30 | FR30_1.fq.gz | FR30_2.fq.gz |
| FR31 | FR31_1.fq.gz | FR31_2.fq.gz |
| FR32 | FR32_1.fq.gz | FR32_2.fq.gz |
| FR33 | FR33_1.fq.gz | FR33_2.fq.gz |
| FR34 | FR34_1.fq.gz | FR34_2.fq.gz |
| FR35 | FR35_1.fq.gz | FR35_2.fq.gz |
| FR37 | FR37_1.fq.gz | FR37_2.fq.gz |
| FR38 | FR38_1.fq.gz | FR38_2.fq.gz |
| FR39 | FR39_1.fq.gz | FR39_2.fq.gz |
| FR42 | FR42_1.fq.gz | FR42_2.fq.gz |
| FR44 | FR44_1.fq.gz | FR44_2.fq.gz |
| FR45 | FR45_1.fq.gz | FR45_2.fq.gz |
| FR46 | FR46_1.fq.gz | FR46_2.fq.gz |
| FR48 | FR48_1.fq.gz | FR48_2.fq.gz |
A shell script to dowload all files in serial can be found below.
#!/bin/bash wget http://bergman.smith.man.ac.uk/data/genomes/FR23_1.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR23_2.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR24_1.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR24_2.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR25_1.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR25_2.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR26_1.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR26_2.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR28_1.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR28_2.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR29_1.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR29_2.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR30_1.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR30_2.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR31_1.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR31_2.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR32_1.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR32_2.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR33_1.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR33_2.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR34_1.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR34_2.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR35_1.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR35_2.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR37_1.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR37_2.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR38_1.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR38_2.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR39_1.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR39_2.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR42_1.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR42_2.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR44_1.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR44_2.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR45_1.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR45_2.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR46_1.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR46_2.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR48_1.fq.gz wget http://bergman.smith.man.ac.uk/data/genomes/FR48_2.fq.gz



