On March 29th, the NIH and Amazon Web Services jointly announced the public availability of the 1000 Genomes data set. It will ultimately include data from more than 2,600 samples and is being touted as the world’s largest collection of data on human genetic variation.
Most of the samples used for the project have been anonymized and do not have any associated clinical or phenotypic data. Researchers will be able to use the data set on its own or in combination with data collected from their own projects.