www.internationalgenome.org
193.62.193.83  Public Scan

Submitted URL: http://phase3browser.1000genomes.org/
Effective URL: https://www.internationalgenome.org/data
Submission: On November 17 via api from US — Scanned from GB

Form analysis: 1 form found in the DOM

GET /data-portal/search

<form class="navbar-form navbar-right" role="search" action="/data-portal/search" method="get">
  <div class="input-group center">
    <div class="form-group">
      <input type="text" class="form-control" placeholder="Search IGSR" name="q" id="search_id">
    </div>
    <span class="input-group-btn">
      <button type="submit" class="btn btn-default"><span class="glyphicon glyphicon-search"></span></button>
    </span>
  </div>
</form>
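
As a minimal sketch of what this form submits, the snippet below builds the equivalent GET URL by hand. The path and the `q` parameter come from the form markup above, and the host is taken from the effective URL; the function name `build_search_url` is made up for illustration, and how the portal interprets the query is not covered here.

```python
from urllib.parse import urlencode, urljoin

SITE = "https://www.internationalgenome.org"  # host of the effective URL above
SEARCH_PATH = "/data-portal/search"           # the form's action attribute


def build_search_url(query: str) -> str:
    """Return the GET URL the search form would request for `query`.

    Hypothetical helper: the form has a single text input named "q",
    so the query string carries only that one parameter.
    """
    return urljoin(SITE, SEARCH_PATH) + "?" + urlencode({"q": query})


if __name__ == "__main__":
    print(build_search_url("NA12878"))
```

Using `urlencode` rather than string concatenation keeps queries with spaces or punctuation correctly escaped.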

Text Content


IGSR: THE INTERNATIONAL GENOME SAMPLE RESOURCE


SUPPORTING OPEN HUMAN VARIATION DATA

USING DATA FROM IGSR

IGSR provides open data to support the community’s research efforts. You can see
our terms of use in our data disclaimer. Please also consult the associated data
reuse statements and cite associated publications appropriately. To cite IGSR,
please use our NAR paper.


EXPLORE THE DATA SETS IN IGSR THROUGH OUR DATA PORTAL

IGSR shares data files from many studies via our FTP site. To make it easier to
find the files you want, we present key data sets in our data portal.

Files can be browsed by:

 * sample (e.g. NA12878)
 * population (e.g. Yoruba in Ibadan, Nigeria)
 * technology (e.g. PacBio HiFi)
 * data type (e.g. alignment)
 * collection (e.g. 1000 Genomes Project phase three)

Our portal provides an overview of the available collections and their
associated publications.


VIEW VARIANTS IN GENOMIC CONTEXT IN ENSEMBL

IGSR works alongside the Ensembl genome browser. Ensembl presents some of the
key call sets in IGSR, placing the variation data in genomic context and adding
up-to-date annotation to its displays for individual variants.

In Ensembl you can:

 * Browse the 1000 Genomes Project phase three call set on GRCh37
 * Browse data from the 1000 Genomes Project samples and other data sets on
   GRCh38
 * View data for a specific variation and search by rsID
 * View population frequency data
 * Use a selection of tools to retrieve subsets of data, convert VCF to PED and
   calculate linkage disequilibrium


DOWNLOAD DATA FROM THE IGSR FTP SITE

The full set of files hosted by IGSR is available on our FTP site. This
includes data shared pre-publication, as well as intermediate and working data
for projects where we contribute to the project’s data management. A set of
README files provides additional information.

The data can be downloaded via FTP, Aspera and Globus GridFTP. More information
about using Aspera or Globus can be found in our FAQ.

How to download files using Aspera
How to download files using Globus

FTP HIERARCHY

The FTP structure was changed in September 2015. The revised structure is
described in the FTP site structure README.

OTHER DATA SOURCES

During the main 1000 Genomes Project, the NCBI mirrored the EBI-hosted 1000
Genomes Project FTP site and also uploaded alignments and variant calls to an
Amazon S3 bucket. This mirroring stopped in September 2015. The NCBI FTP site
and the Amazon S3 bucket still host 1000 Genomes Project data but no longer
mirror new data. Both locations reflect the structure of the FTP site in
August 2015 and hold all the pilot, phase 1 and phase 3 data. Neither NCBI nor
Amazon holds the new alignments based on GRCh38, the current reference genome.

NCBI FTP Site : ftp://ftp-trace.ncbi.nih.gov/1000genomes/ftp
Amazon S3 : s3://1000genomes
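
For readers who want to script against the NCBI mirror, the following is a rough sketch using Python's standard `ftplib`. It assumes the server accepts anonymous logins and is still reachable at the address listed above; neither is confirmed here. The helper names `split_ftp_url` and `list_directory` are hypothetical.

```python
from ftplib import FTP
from urllib.parse import urlparse

# Mirror address exactly as listed above.
NCBI_MIRROR = "ftp://ftp-trace.ncbi.nih.gov/1000genomes/ftp"


def split_ftp_url(url: str) -> tuple[str, str]:
    """Split an ftp:// URL into (host, path)."""
    parts = urlparse(url)
    if parts.scheme != "ftp" or not parts.hostname:
        raise ValueError(f"not an FTP URL: {url!r}")
    return parts.hostname, parts.path or "/"


def list_directory(url: str, timeout: float = 30.0) -> list[str]:
    """List the entries under `url`.

    Assumption: the server allows anonymous login. This function needs
    network access and is not called when the module is imported.
    """
    host, path = split_ftp_url(url)
    with FTP(host, timeout=timeout) as ftp:
        ftp.login()            # no arguments means an anonymous login
        return ftp.nlst(path)  # names of the entries under `path`


if __name__ == "__main__":
    print(split_ftp_url(NCBI_MIRROR))
```

Calling `list_directory(NCBI_MIRROR)` would attempt the actual listing; it is left out of the main block so the sketch runs offline.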

Information on Amazon Web Services can be found on the 1000 Genomes public data
set page or directly on http://s3.amazonaws.com/1000genomes.
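
The two Amazon addresses above name the same bucket in different forms. As an illustrative sketch, an `s3://` URI can be rewritten into the path-style HTTP form shown above. The function name `s3_to_http` and the object key in the usage line are hypothetical, and whether a given object is publicly readable over HTTP is an assumption.

```python
from urllib.parse import quote, urlparse

# HTTP endpoint taken from the address quoted above.
S3_HTTP_ENDPOINT = "http://s3.amazonaws.com"


def s3_to_http(s3_uri: str) -> str:
    """Rewrite s3://bucket/key as http://s3.amazonaws.com/bucket/key."""
    parts = urlparse(s3_uri)
    if parts.scheme != "s3" or not parts.netloc:
        raise ValueError(f"not an S3 URI: {s3_uri!r}")
    bucket = parts.netloc
    key = quote(parts.path.lstrip("/"))  # escape the key, keep "/" separators
    base = f"{S3_HTTP_ENDPOINT}/{bucket}"
    return f"{base}/{key}" if key else base


if __name__ == "__main__":
    print(s3_to_http("s3://1000genomes"))
    # "some/example.key" is a made-up object key, used only to show the shape.
    print(s3_to_http("s3://1000genomes/some/example.key"))
```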

For a small number of newer data sets, data has been added to AWS and AnVIL.
Where this is the case, it is noted in our portal.

© EMBL-EBI 2008-2021

Site maintained by EMBL-EBI | Terms of Use, Privacy and Cookies


The International Genome Sample Resource (IGSR) has been established at EMBL-EBI
to continue supporting data generated by the 1000 Genomes Project, supplemented
with new data and new analysis.