How can you access your sequencing data?

< Back
You are here:

When sequencing runs are finished, we automatically upload FastQ files to a client’s Box.com account.

These files are typically found in the directory:

  • ClientName-> Reports-> ReportID-> FastQ

The FastQ files in this directory use the following naming convention:

  • ClientName_SampleID_SampleName_R1.fastq.gz
  • ClientName_SampleID_SampleName_R2.fastq.gz

where

  • ClientName = the name of the client’s organization
  • SampleID = an internal sample identifier (you can generally ignore this)
  • SampleName = this is the sample name supplied to The Sequencing Center by the client; it typically corresponds to the sample vial name
  • R1 = the forward read identifier
  • R2 = the reverse read identifier
  • fastq = identifies this as a file in FastQ format
  • gz = the files are compressed in gzip (“.gz”) format

 

In most, but not all, sequencing runs we use paired-end reads.  In this format there is a forward read, “R1”, and a reverse read, “R2”, which together form a single paired-end read. The forward reads and reverse reads are stored in separate files.  Many bioinformatics applications process these two files together, so you can keep track of them with the “R1” and “R2” identifiers.

Depending on the sequencing type, we may also include an “Alignments” or “Genotype” folder where we will store additional bioinformatics results.