What does a sequence alignment and variant call report look like?
Our sequence alignment and variant call reports are generated by a tool called Geneious Prime. We export a select list of variables including CDS, polymorphism types, codon change, amino acid change and protein effect. There is a wealth of…
How to perform de novo assembly with PATRIC
De novo assembly is a method for generating contigs, scaffolds or complete genomes from Next Generation sequencing reads (i.e. DNA fragments). Some researchers use the free, online bioinformatics tool PATRIC for de novo assembly. PATRIC…
How to perform an NCBI nucleotide BLAST search
We often use NCBI nucleotide Blast queries to identify DNA sequences derived from Next Generation Sequencing (NGS) runs. This brief note describes how to use the NCBI Blast algorithm and website with NGS DNA datasets.
Create an NCBI Account
Before…
How to annotate a genome with Geneious
Geneious Annotations
Geneious Prime includes genome annotation features and reports. Annotation is often used to describe the structure and function of various genomic regions, such as genes, CDS's (coding sequences), exons, introns, 5'-…
How to create Geneious variant call reports
Geneious Reports
We have numerous clients who use Geneious for bioinformatics data analysis. They use Geneious for sequence alignments of their sample datasets against reference genomes and for the interpretation of Geneious variant call…
Microbiome reports with Kraken
We use a variety of data analysis tools such as OneCodex and Megan6 for quantifying and visualizing metagenomic sequencing datasets.
Kraken2 is a popular taxonomic classification tool for metagenomic and microbiome sequencing results. The…
Geneious sequence alignment & variant calls workflow
Geneious Prime is one of many bioinformatics tools that we use to postprocess sequencing datasets. Geneious includes an extensive set of functions to perform sequence alignments of sample reads against reference genomes. And it includes…
Trimming Illumina adapter sequences
Adapter Sequences
During the library preparation process, Illumina adapter sequences are annealed to sequencing reads. The adapter sequences are required for attaching reads to flow cells and for attaching indexes to reads. When sequencing…
What’s included in “basic bioinformatics”?
The Sequencing Center provides basic bioinformatics as an inclusive service with all genome sequencing projects. Here's what you can expect to receive with each type of project.
Microbial Projects
FASTQ Files: All microbial sequencing projects…
Microbiome reports with Megan6
We use MEGAN6 to generate microbiome reports along with a host of other analytical and visualization tools. Here is an example of a microbiome report from MEGAN6.
Fig. 1
Fig. 1 shows a phylogram of a human nasopharyngeal sample displayed…
What is de novo assembly?
De novo assembly is a method for constructing genomes from a large number of (short- or long-) DNA fragments, with no a priori knowledge of the correct sequence or order of those fragments.
The terminology for de novo assembly is sometimes…
What are unmapped reads?
Save Unmapped Reads
One of the most common activities we perform is the sequence alignment of sequenced bacterial samples against known reference genomes. The alignment process generates two types of output: mapped reads and unmapped reads.…
Sequence alignments with Geneious
Geneious Alignments
We often use Geneious for the alignment of sequenced samples to reference genomes. When sequencing runs are finished, our sequencers generate raw FastQ files, which typically contain millions of short DNA fragments ("reads").…
Microbiome reports with One Codex
We use One Codex to generate microbiome reports along with a host of other analytical and visualization tools. Here is an example of a microbiome report from One Codex.
Fig. 1
Rank-ordered Abundance
Fig. 1 shows the rank-ordered…
How to read Geneious variant call reports
Reading Geneious Reports
We often use Geneious to generate SNP/indel variant call reports. A truncated report is available here. (The number of SNP's/indels in a typical report can be quite large so we only show a few records in this example).
Report…
How to access your sequencing data
When sequencing runs are finished, we automatically upload FastQ files to a client's Box.com account.
These files are typically found in the directory:
ClientName->Reports->ReportID->FastQ
The FastQ files in this directory…
How to use Phaster for bacteriophage bioinformatics
Phaster is a bioinformatics tool for analyzing bacteriophage sequencing data sets. It generates a variety of phage identification and annotation results that are very useful for interpreting phage sequencing data. Currently, Phaster contains…
How to read FastQ files
FastQ File Format
Illumina sequencing instruments generate FastQ files when a sequencing run is finished. FastQ files are the starting point for all downstream bioinformatics data analysis.
The file name suffix for a FastQ file is:…