Per base sequence content

Analysis by base along each sequence

PerBaseSeqContentheatmap

Clic on one sample to see bases along each sequence:

PerBaseSeqContentoneSample

Each of the 4 DNA bases is normally found with about the same percentage (little or no difference between bases) as the sequence is read.

Therefore, the lines on the graph should be parallel to each other. The relative amount of each base should reflect the overall amount of these bases in your genome, but in any case there should not be huge imbalances from one another.

If there is an imbalance between the different bases, this usually indicates that a sequence is overrepresented and therefore your library is contaminated. If this bias is consistent across all bases, it indicates either :

  • that the original library was biased,
  • or that there is a systematic problem during the sequencing of the library.

Warning

This module issues a warning if the difference between A and T, or U and C is greater than 10% in any position.

Failure

This module fails if the difference between A and T, or U and C is greater than 20% in any position.

Common reasons for warnings

Example of good data Good data:

  • Smooth overlength: the lines run parallel with each other.
  • Organims dependant (GC content)

Example of bad data Bad data: Sequence position bias.

results matching ""

    No results matching ""