Sequence files background
We’ve extracted totalRNA and shipped it to Azenta for sequencing… so what happens next? How did we go from working with microcentrifuge tubes in the lab to large files on the computer?
First they run the total RNA through quality assessment and quality control steps (QAQC) to make sure it is sufficient in quantity and quality to sequence. Here is the QAQC report from Azenta:
The total RNA is then subjected to library prep, where the RNA is turned into cDNA.
The cDNA is what is actually sequenced, with an Illumina sequencer, 20 million reads, Poly-A selection,
The raw FASTA files come back demultiplexed
.
Coding resources
This roadmap was built off the following resources and references from:
Sam White
-
<<<<<<< HEAD
- Sam’s Notebook =======
- Notebook
- QAQC post >>>>>>> 53c082d (pipeline overview update rendered)
Babraham Bioinformatics Institute:
Erin Chille:
- Mcapitata_OA_Developmental_Gene_Expression_Timeseries github repository from the [@chille2022]paper
Steven Roberts:
- Bioinformatics FISH 546 course at UW
- Lab Handbook
Savanah Liedholt:
Ariana Huffmyer:
- EarlyLifeHistory_Energetics TagSeq github repository
Sarah Tanja:
Pipeline birds-eye view
1. Receive raw FASTA files
files are already
demultiplexed
files have a
.fasta.gz
zipped formatfiles must checked to make sure there were no errors in the transfer process (this is done with
md5sum
)