Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Info

Status (02 May 2022)

  • Data arrived by sftp on 28 April 2022. Everything below is modified from Bioinformatics for Novaseq run 4 ). I expect to finish processing, Excluding the graph correlating sequences removed and original assigned reads, Data processing finished 5-02-22.

  • Current figures are irrelevant placeholders. Text in purple and sandwiched between ______________________ has not been successfully run.

...

_______________________________________________________________

I used commands.R in that folder to make a plot of numbers of reads per sample (horizontal axis) and the number reads that were removed because they did not merge, or did meet quality criteria and were filtered out (vertical axis). Purple is for 16S and orange is for ITS. It might be interesting to do that plot for each of the projects in the library (TODO), and possibly to have marginal histograms (or put quantiles on the plots).

...

_______________________________________________________________

...

In /project/microbiome/data_queue/seq/psomagen_6mar20/coligoISD, /project/microbiome/data/seq/psomagen_26may20/coligoISD, and /project/microbiome/data/seq/psomagen_29jan21novaseq1c/coligoISD, there are 16S and ITS directories for all projects. These contain a file named coligoISDtable.txt with counts of the coligos and the ISD found in the trimmed forward reads, per sample. The file run_slurm_mkcoligoISDtable.pl has the code that passes over all of the projects and uses vsearch for making the table.

...

5-2-2022 Alex Buerkle transferred all of this to /project/microbiome/data/seq/cu_29april22novaseq5