...
ALF1_16S_Demux.csv
is used to map MIDS to sample names and projects.
Stopped here on 8-30-22
Splitting to fastq for individuals
The info lines for each read in parsed_ALF16S_R1_*.fastq
and parsed_ALF16S_R2_*.fastq
have the locus, the forward mid, the reverse mid, and the sample name. These can be used with the demux key to separate reads into loci, projects, and samples, in the folder sample_fastq/
. The reads are in separate files for each sequenced sample, including replicates. The unique combination of forward and reverse MIDs (for a locus) is part of the filename and allows replicates to be distinguished and subsequently merged.
run_splitFastq_fwd.sh
Stopped here on 8-31-22
and run_splitFastq_rev.sh
run splitFastq_manyInputfiles.pl
, which steps through the many pairs of files to split reads by sample and project, and place them in /project/gtl/data/raw/ALF1/16S/rawdata/sample_fastq/
...