Reconciling TN sample list and our transcribing

(July 14th, 2021) The tubes to be plated from the most recent TN shipment, are currently arranged in 96 position tube racks awaiting processing. They are arranged per the “TubesToBePlated” sheet of this workbook. We are still missing the 10 samples listed below.

@Alex Buerkle used the complicated TN spreadsheet to create a list of tubes that need to be amplicon sequenced. @Shannon Harris had already arranged the tubes they had sent and transcribed the hand written labels into a spreadsheet and added them to 96 tube racks. I attempted to reconcile the tobeplated list from Alex’s data manipulation with this using this r-script and these data. I was able to locate 293 of the 303 tobeplated tubes and created a pull list. I found one transcription error. There are still 10 tubes I could not locate:

Site_101_Soil_3

Site_102_Soil_29

Site_102_Alf_Epi_7

Site_102_Trans_Epi_7

Site_111_Alf_Epi_10

Site_111_Trans_Epi_1

Site_127_Trans_Epi_9

Site_129_Alf_Epi_5

Site_129_Alf_Epi_25

Site_130_Alf_Epi_30

I tried to delve into the data to see if I could locate these 10 samples. I did not have any luck. I do not think it is a transcription error when I anti joined samplesInSites against our spreadsheet and then the spreadsheet against samplesInSites there was no easily discernible pattern. The list of sites between the 2 even disagreed. And, there were 212-228 disagreements.

212 samples from the transcribed list do not match anything in samplesInSites (an R object that reflects the spreadsheets that came from Tennessee). These are in DisputedTubes in the R workspace.

@Gregg Randolph : on line 361 of manageAlfalfa2.R, you have ShippedTubes <- samplesInSites[!(samplesInSites$site %in% unique(samplesInPlates$site)),]. I am not entirely sure what this subsetting was for, but it turns out that some (maybe all of our sample names that we thought we were missing (the 212 above) are in the samplesInSites variable, but not in ShippedTubes. For example, forty of the 212 samples, were attributable to site 107. There is not Site 107 in ShippedTubes, but these samples do appear to be in samplesInSites.

grep("^Site_107", ShippedTubes$modmodname, value = TRUE) returns no values

samplesInSites[samplesInSites == 107,]

Sites with Tube Samples Not Matching from TN spreadsheet

Sites with Tube Samples Not Matching from our spreadsheet

Sites with Tube Samples Not Matching from TN spreadsheet

Sites with Tube Samples Not Matching from our spreadsheet

101

101

102

 

 

103

 

104

 

107

 

108

 

109

 

110

111

 

 

116

127

 

128

 

129

 

So, something is still fishy with the TN spreadsheet concerning tubes.