Day two
Requests:
editing misspellings or irregular capitalization
substitution of field separators
…
A brief explanation of the concept of a file and ASCII text. What role do pipes and the diamond operator play in prepping data for analysis?
Examples:
capturing data from a complex text string:
on the command line:
cat Pta.seq.uniq | grep "^>" | sed -E 's#.*/gi=([[:digit:]]+).*/len=([[:digit:]]+).*#\1,\2#'
cat Pta.seq.uniq | grep "^>" | sed -E 's#.*/gi=([[:digit:]]+).*#\1#' | sort | uniq| wc -l
cat Pta.seq.uniq | grep "^>" | sed -E 's#.*/gi=([[:digit:]]+).*#\1#' | wc -l
in R … examples with sub() from weather station data
editing misspellings or irregular capitalization
substitution of field separators
Day one
For our first work with regular expressions, we will start with a browser-based tool. Please bring a laptop. For experimenting with the UNIX tools we will use laptops that run linux or UNIX, log into a remote linux machine (teton), or share computers as needed. Below are the data we will work with.
...