This shows you the differences between two versions of the page.
user:vcordes1:portfolio:cla [2011/12/15 17:04] – vcordes1 | user:vcordes1:portfolio:cla [2011/12/15 17:33] (current) – [Attributes] vcordes1 | ||
---|---|---|---|
Line 1: | Line 1: | ||
+ | =====Purpose===== | ||
+ | The purpose here is to mine the spring 2012 class schedule, which is in HTML format, and extract specific classes. | ||
+ | =====Necessities===== | ||
+ | * Knowledge of Regular Expressions | ||
+ | * Knowledge of Shell Scripting | ||
+ | |||
+ | =====Process===== | ||
+ | * I save the relevant data to a file and then manipulate that file with a shell script. | ||
+ | |||
+ | =====Things===== | ||
+ | |||
+ | * To get the dataz | ||
+ | <cli> | ||
+ | * cat spring2012-20111103.html | grep " | ||
+ | </cli> | ||
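
The grep pattern in the command above is cut off, so what follows is only a rough sketch of how an extraction step like this might look. Everything specific in it is an assumption: the `extract_rows` helper, the course-code pattern (four capital letters followed by four digits), the tag-stripping with sed, and the tiny sample file that stands in for the real schedule.

```shell
#!/bin/bash
# Sketch only: the helper name, the course-code pattern, and the sample
# HTML are hypothetical and not taken from the real schedule file.

# extract_rows FILE: keep lines that contain something shaped like a
# course code, then replace HTML tags with spaces and tidy the spacing.
extract_rows() {
    grep -E '[A-Z]{4} ?[0-9]{4}' "$1" |
        sed -e 's/<[^>]*>/ /g' -e 's/  */ /g' -e 's/^ //' -e 's/ $//'
}

# Small stand-in for the schedule so the sketch can be run by itself.
sample=$(mktemp)
cat > "$sample" <<'EOF'
<html><body>
<tr><td>CSCS 1730</td><td>Systems Programming</td></tr>
<tr><td>MATH 1225</td><td>Statistics</td></tr>
<p>Not a course line</p>
</body></html>
EOF

extract_rows "$sample"
rm -f "$sample"
```

Redirecting the output of a function like this into a file would give the script something plain to search, which is the point of saving only the relevant data first.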
+ | |||
+ | |||
+ | * Shell Script | ||
+ | < | ||
+ | #!/bin/bash | ||
+ | |||
+ | echo -n " | ||
+ | |||
+ | read -r class | ||
+ | |||
+ | # Print each line matching the class, plus the 5 lines that follow it | ||
+ | grep -A5 -- "$class" combooutput1 | ||
+ | </ | ||
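
The lookup idea in the script above can also be sketched as a small function, which makes it easier to try on test data. This is an illustration under assumptions, not the script as originally written: `lookup_class` and the sample file are hypothetical, and it uses `grep -F` so the input is matched as a literal string instead of a regular expression.

```shell
#!/bin/bash
# Sketch only: lookup_class and the sample data are hypothetical.

# lookup_class PATTERN FILE: print each matching line and the 5 lines after it.
lookup_class() {
    local pattern=$1 file=$2
    if [ -z "$pattern" ]; then
        echo "no class given" >&2
        return 2
    fi
    # -F matches the input literally; -- stops grep from reading it as an option
    grep -F -A5 -- "$pattern" "$file"
}

# Stand-in for the saved data file: one header line, then five detail lines.
data=$(mktemp)
printf '%s\n' 'CSCS 1730' 'detail 1' 'detail 2' 'detail 3' 'detail 4' 'detail 5' \
    'MATH 1225' 'detail 1' > "$data"

lookup_class 'CSCS 1730' "$data"
rm -f "$data"
```

Quoting the variable matters here because a class name with a space in it would otherwise be split into a pattern and a file name.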
+ | |||
+ | =====Attributes===== | ||
+ | * Files and directories | ||
+ | * Commands | ||
+ | * The UNIX shell | ||
+ | * Regular Expressions | ||
+ | * Filters | ||
+ | * Scripting | ||
+ | * The UNIX Development Environment | ||
+ | =====Final Thinkings===== | ||
+ | * This was relatively easy because I worked with only the necessary data. | ||