User Tools

Site Tools


user:vcordes1:portfolio:cla

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Next revision
Previous revision
user:vcordes1:portfolio:cla [2011/12/08 21:55] – created vcordes1user:vcordes1:portfolio:cla [2011/12/15 17:33] (current) – [Attributes] vcordes1
Line 1: Line 1:
 +=====Purpose=====
 +The purpose here is to mine the 2012 spring class schedule in html format and extract specific classes
  
 +=====Necessities=====
 +  * Knowledge of Regular Expressions
 +  * Knowledge of Shell Scripting
 +
 +=====Process=====
 +  * With this I will be saving the relevant data to a file and manipulating the file via a shell script.
 +
 +=====Things=====
 +
 +  * To get the dataz
 +<cli>
 +  * cat spring2012-20111103.html | grep "ddtitle" | sed 's/^<TH CLASS="ddtitle".*crn_in=.....">//g' | sed 's/<\/A.*$//g' | sed 's/^\(.*\) - \([0-9][0-9][0-9][0-9][0-9]\) - \(.*\) - \([0-9][0-9][0-9]\)$/\1: \3-\4:\2/g'
 +</cli>
 +
 +
 +  * Shell Script
 +<code>
 +#!/bin/bash
 +
 +echo -n "please enter a class: "
 +
 +read class
 +
 +cat combooutput1 | grep -A5 $class
 +</code>
 +
 +=====Attributes=====
 +  * Files and directories
 +  * Commands
 +  * The UNIX shell
 +  * Regular Expressions
 +  * Filters
 +  * Scripting
 +  * The UNIX development Environment
 +=====Final Thinkings=====
 +  * This was relatively easy working with only the necessary data.