Latest Inquiries - Data Extraction Software

GK_schedule

Submitted: 8/23/2013

I would like to scrape the Global Knowledge website for courses including, the date, Start time, title, Course code, Duration, hours/day, & price for all regions. From the main page above, using the Courses drop dow box, select Browse Catalog.

Create a list group from

http://www.globalknowledge.com/training/generic.asp?pageid=9&country=United+States

Create a list group from

http://www.globalknowledge.com/training/category.asp?pageid=9&catid=514&country=United+States

As an example,I chose

DB2 10 for z/OS Database Administration Workshop Part 1 (CV831G)

http://www.globalknowledge.com/training/course.asp?pageid=9&courseid=18281&catid=449&country=United+States

Entities needed here:

Title, Main Course Code(IBM Course Code: CV831), Course Code (9016), Price(3595), Duration (2 day course)

Next Click on View Schedule (http://www.globalknowledge.com/training/dates.asp?pageid=9&catid=449&courseid=18280&country=United+States)

Must Change Preferred Location to all regions via the Change Link (http://www.globalknowledge.com/training/selectloc.asp?pageid=9&country=United+States)

Click SUBMIT button

Create a list of the Elements in the table.

http://www.globalknowledge.com/training/dates.asp?pageid=9&catid=449&courseid=18280&country=United+States

Would be awesome if the date were seperated from the time, but certainly not a deal breaker. If no classes are scheduled, Capture the "By Request" tag.

 As it stands right now, I am getting hung up on the last two steps. On the tables list, I have yet to figure out how to pull these into the result set (in usable fashion at all) . The other item is that I have to change the regions to all and I can't quite figure that out either.

As a test project, i certainly hope this isn't too difficult. I've got a lot to learn, but webripper could just make my life 100% simpler. Thanks for the time and effort

 

 

Replied: 8/23/2013 7:43:47 PM
Please check the attached demo project.

You need to place the project file in default projects folder, then run this project in VWR program.
Globalknowledge.rip