Latest Inquiries - Data Extraction Software

MBA research on social media

Submitted: 3/24/2013

Good day, I am looking to use web scraping as a data collection method for my MBA research. Basically, I want to take all the SEC-listed companies, find their LinkedIn, Twitter and Facebook pages from their official websites, and then see how many followers/likes/connections they have. I need this info for the last 3 years (say January 2011, January 2012 and January 2013). Will you be able to do this? Also, can the web scraping tool go back in time, or can it only monitor a site from the date that you activate the "robot"?

Thanking you,


Replied: 3/24/2013 7:50:50 AM
Hi there, please see the attached file showing where I will typically get the list of companies and their website addresses, and how I can then get their Facebook or Twitter pages from there. Using web scraping, I then want to see how many followers they had on Twitter 3 years ago, 2 years ago and 1 year ago (if it is possible to do time-series scraping). Thanking you in advance. Regards, Amanda

Replied: 3/24/2013 6:47:58 AM
Where is the list of all the SEC-listed companies? Can you please attach a few screenshots to point it out?

Please provide specific guidance on how to reach the data you want to extract, so I can figure out exactly what you need. Thanks.

Replied: 3/24/2013 6:26:09 PM
I don't think that VWR can extract the Twitter / Facebook account links from the websites of arbitrary companies.

A VWR project scrapes data based on fixed templates, and since those company websites all have different layouts, the same template can't be applied to them.

See the attached demo project: its last step gets the HTML source of any website. You can then try to use Content Transformation to parse the Twitter / Facebook links out of that HTML source, but I think this will be difficult for the reason mentioned above.
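
To illustrate the idea of pulling Twitter / Facebook links out of raw HTML source, here is a minimal standalone Python sketch. It is not VWR's Content Transformation and does not use any VWR API. The names `SOCIAL_HOSTS`, `SocialLinkParser` and `find_social_links` are made up for this example, and it assumes the links appear as ordinary `<a href="...">` tags, which will not hold for every company website.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse

# Assumed list of hosts to look for; adjust to the networks you care about.
SOCIAL_HOSTS = {
    "twitter.com": "twitter",
    "facebook.com": "facebook",
    "linkedin.com": "linkedin",
}


class SocialLinkParser(HTMLParser):
    """Collects hrefs of <a> tags that point at a known social-media host."""

    def __init__(self):
        super().__init__()
        self.links = {}

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if not href:
            return
        try:
            host = (urlparse(href).hostname or "").lower()
        except ValueError:
            return  # skip malformed URLs
        if host.startswith("www."):
            host = host[4:]
        network = SOCIAL_HOSTS.get(host)
        if network:
            self.links.setdefault(network, []).append(href)


def find_social_links(html_source):
    """Return a dict mapping network name to the list of matching links."""
    parser = SocialLinkParser()
    parser.feed(html_source)
    return parser.links
```

Given the HTML source of a company home page as a string, `find_social_links(html_source)` returns whatever matching links it finds, grouped by network. Links injected by JavaScript or hidden in widgets would be missed, which is part of why this approach is hard to make reliable across many different sites.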

Content Transformation