Latest Inquiries - Data Extraction Software

Insert URLs in the final sheet

Submitted: 2/24/2016


I would like my program to include the URLs I'm scraping the data from in my final sheet where I have all my data. In other words, I would like to have the URLs mentioned in my final data sheet each time the browser changed the url it scrapes the datat from. How can I do that?

Also my program puts some data in columns instead of putting them in one row. I want to extract data for several universities, and have all the data for one same university in one same row. However, you will see that the "World rank", and the "Number of Staff Nobel Laureates and Fields Medalists', and other fields are displayed below or above the universities columns. How can I do to have all the information for one same university in one same row?

Thank you very much for your help.


Replied: 2/25/2016 10:04:10 AM
Thank you so much for your help, it's very nice of you. However, I didn't manage to open the file you sent me. Each time I try, I have a message saying "This project has been created with a later version of Visual Web Ripper. You must upgrade to the latest version of Visual Web Ripper to ensure the project will work correctly and does not get corrupted. Do you want to upgrade Visual Web Ripper now?" I don't want to upgrade my version of Visual Web Ripper, but if I click on 'No", I just have the starting page of the Visual Web Ripper, but the program is not there neither the page of the website I'm scraping the data from.

Could you please send the program you have changed in another version of Visual Web Ripper? Mine is the version 2. Or maybe tell me how I could read and register the program you have sent me?

Thank you very much.


Replied: 2/25/2016 3:21:46 AM

Multiple parallel page area template couldn't be combined in one main table, however, you did wrong design to iterate through rows in table, attached new project that I've put all elements in one page area template ,therefore, it will output all columns in one main table, moreover, I created 'URL' page attribute element inside 'University Name' link template where to capture current url .

Replied: 2/26/2016 1:55:54 AM

Unfortunately, we don't provide a fixed project on old specific version , we keep to fix any issue at latest version of VWR, that will be more better since latest version has many enhancement, you should upgrade to the latest version to open the project that I fixed.

If you prefer to stay in old version, I will explain exactly what I fixed, firstly, you only need to create one page area template iterating through all rows, then creating certain elements through each column, you can refer to the below topic link where teach you how to select first row (i.e, <TR> tag) then creating list of rows for page area template:

Regarding to capture current page url, you simply create one element  then change to 'PageAttribute' type on edit, then selecting 'URL' at right hand side 'Page attribute' dropdown.