12/26/2023 0 Comments Web data extractor 6.0You can extract the plain html code of the element -and all its children elements- should you write "outerhtml" Then in the attribute dropdown list you can write "class" if you want to extract its class, "id" if you want to extract its id.and so on. For example if an element in the HTML code of the page is: In the Attribute field of the "Advanced Settings" of the "Extraction Preview" window, other than the attributes that are listed in the drop down list, you can specify any other attribute that the element has. Ul:eq(0) > li:eq(0) > span Attribute "Own Text" div:eq(1) > ul > li" and then for each or the item we extract the This means that the extraction starts from the ". We move on to the second result/product to do the same and the table is automatically created in the extraction preview window.įor the table, in the same notion as extracting the list, we have the Base CSS Selector, which is the root element in the HTML code, under which the data of each result/product exist. Let's say that we want to extract the title of the product, the link behind it, and the price.įor the first result we right click on the title, extract its "Text", then right click again to extract the "Href" and finally we right click on the price element to extract its "Text". In order to extract more than one piece of info for each result you would have to extract a table. You also have the option to apply Regular Expressions on the extracted text, in order to get just a part of it.Ĭhanging the selector by hand, then you can click on the "Recalculate now" button to see the extraction's result. The attribute that you are extracting is "Own Text" and it can be changed to "Title", "Href", "SourceLink", "Exists" or any other Attribute is available in the HTML code of the page for this element. This means that the extraction starts from the ".div:eq(1) > ul > li"įor each list item from the list ".div:eq(1) > ul > li" and then it gets the "h3 > a" element. The Base selector is the root element in the HTML code, under which the items of the list are listed. Click on the "Advanced Settings" icon to review the CSS selector which you can modify and make it even more efficient.Īs you can see while extracting a list, we have the Base selector and the CSS selector. Then right click on the first result and extract its Text as in the screenshot below:ĭo the same for the second result and a list of all the items' text will be automatically extracted. Having the "Extract data from Web Page" action open, hover your mouse on the page (or click on a blank area). Lets say that you wish to extract the title for all available results in a webpage. In this Window you will be able to preview the extracted data. ![]() Should you click in the webpage, then the "Live Web Helper- Extract Data From Web Page" window will pop up. The web data extraction can also take place on an actual IE if you have the "Extract data from Web Page" action open while you move your mouse pointer to the page of interest.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |