Scrape human-readable content of a page element

Can you please add the ability to set the Attribute Name and Attribute Value to the “scrape” option of the web scraper node?

I still want the human-readable version of the page rather than raw HTML but I want only the content of an element (id=“main-content”).

Returning the whole page means returning the menu on the left, the blocks in the right sidebar and the footer. All these pollute the response and make it harder for me to distinguish product names that are referenced only in the main content—especially since they are all in the left column.

For reference, see this page:

Gotcha. Yeah we can potentially add that, we’ve been looking to upgrade our website scraper nodes.

Great, thx for considering it.

This topic was automatically closed 4 days after the last reply. New replies are no longer allowed.