Scrape human-readable content of a page element

AangelHD · May 12, 2025, 5:02pm

Can you please add the ability to set the Attribute Name and Attribute Value to the “scrape” option of the web scraper node?

I still want the human-readable version of the page rather than raw HTML, but I want only the content of a single element (id="main-content").

Returning the whole page means returning the menu on the left, the blocks in the right sidebar and the footer. All of these pollute the response and make it harder for me to pick out product names that are referenced only in the main content, especially since they are all listed in the left column.
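To make the request concrete, here is a rough sketch of the behavior I mean, written in plain Python outside of Gumloop. It is not how the web scraper node works internally; the names `ElementTextExtractor` and `scrape_element_text` are made up for illustration, and it assumes reasonably well-formed HTML in which non-void tags are explicitly closed.

```python
from html.parser import HTMLParser

# Void elements never get a closing tag, so they must not affect nesting depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}
# Content of these tags is not human-readable text.
SKIP_TAGS = {"script", "style"}


class ElementTextExtractor(HTMLParser):
    """Collects text found inside elements whose attribute matches a value."""

    def __init__(self, attr_name, attr_value):
        super().__init__(convert_charrefs=True)
        self.attr_name = attr_name
        self.attr_value = attr_value
        self.depth = 0       # 0 means we are outside any matched element
        self.skip_depth = 0  # > 0 means we are inside <script> or <style>
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        if self.depth > 0:
            # Already inside the matched element: track nesting.
            self.depth += 1
            if tag in SKIP_TAGS:
                self.skip_depth += 1
        elif dict(attrs).get(self.attr_name) == self.attr_value:
            # Entering the element we are looking for.
            self.depth = 1

    def handle_endtag(self, tag):
        if tag in VOID_TAGS or self.depth == 0:
            return
        if tag in SKIP_TAGS and self.skip_depth > 0:
            self.skip_depth -= 1
        self.depth -= 1

    def handle_data(self, data):
        if self.depth > 0 and self.skip_depth == 0:
            text = " ".join(data.split())  # collapse runs of whitespace
            if text:
                self.chunks.append(text)


def scrape_element_text(html, attr_name, attr_value):
    """Return the readable text of the matching element(s), one chunk per line."""
    parser = ElementTextExtractor(attr_name, attr_value)
    parser.feed(html)
    parser.close()
    return "\n".join(parser.chunks)
```

Calling `scrape_element_text(page_html, "id", "main-content")` would then return only the text inside the main content element, leaving out the menu, sidebar and footer. A real implementation would need to cope with unclosed tags and similar quirks, which this sketch does not.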

For reference, see this page:

Wasay-Gumloop · May 13, 2025, 1:50am

Gotcha. Yeah, we can potentially add that; we've been looking to upgrade our website scraper nodes.

AangelHD · May 13, 2025, 3:27pm

Great, thx for considering it.
