Web scraping node - Return HTML

Shrikar · March 1, 2025, 8:23am

I am able to do it in python but I don’t understand why we can’t do it in the scrape source https://www.gumloop.com/pipeline?workbook_id=wj9uLit6jYjgPYwK8XaAU4. Ideally we should be able to traverse a dom elements without spending anything on the llm credits. I also think this might be a feature worth adding don’t run the LLM content on the whole page source specially when we have a lot of page with heavy html elements(wasting tokens). Let the end user define the scope of the html element and then run llm on top of those for extracting data

Topic		Replies	Views
Smart Extraction for scraping Feature Request Extract-Data , Website-Scraper	1	83	March 5, 2025
Data extract web scrap Bug Extract-Data	4	94	February 9, 2025
Issue with Web Scraping Product Hunt Visit Button URLs Get Help Website-Scraper , Web-Agent	4	135	March 4, 2025
Not extracting the actual social media link that is on the webpage Get Help Extract-Data	8	120	February 21, 2025
Booking Agent Node Get Help Ask-AI , Extract-Data , Website-Scraper , Web-Agent	3	85	March 16, 2025

Web scraping node - Return HTML

Related topics