Webscraping multiple Trustpilot pages for reviews

danieltiwatts · February 17, 2025, 12:51pm

Having no issue scraping the first page of a company’s Trustpilot, but can’t workout how to get Gumloop to click onto the next page and scrape that page too. The goal is to to have Gumloop scrape, say the first 10 pages, and then feed that into an LLM for cleaning and categorisation. Here’s the error I’m getting which is not solved by the offered solution: "Web Agent Scraper Failed! The element to be clicked is covered by another element.

Consider adding a ‘wait’ action before this click, or add a ‘screenshot’ action to check if there is a popup or overlay blocking the element."

danieltiwatts · February 17, 2025, 1:02pm

Potentially an easier solution is to update the trustpilot URL to include ‘?page=2’ once the first page has been scraped, then ?page=3 and so on. Running into an issue where I can’t work out how to do this. Advice massively appreciated!

danieltiwatts · February 17, 2025, 3:01pm

Have added an input flow which appends my specified ?=page2 input with the URL from the scraper, which works, but then I need to repeat the steps for as many pages as I want to scrape. If there’s a more elegant solution with fewer steps then I’m all ears.

Wasay-Gumloop · February 17, 2025, 3:57pm

Hey! You can use a Split Text node along with a Combine Text node to dynamically append page numbers to the URL. Here’s an example: https://www.gumloop.com/pipeline?workbook_id=weAk8EFFvjuVJ3M1oWEauS&run_id=3TkgaMoxWqUQTSKAKQxiMC

Let me know if this is what you were looking for.

danieltiwatts · February 17, 2025, 6:07pm

You’re a champ - thank you

danieltiwatts · February 17, 2025, 6:54pm

So that creates a nice output of URLs with sequentially appended numbers at the end, but as far as I can see I can only feed one URL into the scraper. How’d I use this suggested method to scrape all of the required pages?

Wasay-Gumloop · February 17, 2025, 6:59pm

If you have a flow that works well for a single URL, you can use that as a subflow to loop over a list of URLs.

Subflow tutorial: https://vimeo.com/1052111235/cb7e3a446b
Subflow Docs: https://docs.gumloop.com/core-concepts/subflows

danieltiwatts · February 18, 2025, 9:45am

Appreciate it, Wasay

system · February 22, 2025, 9:46am

This topic was solved and automatically closed 4 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Scraping Trustpilot reviews Get Help Website-Scraper	6	63	April 14, 2025
How to scrape multipage website? General Question Website-Scraper	3	49	March 14, 2025
Writing Product Reviews — Website Scraping Error Get Help Website-Scraper , Drive-File-Reader	7	84	February 4, 2025
Amazon Content Review Get Help Website-Scraper , Google-Sheets-Reader	5	33	May 20, 2025
Facebook Scraper will not work for multiple urls General Question Website-Scraper	6	26	March 11, 2025

Webscraping multiple Trustpilot pages for reviews

Related topics