Paginated listing pages data extraction issue | Google sheet Writer not connecting

I am getting this issue of ‘Types do not match! List of List =/ List’ between the extract data 2 node and the google sheet writer node.

https://www.gumloop.com/pipeline?workbook_id=2k8nbc35bSo7s1n8QdMvzw

I have tried using join list, create list nodes too but to no success.

This workflow is around scraping doctor details from a listing page that contains multiple pages. The workflow should continue through all paginated listing pages and store the extracted data in a Google Sheet.

have a look (had the same problem) - https://www.gumloop.com/pipeline?workbook_id=49v7ebRut93oPdDTbQbX9Y

Hi Donna, Access please.

Done. Tell me if this is what you were looking for

Thanks Donna but unfortunately I doesn’t help.

For my current scraping flow, the issue is more about handling List of Lists data structure rather than text manipulation.

Can you check the below index once? Maybe you’ll find something.

I want to scrape doctor details from a listing page that contains multiple pages. The goal is to extract doctor names, specialties, addresses, phone numbers, and emails from each doctor’s profile. The workflow should continue through all paginated listing pages and store the extracted data in a Google Sheet. Below is the format:

Doctor Name, Speciality, Address, Phone Number, Email

Until the index runs out of pages.

Hey @Manishdwivedi - You can use Flatten List of List nodes here but also the first website scraper node running in loop mode seems unnecessary, if you disable that you should be able to simplify the flow.

Would also recommend looking into subflows to further simplify the flow: https://vimeo.com/1052111235/cb7e3a446b

1 Like

This topic was solved and automatically closed 4 days after the last reply. New replies are no longer allowed.