I have a lot of data in the form of PDFs: demographic data, graduation dates, and prior work history. I would like to scrape it and have each individual data point put into an Excel worksheet as its own row. (I have posted the individual categories below; there are about 160.)
Do I have to put each individual category into the categorizer?
Hey @Knight_Rider! If you’re reporting an issue with a flow or an error in a run, please include the run link and make sure it’s shareable so we can take a look.
- Find your run link on the history page. Format: https://www.gumloop.com/pipeline?run_id={{your_run_id}}&workbook_id={{workbook_id}}
- Make it shareable by clicking “Share” → ‘Anyone with the link can view’ in the top-left corner of the flow screen.
- Provide details about the issue. More context helps us troubleshoot faster.
You can find your run history here: https://www.gumloop.com/history
Hey @Knight_Rider, since your data is in PDFs, a better setup is to pair a file reader with the Extract Data node. The reader pulls out the text, and the Extract Data node lets you define the fields you want to capture. From there, you can send the results straight into a Google Sheet.
This way, you’re not limited by categorizer constraints, and you’ll get a clean table with all your data points.
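If it helps to picture the read → extract → table shape of that setup, here is a rough Python sketch of the same idea outside Gumloop. It is not how the Extract Data node works internally: a simple regex on "Field: value" lines stands in for the extraction step, `sample_text` stands in for the file reader's output, and the three field names are hypothetical placeholders for your roughly 160 categories. The point is that the field list is defined once and every document becomes one row.

```python
import csv
import io
import re

# Hypothetical field names; the real list would hold all ~160 categories.
FIELDS = ["Name", "Graduation Date", "Prior Employer"]


def extract_fields(text, fields=FIELDS):
    """Return {field: value} for each 'Field: value' line found in text.

    Fields that do not appear in the text are left as empty strings.
    """
    row = {}
    for field in fields:
        pattern = rf"^{re.escape(field)}[ \t]*:[ \t]*(.+)$"
        match = re.search(pattern, text, re.MULTILINE | re.IGNORECASE)
        row[field] = match.group(1).strip() if match else ""
    return row


def rows_to_csv(rows, fields=FIELDS):
    """Serialize a list of row dicts to CSV text, one row per document."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fields)
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Stand-in for the text a file reader would pull out of one PDF.
sample_text = """Name: Jane Doe
Graduation Date: 2019-05-18
Prior Employer: Acme Corp
"""

rows = [extract_fields(sample_text)]
csv_text = rows_to_csv(rows)
print(csv_text)
```

The resulting CSV opens directly in Excel or imports into a Google Sheet. Real PDFs rarely have such tidy "Field: value" lines, which is the gap the Extract Data node is meant to cover.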