Regex doubt

Hello ,everyone!

I’m trying to use a regex plugin in this following situation:
I’m inputting a PDF file in my workflow, so, the text file output create a big file like a unique string (my PDF file is a invoice).
My problem is: i need specific fields in this “big string” like INVOICE NUMBER, VALUES, COMPANY NAME, etc…

The question: is possible and how to create a fields using regex evalution, ou the better solution is use a script?


Hello @douglasf ,

The technology you are looking for is OCR, to convert a table from PDF to a table in Excel.

While the “PDF to Text” step converts your PDF to a single string, it can be very difficult to extract the relevant information that you need. This approach is not recommended for dealing with PDFs where table data extraction is involved.