Feb 27, 2023 03:11 AM
Can airtable take an uploaded document, such as pdf, doc, docx and then extract the text from it?
If so how do I sort that out?
I want users to upload a document rather than paste the words in the document for a smoother UX journey
Feb 27, 2023 06:28 AM - edited Feb 27, 2023 06:30 AM
Unfortunately, Airtable doesn't have native OCR capabilities. However, you could do this with the Vision API extension - https://support.airtable.com/docs/vision-extension - or other OCR APIs with Scripting.
Either will require knowledge of code. If you need support with this, you might look at hiring an Airtable consultant - https://ecosystem.airtable.com/consultants
And/or posting in the Airtable Job Board - https://community.airtable.com/t5/job-board/bd-p/jobs
You might also look at OCR integrations on Zapier
https://zapier.com/apps/airtable/integrations/mindee-ocr
https://zapier.com/apps/airtable/integrations/nanonets-ocr
Feb 27, 2023 08:50 AM
You can use a third party service that can extract text. CloudConvert can convert pdf, doc, and docx to plain text. Pdf.co can convert pdf to text as well. You can integrate with either of these services using scripting or an integration tool such as Zapier or Make.