OCR & Text Extraction

Extracting text from images, scanned documents, and photos using optical character recognition.

Last updated: April 14, 2026

ocrtext extractionimagesscanningphotosrecognition

What is OCR?

OCR (Optical Character Recognition) is the technology that extracts readable text from images. Your AI employee uses OCR to read text from photos, screenshots, scanned documents, business cards, receipts, whiteboards, and any other image containing text. The extracted text can then be edited, searched, stored in your knowledge base, or used as input for other tasks.

Using OCR

To extract text from an image, simply send the image to your AI employee and ask it to read the text. You can send photos taken with your phone, screenshots from your computer, or scanned document images. The AI processes the image and returns the extracted text in a clean, editable format. It handles multiple languages, various fonts, and both printed and handwritten text (though handwriting accuracy varies).

Extracting text from a photo

Read text from an image.

You say:

[Sends photo of a business card] Save this contact information.

Sarudo responds:

I've extracted the text from the business card: Jennifer Park, VP of Marketing, DataStream Inc. Email: jennifer.park@datastream.io. Phone: (650) 555-0173. Address: 500 Tech Drive, Suite 400, Palo Alto, CA 94301. I've created a new contact record in your CRM with these details. Would you like to add any notes about where you met Jennifer?

Scanned Documents

For scanned PDFs (PDFs that are essentially images of pages rather than digital text), the AI applies OCR to convert them into searchable, editable text. This is particularly useful for digitizing old contracts, letters, or any paper documents. After OCR processing, the text can be ingested into your knowledge base, making the information from physical documents as searchable as digital content.

For the best OCR results, scan documents at 300 DPI or higher. Ensure good lighting and a flat surface when photographing documents with your phone.

Accuracy & Tips

OCR accuracy depends on image quality, font clarity, and document layout. Clean, well-lit images of printed text typically achieve 98% or higher accuracy. Factors that can reduce accuracy include low resolution, poor lighting, unusual fonts, handwriting, and complex layouts with overlapping text and images. For critical documents, always review the extracted text for accuracy. The AI will flag sections where it had low confidence in the extraction.

Document Ingestion

Uploading PDFs, DOCX files, spreadsheets, and presentations for automatic chunking, embedding, and knowledge extraction.

PDF Operations

Merge, split, compress, encrypt, and decrypt PDF files using your AI employee.

File Sharing

How to send files to your AI employee and receive generated files back, including supported formats and download links.