Hi Amit,
On further analysis, we are able to OCR the PDF document using the Azure Computer Vision API. To do this, we extract the images from each page of the document, get the OCR result for each image from the Azure Computer Vision API, and then draw that result onto the page graphics. We have created a simple POC sample that demonstrates this approach. Please find the sample at the download link below:
We have logged a feature request for "Add Support to include other engines instead of Tesseract in OCRProcessor". With this implementation, we will internally extract the images from the PDF document and return them to you. You can then perform OCR on those images using the Azure Computer Vision API and pass the OCR results back to us. We will draw the results onto the page graphics internally and return the PDF document with selectable text. We plan to include this support in our upcoming Volume 2 main release, which is tentatively expected to be available in June 2022.
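To illustrate the "draw the OCR result onto the page graphics" step described above, here is a minimal sketch in Python. It is not the POC sample and not our internal implementation. It assumes the OCR result for one image follows the shape returned by the Azure Computer Vision Read API (a page `width` and `height` in pixels, plus `lines` that each carry `text` and an 8-number `boundingBox`), and that both the image and the page use a top-left origin. The names `TextBox` and `map_ocr_lines_to_page` are hypothetical and used only for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class TextBox:
    """One recognized line, positioned in page coordinates."""
    text: str
    x: float
    y: float
    width: float
    height: float


def map_ocr_lines_to_page(
    read_result: Dict,
    image_rect: Tuple[float, float, float, float],
) -> List[TextBox]:
    """Scale OCR line boxes from image pixels to page coordinates.

    read_result: assumed shape {"width", "height", "lines": [...]}, where
        each line has "text" and "boundingBox" = [x1, y1, ..., x4, y4].
    image_rect: (x, y, width, height) of the image as placed on the page,
        assuming a top-left origin.
    """
    img_x, img_y, img_w, img_h = image_rect
    scale_x = img_w / read_result["width"]
    scale_y = img_h / read_result["height"]

    boxes = []
    for line in read_result.get("lines", []):
        xs = line["boundingBox"][0::2]  # x coordinates of the 4 corners
        ys = line["boundingBox"][1::2]  # y coordinates of the 4 corners
        left, top = min(xs), min(ys)
        boxes.append(
            TextBox(
                text=line["text"],
                x=img_x + left * scale_x,
                y=img_y + top * scale_y,
                width=(max(xs) - left) * scale_x,
                height=(max(ys) - top) * scale_y,
            )
        )
    return boxes
```

Each returned box gives the text and the rectangle in which it would be drawn over the image, which is what makes the text selectable in the output PDF. Rotated text is reduced to its axis-aligned bounding rectangle in this sketch.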
You can track the implementation of this feature through the feedback link below.
Regards,
Gowthamraj K