Rpa29
(Rpa)
December 30, 2023, 6:36am
1
Hi Everyone,
I want to extract data from .tiff file, how can i do that? Should i use get ocr text or first should i convert to pdf file ?
1 Like
lrtetala
(Lakshman Reddy)
December 30, 2023, 6:41am
2
Hi @Rpa29
Welcome to Community!!
Please check on below thread
I have a Tif file i want read tif and find the date tile and dates stored in excel file it’s possible to do that
You can use UI automation to extract text from each page in your Tiff file.
Open the Tiff file in Windows Photo Viewer, or any other photo viewer that works, use the Screen Scraping wizard to get the text using OCR. Then go to next page using a Click activity and repeat the Screen Scraping.
Of course, you can use a loop activity to do this for each page in your Tiff file.
Or you can use an external application to split the Tiff file, this one for example - http://tiffsplitter.codeplex.com/ .
Hi,
We can achieve it using tiff library such as Bitmiracle.TiffLib.net which is used in UiPath.PDF.Activities package etc.
[image]
imgTiff = BitMiracle.LibTiff.Classic.Tiff.Open("sample.tif","r")
Then
fv_width = imgtiff.GetField(BitMiracle.LibTiff.Classic.TiffTag.IMAGEWIDTH)
CInt(fv_width.First().ToString()) returns image width
Sample20230417-1L.zip (58.7 KB)
Regards,
Hope this helps!!
Rpa29
(Rpa)
January 1, 2024, 12:42am
3
Hi,
Thank you for the links you shared, I examined them all. With the Taxonomy manager and digitize document activity, I was able to get only the data in the first file from the invoice files in the link below. I want to extract the information in the tiff files in the link as a datatable. How can I do that?
Link : İnvoice - Google Drive