ok, so I found a way how to get the mapping table - traverse through document elements "/Resources" -> "/Font" (PdfDictionary objects). This way I should get all used fonts, and they should have their own element "/ToUnicode" where a mapping table is defined. In my PDF document I found 4 fonts, but only one had "/ToUnicode" element, so I have only one mapping table.
Back to my example: "\03\0U\0\u0081\0I" -> " Prüf"
In my mapping table I was able to find 2 values
<0003> <0020> (meaning ' ' in Unicode)
<0081> <00FC> (meaning 'ü' in Unicode)
So what about the remaining values, for example "\0U"? How to map them when they are not in the mapping table? Is this really the right way to go, or is there some more convenient API to get the correct mapping table? I still feel that this is just a workaround and not the correct way how to get to the mapping table. I am still missing some important information.
TH-Soft wrote:
Here someone solved the task without sharing any code:
viewtopic.php?p=10564#p10564I'm afraid I cannot point you to sample code.