PDFsharp & MigraDoc Foundation
https://forum.pdfsharp.net/

Getting the individual objects from an existing PDF?
https://forum.pdfsharp.net/viewtopic.php?f=2&t=1426
Page 1 of 1

Author:  Sunset.Towers [ Thu Nov 18, 2010 1:22 am ]
Post subject:  Getting the individual objects from an existing PDF?

I need to process the objects in a PDF page so I can get access to their printed location. I did this several years ago, but I have lost the original coding I had do to this.

What I did was I figured out a location range for each place where I expected certain text objects to be printed. Searched through the objects in the PDF and only extracted the text that was being printed at those locations. Parsing the text using an entire page extractor is not an option as it is impossible to determine what text goes to what original place in the PDF. For example there are places where and X is used to denote an option. When a different option is Xed it changes the output of the extracted text making it impossible to determine with any certainty where other parts are located.

I'm almost positively certain I used PDFSharp to do this work with. If it isn't possible though, could someone please point me in the right direction.

Author:  Remis [ Wed Nov 24, 2010 10:45 pm ]
Post subject:  Re: Getting the individual objects from an existing PDF?

>> need to process the objects in a PDF page
PDF is write-only format.

>>I figured out a location range for each place where I expected certain text objects to be printed
Huh... (I tried it at home with no luck)

Page 1 of 1 All times are UTC
Powered by phpBB® Forum Software © phpBB Group
https://www.phpbb.com/