| PDFsharp & MigraDoc Forum https://forum.pdfsharp.net/ |
|
| working with existing PDF - simple manipulation https://forum.pdfsharp.net/viewtopic.php?f=2&t=2954 |
Page 1 of 1 |
| Author: | macbaine [ Thu Oct 16, 2014 1:54 am ] |
| Post subject: | working with existing PDF - simple manipulation |
New to this API - haven't seen much documentation or other topics on basic manipulation of an existing file, with respect to the multitude of classes/objects provided. I need to simply load an existing PDF (figured that out at least), and search for a word(s)... eventually returning a sentence or sentences containing that word. I've played around with the different classes I suspect are involved.. Pdf.Content.Objects, but haven't gotten anywhere meaningful. If I can achieve the above, being able to read a stream of text from a PDF and work with it in my own program, I would be happy. Any pointers appreciated Thanks |
|
| Author: | Thomas Hoevel [ Thu Oct 16, 2014 7:33 am ] |
| Post subject: | Re: working with existing PDF - simple manipulation |
PDFsharp was not designed to extract text. Related posts you can look at: viewtopic.php?p=1603#p1603 viewtopic.php?p=4010#p4010 |
|
| Author: | kensands [ Thu Oct 16, 2014 2:56 pm ] |
| Post subject: | Re: working with existing PDF - simple manipulation |
I use the pdfium library for text extraction, it takes some time and effort to setup as you'll get a c++ dll that needs some effort to use from .net code but once you have the interface in place it works very well. pdfsharp is there for creating/editing/splitting/saving but not viewing or unfortunately text extraction (the raw data is there but you need to be able to render the document in order to work out the positions of characters and then extract them in the correct order). |
|
| Author: | macbaine [ Thu Oct 16, 2014 3:10 pm ] |
| Post subject: | Re: working with existing PDF - simple manipulation |
Ok.. thanks for the replies. Anybody aware of any other open-source libraries with VB.net compatibility? I will look into pdfium... I have found a couple other libraries that require licenses.. if I can avoid that I would like to Thanks |
|
| Page 1 of 1 | All times are UTC |
| Powered by phpBB® Forum Software © phpBB Group https://www.phpbb.com/ |
|