PDFsharp & MigraDoc Foundation

PDFsharp - A .NET library for processing PDF & MigraDoc Foundation - Creating documents on the fly
It is currently Fri Apr 26, 2024 12:51 pm

All times are UTC


Forum rules


Please read this before posting on this forum: Forum Rules



Post new topic Reply to topic  [ 4 posts ] 
Author Message
PostPosted: Wed Sep 30, 2015 2:17 pm 
Offline

Joined: Tue Sep 29, 2015 3:10 pm
Posts: 2
I am working on a project to Import Text and Vector paths from PDFs. Does this product supports that functionality? Do you have a code sample for how to extract the text and path data rotated and transformed to the coordinate system they are viewed at in a PDF viewer along with the layer they come from?


Top
 Profile  
Reply with quote  
PostPosted: Wed Sep 30, 2015 2:49 pm 
Offline
PDFsharp Guru
User avatar

Joined: Mon Oct 16, 2006 8:16 am
Posts: 3096
Location: Cologne, Germany
Hi!

There is a NuGet package I haven't tried yet:
https://www.nuget.org/packages/PdfTextract/

From PDFsharp FAQ:
"Can I use PDFsharp to extract text from PDF?
This can be done at a low level. You can get at the characters in the order they are drawn - and most applications draw them from top-left to bottom-right. There are no high-level functions that return words, paragraphs, or whole pages."
http://www.pdfsharp.net/wiki/PDFsharpFA ... rom_PDF_13

_________________
Regards
Thomas Hoevel
PDFsharp Team


Top
 Profile  
Reply with quote  
PostPosted: Wed Sep 30, 2015 10:39 pm 
Offline

Joined: Tue Sep 29, 2015 3:10 pm
Posts: 2
How about path coordinates, transformations, rotations, and layers?


Top
 Profile  
Reply with quote  
PostPosted: Thu Oct 01, 2015 7:43 am 
Offline
PDFsharp Guru
User avatar

Joined: Mon Oct 16, 2006 8:16 am
Posts: 3096
Location: Cologne, Germany
PDFsharp does not parse that stuff, so it's left as an exercise (not a simple one).

I don't know which library can parse that stuff.
Hint: Libraries that render PDF files must be able to parse such details. PDFsharp cannot render PDF and does not delve into such details.

_________________
Regards
Thomas Hoevel
PDFsharp Team


Top
 Profile  
Reply with quote  
Display posts from previous:  Sort by  
Post new topic Reply to topic  [ 4 posts ] 

All times are UTC


Who is online

Users browsing this forum: Google [Bot] and 365 guests


You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot post attachments in this forum

Search for:
Jump to:  
Privacy Policy, Data Protection Declaration, Impressum
Powered by phpBB® Forum Software © phpBB Group