PDFsharp & MigraDoc Foundation

All times are UTC

From Stream.value to

Moderator: Stefan Lange

ag3

Post subject: From Stream.value to

Posted: Fri Feb 10, 2012 11:42 am

Joined: Fri Feb 10, 2012 11:31 am
Posts: 1

Hi,

I am trying to extract text from my PDF.

Currently I do this:

Code:

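// Requires: using PdfSharp.Pdf; using PdfSharp.Pdf.IO;
PdfDocument document = PdfReader.Open(this.filePath);
foreach (PdfPage page in document.Pages)
{
    for (int index = 0; index < page.Contents.Elements.Count; index++)
    {
        // Each element of page.Contents is one content stream of the page.
        PdfDictionary.PdfStream stream = page.Contents.Elements.GetDictionary(index).Stream;
        String res = "";
        // Map every byte of the stream to one char (no real decoding happens here).
        foreach (byte cd in stream.Value)
            res += (char)cd;
        //TODO: res encoding invalid
    }
}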

My variable res contains some readable text, but also text that appears to be encoded.
I tried decoding it with Unicode and ISO encodings, without success.

Quote:

res contains:
BT
/R7 9.96 Tf
0.999386 0 0 1 278.4 761.6 Tm
( )Tj
-221.896 -12.12 Td
(\n \r)Tj
227.54 -675.96 Td
( )Tj
ET

I am looking for something like (Hello World)Tj.

Maybe the text is encoded through the font?

Could you give me some hints on how to decode the text?

Thanks

Regards,
alex

Thomas Hoevel

Post subject: Re: From Stream.value to

Posted: Mon Feb 13, 2012 9:38 am

PDFsharp Guru

Joined: Mon Oct 16, 2006 8:16 am
Posts: 3101
Location: Cologne, Germany

Hi!

Not my area of expertise.

Maybe this thread will help:
viewtopic.php?p=4010#p4010
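
For what it is worth, here is a minimal sketch of how the string operands of Tj could be pulled out of a content stream once it has been turned into a string as in the loop above. ContentStreamText and ExtractTjStrings are hypothetical names, not part of PDFsharp. The sketch assumes the stream is already unfiltered, and it only handles literal strings in parentheses followed by Tj. Hex strings (<...>) and TJ arrays are ignored. The returned characters are raw character codes: whether they are readable depends on the encoding of the font selected with Tf, so this may well not be enough for your file.

```csharp
using System;
using System.Collections.Generic;
using System.Text;

// Hypothetical helper for illustration only; not part of PDFsharp.
public static class ContentStreamText
{
    // Returns the literal strings "(...)" that are directly followed by a Tj operator.
    public static List<string> ExtractTjStrings(string content)
    {
        var results = new List<string>();
        int i = 0;
        while (i < content.Length)
        {
            if (content[i] != '(') { i++; continue; }

            var sb = new StringBuilder();
            int depth = 1; // PDF literal strings may contain balanced parentheses
            i++;           // skip the opening '('
            while (i < content.Length && depth > 0)
            {
                char c = content[i++];
                if (c == '\\' && i < content.Length)
                {
                    char e = content[i++];
                    switch (e)
                    {
                        case 'n': sb.Append('\n'); break;
                        case 'r': sb.Append('\r'); break;
                        case 't': sb.Append('\t'); break;
                        case 'b': sb.Append('\b'); break;
                        case 'f': sb.Append('\f'); break;
                        case '\n': break; // backslash + end of line: line continuation
                        case '\r':
                            if (i < content.Length && content[i] == '\n') i++;
                            break;
                        default:
                            if (e >= '0' && e <= '7')
                            {
                                // Octal escape with one to three digits, e.g. \101
                                int code = e - '0';
                                for (int k = 0; k < 2 && i < content.Length
                                     && content[i] >= '0' && content[i] <= '7'; k++)
                                    code = code * 8 + (content[i++] - '0');
                                sb.Append((char)(code & 0xFF));
                            }
                            else
                            {
                                sb.Append(e); // covers \( \) and \\
                            }
                            break;
                    }
                }
                else if (c == '(') { depth++; sb.Append(c); }
                else if (c == ')') { depth--; if (depth > 0) sb.Append(c); }
                else sb.Append(c);
            }

            // Keep the string only if the next token is exactly "Tj".
            int j = i;
            while (j < content.Length && char.IsWhiteSpace(content[j])) j++;
            bool isTj = depth == 0
                && j + 1 < content.Length
                && content[j] == 'T' && content[j + 1] == 'j'
                && (j + 2 == content.Length || char.IsWhiteSpace(content[j + 2]));
            if (isTj) results.Add(sb.ToString());
        }
        return results;
    }
}
```

Called on the res string from the first post, ContentStreamText.ExtractTjStrings(res) should give back only the three whitespace strings, which suggests that this particular stream really contains no visible text and the text you are after may sit in another stream. PDFsharp also has its own content stream parser (ContentReader in the PdfSharp.Pdf.Content namespace), which is probably a more robust starting point than a hand-written scanner like this one.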

_________________
Regards
Thomas Hoevel
PDFsharp Team
