PDFsharp & MigraDoc Foundation • View topic - Corrupted (blank page) when using PDFs from SQL 2008 Reports

View unanswered posts | View active topics

Board index » PDFsharp & MigraDoc » Bug Reports

All times are UTC

Forum rules

Please read this before posting on this forum: Forum Rules

Corrupted (blank page) when using PDFs from SQL 2008 Reports

Moderator: Stefan Lange

Page 1 of 1

[ 12 posts ]

Print view

Previous topic | Next topic

Author

Message

simbolo

Post subject: Corrupted (blank page) when using PDFs from SQL 2008 Reports

Posted: Wed Jan 14, 2009 2:23 am

Joined: Wed Jan 14, 2009 2:10 am
Posts: 7
Location: London, United Kingdom

Scenario:

A Microsoft SQL 2008 Database with Report Services (version 10) generates a PDF report. The report is 1 page long, some basic text, nothing fancy.

Using .Net code, the PDF is opened and saved, the result is a blank PDF (blank as viewed in Adobe PDF Reader 8, i.e., the content has been lost when the PDF goes through PDFSharp).

If I change the document title, default zoom etc, these settings are saved, but the PDF is still blank.

The same report, as generated per the previous version of SQL report services (version 9) runs through PDFSharp normally.

If I try to merge a PDF into the background of that report, like a PDF with a formatted header/footer using the code...

Code:

Dim gfx As XGraphics = XGraphics.FromPdfPage(FirstPage, XGraphicsPdfPageOptions.Prepend)

...an exception is encountered when the above line of code is called. (This error doesn't occour on v9 PDFs, only since v10 was introduced)...

Code:

PdfSharp.SharpZipLib.SharpZipBaseException: Header checksum illegal
  at PdfSharp.SharpZipLib.Zip.Compression.Inflater.DecodeHeader() in PDFsharp\code\PdfSharp\PdfSharp.SharpZipLib\Zip\Compression\Inflater.cs:line 247
  at PdfSharp.SharpZipLib.Zip.Compression.Inflater.Decode() in PDFsharp\code\PdfSharp\PdfSharp.SharpZipLib\Zip\Compression\Inflater.cs:line 427
  at PdfSharp.SharpZipLib.Zip.Compression.Inflater.Inflate(Byte[] buf, Int32 offset, Int32 len) in PDFsharp\code\PdfSharp\PdfSharp.SharpZipLib\Zip\Compression\Inflater.cs:line 711
  at PdfSharp.SharpZipLib.Zip.Compression.Streams.InflaterInputStream.Read(Byte[] b, Int32 off, Int32 len) in PDFsharp\code\PdfSharp\PdfSharp.SharpZipLib\Zip\Compression\Streams\InflaterInputStream.cs:line 409

Microsoft must have done something that upsets PDFSharp with their new rendering engine for PDFs, at this stage I am unsure which side the bug is in.

The PDF version of the report services pdf is 1.3 (Acrobat 4x).

Any suggestions?
Thanks.

Top

Thomas Hoevel

Post subject:

Posted: Wed Jan 14, 2009 10:32 am

PDFsharp Guru

Joined: Mon Oct 16, 2006 8:16 am
Posts: 3111
Location: Cologne, Germany

Hi!

We need a PDF file to replicate the problem.

_________________
Regards
Thomas Hoevel
PDFsharp Team

Top

simbolo

Post subject:

Posted: Wed Jan 14, 2009 2:17 pm

Joined: Wed Jan 14, 2009 2:10 am
Posts: 7
Location: London, United Kingdom

Yes that would be handy, here is the PDF, it's directly out of Report Services 10.

http://cid-6e64738c64f8f959.skydrive.live.com/self.aspx/Public/101O.pdf

To get PDFSharp to make it into a blank PDF...

Code:

Imports PdfSharp
Imports PdfSharp.Pdf
Imports PdfSharp.Pdf.IO
Imports PdfSharp.Drawing

Dim cStatement As PdfDocument = PdfReader.Open(FilePathToPDF)
cStatement.Info.Subject = "An example change."
cStatement.Save(FilePathToPDF)

The subject text is preserved in the blank pdf.

To create the checksum exception...

Code:

Dim lMerge As XPdfForm = XPdfForm.FromFile(OverlayPDFFilePath)
Dim FirstPage As PdfPage = cStatement.Pages(0)
Dim gfx As XGraphics = XGraphics.FromPdfPage(FirstPage, XGraphicsPdfPageOptions.Prepend)

Dim width As Double = FirstPage.Width
Dim height As Double = FirstPage.Height

Dim box As XRect = New XRect(0, 0, width, height)
gfx.DrawImage(lMerge, box)

cStatement.Save(FilePath)

The purpose of the above code is to merge a PDF containing a letterhead onto the target PDF document. The letterhead PDF has been working fine with PDFSharp. The letterhead is only merged onto page 1 of the document.

I hope this is usefull. Thanks.

Top

simbolo

Post subject: Debug Output

Posted: Mon Jan 19, 2009 6:44 pm

Joined: Wed Jan 14, 2009 2:10 am
Posts: 7
Location: London, United Kingdom

Have been playing around with the Report Services, the only configuration available is to enable compression (as disabled by default), as the error seems related to a ZIP checksum.

Would be terriffic to get this to work, as SQL Report Services 10 can render HTML as PDF, and is free.

I have observed the following debug output messages:

Quote:

UserOrOwnerPasswordRequired: 'At least a user or an owner password is required to encrypt the document.'
UnexpectedToken: 'Token '{0}' was not expected.'
UnknownEncryption: 'The PDF document is protected with an encryption not supported by PDFsharp.'

These messages only appear on the report services 10 PDF, and not the older v9s. There appears to be some unknown symbol (which isn't showing up in the debug message {0}), and curiously, the PDF isn't encrypted or protected.

Still investigating.

Top

simbolo

Post subject: Cannot trace.

Posted: Mon Jan 19, 2009 8:54 pm

Joined: Wed Jan 14, 2009 2:10 am
Posts: 7
Location: London, United Kingdom

I couldn't trace where the encyption checking was done, but the exceptions seems to originate from Parser.cs

Code:

Symbol ReadSymbol(Symbol symbol)
    {
Symbol current = this.lexer.ScanNextToken();
          if (symbol != current)
             throw new PdfReaderException(PSSR.UnexpectedToken(this.lexer.Token));
          return current;
}

the ScanNextToken seems to encounter an unknown Symbol (not implemented"), the exception occours despite the code returning a Symbol.None.

Unsure how to examine the unrecognised symbol, if I break at the end of ScanNextToken() and look at the value for this.symbol in the VS Locals window, the value is set to "PdfSharp.Pdf.IO.Symbol.BeginStream", here are the other values. Setting to return Symbol.BeginStream instead of Symbol.None has no effect.

Quote:

- this {PdfSharp.Pdf.IO.Lexer} PdfSharp.Pdf.IO.Lexer
currChar 136 '?' char
idxChar 3498 int
nextChar 188 '¼' char
+ pdf {System.IO.FileStream} System.IO.Stream {System.IO.FileStream}
PdfLength 8825 int
pdfLength 8825 int
Position 3498 int
symbol BeginStream PdfSharp.Pdf.IO.Symbol
Symbol BeginStream PdfSharp.Pdf.IO.Symbol
+ token {} System.Text.StringBuilder
Token "" string
+ TokenToBoolean 'this.TokenToBoolean' threw an exception of type 'System.IndexOutOfRangeException' bool {System.IndexOutOfRangeException}
+ TokenToInteger 'this.TokenToInteger' threw an exception of type 'System.FormatException' int {System.FormatException}
+ TokenToReal 'this.TokenToReal' threw an exception of type 'System.FormatException' double {System.FormatException}
+ TokenToUInteger 'this.TokenToUInteger' threw an exception of type 'System.FormatException' uint {System.FormatException}
ch 136 '?' char

Removing the debug.asserts just results in the code completing without exception, but just giving me a blank PDF (which is larger in size than the original PDF put in).

Is a BeginStream Symbol abnormal?

Top

simbolo

Post subject: Status update.

Posted: Sat Feb 07, 2009 3:38 pm

Joined: Wed Jan 14, 2009 2:10 am
Posts: 7
Location: London, United Kingdom

I've still not managed to find a solution to this.

Did anyone receive the PDF I sent over?

I notice a similiar Unknown Symbol problem was fixed in the support thread of this forum with just 2 lines of code, so I expect there must be a similiar solution for me?

Top

arrangemonk

Post subject: Re: Corrupted (blank page) when using PDFs from SQL 2008 Reports

Posted: Tue Nov 10, 2009 11:10 am

Joined: Tue Nov 10, 2009 11:07 am
Posts: 4

found this relating to itextsharp
http://social.msdn.microsoft.com/Forums ... 77950c386f
they say something about zip unrealted tokens in the object stream

maybe it helps

Top

arrangemonk

Post subject: Re: Corrupted (blank page) when using PDFs from SQL 2008 Reports

Posted: Thu Nov 26, 2009 11:56 am

Joined: Tue Nov 10, 2009 11:07 am
Posts: 4

the only difference between rs22008 pdfs and normal pdfs is simply
where the normal pdf has "stream" the rs2008 pdf has "stream "
with regex or something the rs pdf can be modified and then used

Top

Ronin

Post subject: Re: Corrupted (blank page) when using PDFs from SQL 2008 Reports

Posted: Mon Apr 26, 2010 3:11 pm

Joined: Mon Apr 26, 2010 3:03 pm
Posts: 1

I had exactly the same bug with PDF generated by SSRS and I fixed it by changing Lexer.cs a bit:

old code:

Code:

case "stream":
    return this.symbol = Symbol.BeginStream;

new code:

Code:

case "stream":
    if (this.nextChar == 32 || this.nextChar == 13 || this.nextChar == '\n')
        ScanNextChar();       
    return this.symbol = Symbol.BeginStream;

Top

marp20x6

Post subject: Re: Corrupted (blank page) when using PDFs from SQL 2008 Reports

Posted: Mon May 17, 2010 1:40 am

Joined: Mon May 17, 2010 1:39 am
Posts: 1

thanks for the pointer Ronin, tested it out and it works a treat, cheers.

Top

Frank Racis

Post subject: Re: Corrupted (blank page) when using PDFs from SQL 2008 Reports

Posted: Thu Sep 02, 2010 7:53 pm

Joined: Thu Sep 02, 2010 7:36 pm
Posts: 1

That fix will work about 99% of the time. However, if your stream data happens to begin with bytes 10, 13, or 32, it will skip over the beginning of your stream.

ReadStream() expects the reader to be positioned at the CR/LF following the "stream" keyword, so you need skip trailing spaces only.

Code:

   case "stream":
      while (ch == 32)
         ch = ScanNextChar();
      return this.symbol = Symbol.BeginStream;

Top

gerald

Post subject: Re: Corrupted (blank page) when using PDFs from SQL 2008 Reports

Posted: Mon May 09, 2011 8:10 am

Joined: Mon May 09, 2011 8:02 am
Posts: 1

My solution - a combination of Ronin's version and the code supplied in [1] - is as follows:

Code:

case "stream":
  if (this.nextChar == 32) ScanNextChar();
  if (this.nextChar == 13) ScanNextChar();
  return this.symbol = Symbol.BeginStream;

HTH someone,
Gerald

[1] Export PDF in ssrs 2008 vs ssrs 2005 - pdf is different, won't work with itextsharp (possibly other PDF post-processing tools)

Top

Page 1 of 1

[ 12 posts ]

Board index » PDFsharp & MigraDoc » Bug Reports

All times are UTC

Who is online

Users browsing this forum: No registered users and 4 guests

You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot post attachments in this forum