PDFsharp & MigraDoc Foundation • View topic

All times are UTC

Problem with trailing '\0'

Moderator: Stefan Lange

ToniM

Post subject: Problem with trailing '\0'

Posted: Wed Nov 21, 2012 8:18 am

Joined: Thu Aug 20, 2009 6:27 am
Posts: 14

Hello guys,

I ran into a problem (caused by SAP) with some '\0' bytes after the '%%EOF' marker in the PDF.

The problem occurs in the ReadTrailer() method in Parser.cs.
There, the trailer is searched for within a fixed range measured from the "real" end of the file.

I can think of two ways to avoid the 'Unexpected Token' error:
- extend the search range from 130 to x bytes (not the best way)
- count the '\0' bytes at the end and add that count to 130 (I don't really like this either, but it is the better way)
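
The second option could be sketched roughly as follows. This is only an illustration: CountTrailingZeroBytes is a hypothetical helper, not part of PDFsharp, and it assumes the file contents are available as a byte array.

Code:

```csharp
using System;

static class TrailingZeros
{
    // Hypothetical helper (not a PDFsharp API): returns the number of
    // consecutive '\0' bytes at the very end of the buffer.
    public static int CountTrailingZeroBytes(byte[] data)
    {
        int count = 0;
        // Walk backwards from the last byte until a non-zero byte is found.
        for (int i = data.Length - 1; i >= 0 && data[i] == 0; i--)
            count++;
        return count;
    }
}
```

The search range would then be 130 plus the returned count, so the window still covers the real end of the PDF data.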

What would you suggest?
Is there a reason for the hardcoded "130 bytes from the end" approach that I am not aware of?
Why not search for 'startxref' within the whole document?

Thanks in advance!

BR ToniM

Thomas Hoevel

Post subject: Re: Problem with trailing '\0'

Posted: Wed Nov 21, 2012 9:51 am

PDFsharp Guru

Joined: Mon Oct 16, 2006 8:16 am
Posts: 3101
Location: Cologne, Germany

Hi!

Code that searches the complete file:
http://forum.pdfsharp.net/viewtopic.php?p=583#p583

A better approach: search within 130 bytes at the end of the file, then search the whole file as a fallback strategy.
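
A minimal standalone sketch of this two-stage search, independent of the PDFsharp lexer. The class and method names are made up for illustration, and the whole file is assumed to be in memory as a byte array.

Code:

```csharp
using System;
using System.Text;

static class StartXRefFinder
{
    // Hypothetical helper (not a PDFsharp API): returns the byte offset of
    // 'startxref', or -1 if the keyword does not occur at all.
    public static int FindStartXRef(byte[] pdf, int tailLength = 130)
    {
        // ASCII decoding yields exactly one char per byte (bytes above 127
        // become '?'), so string indices equal byte offsets.
        string text = Encoding.ASCII.GetString(pdf);

        // Stage 1: look only at the last tailLength bytes (the fast path).
        int tailStart = Math.Max(0, text.Length - tailLength);
        int idx = text.IndexOf("startxref", tailStart, StringComparison.Ordinal);
        if (idx >= 0)
            return idx;

        // Stage 2: fall back to the whole file and take the last occurrence.
        return text.LastIndexOf("startxref", StringComparison.Ordinal);
    }
}
```

For a well-formed file only the short tail is scanned. The full search runs only when the tail contains no 'startxref', for example when it consists of padding bytes.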

The guys at SAP don't know what "%%EOF" stands for. They shouldn't write more bytes than the file actually needs. Adding 50000 zero bytes at the end of the file makes no sense.

_________________
Regards
Thomas Hoevel
PDFsharp Team

ToniM

Post subject: Re: Problem with trailing '\0'

Posted: Wed Nov 21, 2012 10:00 am

Joined: Thu Aug 20, 2009 6:27 am
Posts: 14

Hi,

is there a reason for the 130 bytes?
Or is it just a guessed value with some safety margin?

Either way, I will follow your approach.
Are you interested in checking in the result?

BR ToniM

Thomas Hoevel

Post subject: Re: Problem with trailing '\0'

Posted: Wed Nov 21, 2012 10:16 am

PDFsharp Guru

Joined: Mon Oct 16, 2006 8:16 am
Posts: 3101
Location: Cologne, Germany

Initially we read only 30 bytes (enough if "%%EOF" is at the end of the file).
Then we found some PDF files where the producer had added the product name after "%%EOF", so we extended the search to 130 bytes.

Please post your changes here for us and everyone else. Thanks!

_________________
Regards
Thomas Hoevel
PDFsharp Team

ToniM

Post subject: Re: Problem with trailing '\0'

Posted: Wed Nov 21, 2012 10:17 am

Joined: Thu Aug 20, 2009 6:27 am
Posts: 14

Sorry, I didn't search well enough!

Please close this thread!

This is the original thread:
http://forum.pdfsharp.net/viewtopic.php?p=583#p583

Thank you, and sorry for the trouble.

BR ToniM

PS: Here is my solution. Maybe you would like to include it in a future version, to avoid duplicate postings.

(Version: 1.32.2608)

Code:

    /// <summary>
    /// Reads the iref table and the trailer dictionary.
    /// </summary>
    internal PdfTrailer ReadTrailer()
    {
      //Symbol symbol;
      //string token;
      //int xrefOffset = 0;
      int length = lexer.PdfLength;

      int offset = 0;
#if true
      offset = 130;
#else
      offset = 30;
#endif

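      // First look for 'startxref' within the last <offset> bytes of the file.
      string trail = this.lexer.ReadRawString(length - offset - 1, offset); //lexer.Pdf.Substring(length - 30);
      int idx = trail.IndexOf("startxref");
      if (idx < 0)
      {
        // Fallback: search the whole file, e.g. when many '\0' bytes follow '%%EOF'.
        trail = this.lexer.ReadRawString(0, length);
        idx = trail.LastIndexOf("startxref");
        // Guard added here (an assumption, not original PDFsharp behavior):
        // without it a missing 'startxref' would set Position to -1.
        if (idx < 0)
          throw new System.InvalidOperationException("'startxref' not found in PDF file.");
        this.lexer.Position = idx;
      }
      else
      {
        this.lexer.Position = length - offset - 1 + idx;
      }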

      ReadSymbol(Symbol.StartXRef);
      this.lexer.Position = ReadInteger();

      // Read all trailers
      PdfTrailer trailer;
      while (true)
      {
        trailer = ReadXRefTableAndTrailer(this.document.irefTable);
        // 1st trailer seems to be the best..
        if (this.document.trailer == null)
          this.document.trailer = trailer;
        int prev = trailer.Elements.GetInteger(PdfTrailer.Keys.Prev);
        if (prev == 0)
          break;
        //if (prev > this.lexer.PdfLength)
        //  break;
        this.lexer.Position = prev;
      }

      return this.document.trailer;
    }
