[TriLUG] To The Oracle:

Robert Dale robdale at gmail.com
Tue Apr 29 21:46:16 EDT 2014


In Java, there's iText and PDFBox.

http://sourceforge.net/projects/itext/

http://pdfbox.apache.org/

On Tue, Apr 29, 2014 at 9:42 PM, Brian McCullough <bdmc at buadh-brath.com> wrote:
> Greetings, all.
>
> Once again, I have what I hope is an interesting question that some or
> many of you, can help with.
>
>
> Last fall, I learned about creating PDFs from PHP code, now I need to go
> the other way, and extract data from PDFs.
>
> I have found more than one method in PHP for reading PDFs, but,
> unfortunately, even the newest methods don't seem to be able to deal
> with "modern" PDFs, version 1.4.
>
> Here, instead of text with other markup, as we see in older PDFs, there
> seem to be blocks of binary code intermixed with markup.
>
>
> Does anybody have any suggestions for dealing with this new version of
> PDF?
>
> Although I would like to do this in PHP, I will take other languages if
> necessary.
>
>
>
> Thanks,
> Brian
>
> --
> This message was sent to: Robert Dale <robdale at gmail.com>
> To unsubscribe, send a blank message to trilug-leave at trilug.org from that address.
> TriLUG mailing list : http://www.trilug.org/mailman/listinfo/trilug
> Unsubscribe or edit options on the web  : http://www.trilug.org/mailman/options/trilug/robdale%40gmail.com
> Welcome to TriLUG: http://trilug.org/welcome



-- 
Robert Dale


More information about the TriLUG mailing list