[TriLUG] To The Oracle:

Michael Peters michael00peters at gmail.com
Wed Apr 30 08:52:51 EDT 2014


While not ideal (because it involves forking out another process if
you're using it inside of PHP) but you can look at the common
pdftotext command line utility.

On Tue, Apr 29, 2014 at 9:42 PM, Brian McCullough <bdmc at buadh-brath.com> wrote:
> Greetings, all.
>
> Once again, I have what I hope is an interesting question that some or
> many of you, can help with.
>
>
> Last fall, I learned about creating PDFs from PHP code, now I need to go
> the other way, and extract data from PDFs.
>
> I have found more than one method in PHP for reading PDFs, but,
> unfortunately, even the newest methods don't seem to be able to deal
> with "modern" PDFs, version 1.4.
>
> Here, instead of text with other markup, as we see in older PDFs, there
> seem to be blocks of binary code intermixed with markup.
>
>
> Does anybody have any suggestions for dealing with this new version of
> PDF?
>
> Although I would like to do this in PHP, I will take other languages if
> necessary.
>
>
>
> Thanks,
> Brian
>
> --
> This message was sent to: Michael Peters <michael00peters at gmail.com>
> To unsubscribe, send a blank message to trilug-leave at trilug.org from that address.
> TriLUG mailing list : http://www.trilug.org/mailman/listinfo/trilug
> Unsubscribe or edit options on the web  : http://www.trilug.org/mailman/options/trilug/michael00peters%40gmail.com
> Welcome to TriLUG: http://trilug.org/welcome


More information about the TriLUG mailing list