pdfminer, Release 0.0.1
• 2010/02/13: Bugfix and enhancement. Thanks to André Auzi.
• 2010/02/07: Several bugfixes. Thanks to Hiroshi Manabe.
• 2010/01/31: JPEG image extraction supported. Page rotation bug fixed.
• 2010/01/04: Python 2.6 warning removal. More doctest conversion.
• 2010/01/01: CMap bug fix. Thanks to Winfried Plappert.
• 2009/12/24: RunLengthDecode filter added. Thanks to Troy Bollinger.
• 2009/12/20: Experimental polygon shape extraction added. Thanks to Yusuf Dewaswala for reporting.
• 2009/12/19: CMap resources are now the part of the package. Thanks to Adobe for open-sourcing them.
• 2009/11/29: Password encryption bug fixed. Thanks to Yannick Gingras.
• 2009/10/31: SGML output format is changed and renamed as XML.
• 2009/10/24: Charspace bug fixed. Adjusted for 4-space indentation.
• 2009/10/04: Another matrix operation bug fixed. Thanks to Vitaly Sedelnik.
• 2009/09/12: Fixed rectangle handling. Able to extract image boundaries.
• 2009/08/30: Fixed page rotation handling.
• 2009/08/26: Fixed zlib decoding bug. Thanks to Shon Urbas.
• 2009/08/24: Fixed a bug in character placing. Thanks to Pawan Jain.
• 2009/07/21: Improvement in layout analysis.
• 2009/07/11: Improvement in layout analysis. Thanks to Lubos Pintes.
• 2009/05/17: Bugfixes, massive code restructuring, and simple graphic element support added. setup.py is sup-
ported.
• 2009/03/30: Text output mode added.
• 2009/03/25: Encoding problems fixed. Word splitting option added.
• 2009/02/28: Robust handling of corrupted PDFs. Thanks to Troy Bollinger.
• 2009/02/01: Various bugfixes. Thanks to Hiroshi Manabe.
• 2009/01/17: Handling a trailer correctly that contains both /XrefStm and /Prev entries.
• 2009/01/10: Handling Type3 font metrics correctly.
• 2008/12/28: Better handling of word spacing. Thanks to Christian Nentwich.
• 2008/09/06: A sample pdf2html webapp added.
• 2008/08/30: ASCII85 encoding filter support.
• 2008/07/27: Tagged contents extraction support.
• 2008/07/10: Outline (TOC) extraction support.
• 2008/06/29: HTML output added. Reorganized the directory structure.
• 2008/04/29: Bugfix for Win32. Thanks to Chris Clark.
• 2008/04/27: Basic encryption and LZW decoding support added.
• 2008/01/07: Several bugfixes. Thanks to Nick Fabry for his vast contribution.
• 2007/12/31: Initial release.
8 Chapter 1. PDFMiner