Metadata-Version: 1.2
Name: python-pdfbox
Version: 0.1.8
Summary: Python interface to Apache PDFBox command-line tools.
Home-page: https://github.com/lebedov/python-pdfbox/
Author: Lev E. Givon
Author-email: lev@columbia.edu
License: Apache
Description: Package Description
        -------------------
        Provides a simple Python 3 interface to the `Apache PDFBox <https://pdfbox.apache.org/>`_
        command-line tools.
        
        .. image:: https://img.shields.io/pypi/v/python-pdfbox.svg
            :target: https://pypi.python.org/pypi/python-pdfbox
            :alt: Latest Version
                  
        Requirements
        ------------
        Aside from Python 3 and those packages specified in
        `setup.py <https://github.com/lebedov/python-pdfbox/blob/master/setup.py>`_,
        python-pdfbox requires ``java`` to be present in the system path.
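        
        The ``java`` requirement above can be checked before using the package. The
        following is a minimal sketch using only the standard library; the helper
        name ``java_available`` is hypothetical and is not part of python-pdfbox.
        
        ```python
        import shutil
        
        
        def java_available(executable="java"):
            """Return True if the executable can be found on the system path.
        
            Hypothetical helper for illustration; not part of python-pdfbox.
            """
            return shutil.which(executable) is not None
        
        
        if __name__ == "__main__":
            if java_available():
                print("java found on the system path")
            else:
                print("java not found; install a Java runtime and add it to the path")
        ```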
        
        Installation
        ------------
        The package may be installed as follows: ::
        
            pip install python-pdfbox
        
        To use a specific PDFBox jar file, set the ``PDFBOX`` environment variable
        to its location. If the variable is not set, python-pdfbox looks for the
        jar file in the platform-specific user cache directory and, if it is not
        found there, automatically downloads and caches it.
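        
        The lookup order described above can be sketched roughly as follows. This is
        an illustration under stated assumptions, not the package's actual code: the
        function name ``resolve_jar_path``, its parameters, and the
        ``pdfbox-app-*.jar`` file name pattern are all assumptions.
        
        ```python
        import os
        from pathlib import Path
        
        
        def resolve_jar_path(cache_dir, environ=None):
            """Return the jar path to use, or None if a download would be needed.
        
            Hypothetical sketch: prefer the PDFBOX environment variable, then fall
            back to a jar already present in the given cache directory.
            """
            environ = os.environ if environ is None else environ
        
            # 1. An explicit PDFBOX setting takes priority.
            explicit = environ.get("PDFBOX")
            if explicit:
                return Path(explicit)
        
            # 2. Otherwise look for a cached jar (file name pattern is an assumption).
            cache = Path(cache_dir)
            if cache.is_dir():
                candidates = sorted(cache.glob("pdfbox-app-*.jar"))
                if candidates:
                    return candidates[-1]
        
            # 3. Nothing found: the real package would download and cache the jar here.
            return None
        ```
        
        To point the package at a jar file for a single command, the variable can be
        set in the shell before Python starts, e.g. ``PDFBOX=/path/to/pdfbox-app.jar python script.py``.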
        
        Usage
        -----
        The interface currently exposes only a few PDFBox features (text extraction,
        conversion to images, and extraction of images): ::
        
            import pdfbox
            p = pdfbox.PDFBox()
            p.extract_text('/path/to/my_file.pdf')   # writes text to /path/to/my_file.txt
            p.pdf_to_images('/path/to/my_file.pdf')  # writes images to /path/to/my_file1.jpg, /path/to/my_file2.jpg, etc.
            p.extract_images('/path/to/my_file.pdf') # writes images to /path/to/my_file-1.png, /path/to/my_file-2.png, etc.
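        
        The output locations noted in the comments above can be computed ahead of
        time, for example to check that a file was written. The helpers below are a
        hypothetical sketch that only mirrors the naming shown in those comments;
        the actual file names may depend on the PDFBox version in use.
        
        ```python
        from pathlib import Path
        
        
        def expected_text_path(pdf_path):
            """Path of the text file, e.g. my_file.pdf -> my_file.txt."""
            return Path(pdf_path).with_suffix(".txt")
        
        
        def expected_page_image_path(pdf_path, page, ext="jpg"):
            """Path of a rendered page image, e.g. my_file.pdf -> my_file1.jpg."""
            p = Path(pdf_path)
            return p.with_name(f"{p.stem}{page}.{ext}")
        
        
        def expected_extracted_image_path(pdf_path, index, ext="png"):
            """Path of an extracted image, e.g. my_file.pdf -> my_file-1.png."""
            p = Path(pdf_path)
            return p.with_name(f"{p.stem}-{index}.{ext}")
        ```
        
        After calling ``p.extract_text(pdf)``, one could then test
        ``expected_text_path(pdf).exists()`` before reading the result.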
        
        Development
        -----------
        The latest release of the package may be obtained from
        `GitHub <https://github.com/lebedov/python-pdfbox>`_.
        
        Author
        ------
        See the included `AUTHORS.rst 
        <https://github.com/lebedov/python-pdfbox/blob/master/AUTHORS.rst>`_ file for more 
        information.
        
        License
        -------
        This software is licensed under the
        `Apache 2.0 License <https://opensource.org/licenses/Apache-2.0>`_.
        See the included `LICENSE.rst 
        <https://github.com/lebedov/python-pdfbox/blob/master/LICENSE.rst>`_ file for more 
        information.
        
Platform: UNKNOWN
Classifier: Development Status :: 3 - Alpha
Classifier: Environment :: Console
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: Apache Software License
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3
Classifier: Topic :: Software Development
Requires-Python: >=3
