Metadata-Version: 2.1
Name: insta-scrape
Version: 0.11.0
Summary: Super lightweight Instagram web scraper for data analysis
Home-page: https://github.com/chris-greening/instascrape
Author: Chris Greening
Author-email: chris@christophergreening.com
License: UNKNOWN
Description: ![instascrape logo](/media/logo.png?raw=true)
        
        # instascrape: Instagram scraping for humans
        
        ## What is it?
        > _instascrape_ is a powerful, lightweight library for scraping Instagram data without using their API. It is designed with flexibility and developer productivity in mind so you can stop wasting valuable time collecting data and just start analyzing! 
        
        [![Version](https://img.shields.io/pypi/pyversions/insta-scrape)](https://www.python.org/downloads/release/python-360/)
        [![Language](https://img.shields.io/github/languages/top/chris-greening/instascrape)](https://www.python.org/)
        [![Code style: black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black)
        [![Release](https://img.shields.io/pypi/v/insta-scrape)](https://pypi.org/project/insta-scrape/)
        [![License](http://img.shields.io/:license-mit-blue.svg?style=flat-square)](https://opensource.org/licenses/MIT)
        
        [![Downloads](https://pepy.tech/badge/insta-scrape)](https://pepy.tech/project/insta-scrape)
        [![Activity](https://img.shields.io/github/last-commit/chris-greening/instascrape)](https://github.com/chris-greening/instascrape)
        [![Dependencies](https://img.shields.io/librariesio/github/chris-greening/instascrape)](https://github.com/chris-greening/instascrape/blob/master/requirements.txt)
        [![Issues](https://img.shields.io/github/issues/chris-greening/instascrape?style=flat)](https://github.com/chris-greening/instascrape/issues)
        [![Size](https://img.shields.io/github/repo-size/chris-greening/instascrape)](https://github.com/chris-greening/instascrape)
        
        ![Example showing tech profile scrapes](/media/techprofiles.gif)
        
        ## Key features
        * :walking: Static HTML scrapers 
          * `Profile`: scrapes 50 data points from a profile  
            * follower count
            * recent posts 
            * verification status  
            * etc.
          * `Post`: scrapes almost 50 data points from a post  
            * likes
            * amount of comments
            * hashtags
            * etc. 
          * `Hashtag`: scrapes over a dozen data points from a hashtag  
            * amount of posts 
            * recent posts
            * featured picture URL 
            * etc. 
        * :floppy_disk: Download post media locally as _png_, _jpg_, _mp4_, and _mp3_
        * :musical_score: Expressive and consistent API for concise and elegant code
        * :bar_chart: Designed for seamless integration with 
          * [_Selenium_](https://selenium-python.readthedocs.io/) 
          * [_Pandas_](https://pandas.pydata.org/)
          * [_Scikit-learn_](https://scikit-learn.org/stable/)
          
          and other industry standard libraries for powerful data analysis 
        * :hammer: Lightweight: you don't have to build a hammer factory when all you need is a hammer 
        * :spider_web: The only hard dependencies are
          * [_Requests_](https://requests.readthedocs.io/en/master/)
          * [_Beautiful Soup_](https://www.crummy.com/software/BeautifulSoup/bs4/doc/)
        
        ---
        
        ## Table of Contents
        * [:computer: Installation](#installation)
          * [pip](#pip)
          * [clone](#clone)
        * [:mag_right: Sample Usage](#features)
        * [:books: Documentation](#documentation)
        * [:newspaper: Blog Posts](#blog-posts)
        * [:pray: Contributing](#contributing)
        * [:spider_web: Dependencies](#dependencies)
        * [:credit_card: License](#license)
        * [:grey_question: Support](#support)
        
        ---
        
        ## :computer: Installation <a name="installation"></a>
        
        ### Minimum Python version
        
        This library currently requires [Python 3.7](https://www.python.org/downloads/release/python-370/) or higher.
        
        
        ### pip
        > Install from PyPI using
        ```shell
        $ pip3 install insta-scrape
        ```
        WARNING: make sure you install _insta-scrape_ and not a package with a similar name! 
        
        ---
        
        ## :mag_right: Sample Usage <a name="features"></a>
        All top-level, ready-to-use features can be imported using:
        ```python
        from instascrape import *
        ```
        
        _instascrape_ uses clean, consistent, and expressive syntax to make the developer experience as _painless_ as possible. 
        
        ```python
        # Instantiate the scraper objects 
        google = Profile('https://www.instagram.com/google/')
        google_post = Post('https://www.instagram.com/p/CG0UU3ylXnv/')
        google_hashtag = Hashtag('https://www.instagram.com/explore/tags/google/')
        
        # Load their respective data 
        google.load()
        google_post.load()
        google_hashtag.load()
        ```
        
        After being scraped, relevant attributes can be accessed with dot (.) or bracket (\[\]) notation
        ```python
        print(google.followers)
        print(google_post['hashtags'])
        print(google_hashtag.amount_of_posts)
        >>> 12262794
        >>> ['growwithgoogle']
        >>> 9053408
        ```
        
        ## :books: Documentation <a name="documentation"></a>
        The official documentation can be found on [Read The Docs](https://instascrape.readthedocs.io/en/latest/index.html) :newspaper:
        
        ---
        
        ## :newspaper: Blog Posts <a name="blog-posts"></a>
        
        Check out blog posts on [DEV](https://dev.to/) for ideas and tutorials!
        - [Scrape data from Instagram with instascrape](https://dev.to/chrisgreening/scrape-data-from-instagram-with-instascrape-5e3e) 
        - [Visualizing Instagram engagement with instascrape](https://dev.to/chrisgreening/visualizing-instagram-engagement-with-instascrape-326h)
        - [Exploratory data analysis of Instagram using instascrape and Python](https://dev.to/chrisgreening/exploratory-data-analysis-of-instagram-using-python-1o5c)
        - [Creating a scatter matrix of Instagram data using Python](https://dev.to/chrisgreening/visualizing-the-relationship-between-instagram-variables-using-python-55gg)
        - [Downloading an Instagram profile's recent photos using Python](https://dev.to/chrisgreening/downloading-an-instagram-profile-s-recent-photos-using-python-25b2)
        
        ---
        
        ## :pray: Contributing <a name="contributing"></a>
        All contributions, bug reports, bug fixes, documentation improvements, enhancements, and ideas are welcome!
        
        Feel free to [open an Issue](https://github.com/chris-greening/instascrape/issues/new/choose) or look at existing [Issues](https://github.com/chris-greening/instascrape/issues) to get a dialogue going on what you want to see added/changed/fixed.
        
        Beginners to open source are highly encouraged to participate and ask questions :heart:
        
        ---
        
        ## :spider_web: Dependencies <a name="dependencies"></a>
        
        Instascrape primarily relies on two third-party libraries for requesting and scraping Instagram HTML content:
        
        1. [Requests](https://requests.readthedocs.io/en/master/): HTTP requests
        2.  [BeautifulSoup](https://www.crummy.com/software/BeautifulSoup/bs4/doc/): Scraping and parsing HTML data.
        
        The rest of its functionality is provided directly from Python 3's standard library for unobtrusive code under the hood with little to no overhead.
        
        ---
        
        
        ## :credit_card: License <a name="license"></a>
        [MIT](LICENSE)
        
        ---
        
        ## :grey_question: Support <a name="support"></a>
        Reach out to me if you have questions or ideas!
        * Email:
          * chris@christophergreening.com
        * Instagram: 
          * [@chris_greening](https://www.instagram.com/chris_greening/)
        * Twitter:
          * [@ChrisGreening2](https://twitter.com/ChrisGreening2)
        
        ---
        
        
        <!-- The inspiration for this project began a long time ago in a galaxy far, far away (a.k.a. Summer 2019 on Long Island). I was mindlessly scrolling Instagram for the 1000th hour that week and thought, "How could I access this data programatically?". After 30 seconds of searching it became clear that Instagram's API was not going to be of any use so I was going to have to figure it out myself, and thus the beginning of instascrape was born. -->
        
        <p align="center">
          <img src="media/logopic.png">
        </p>
        
Platform: UNKNOWN
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Requires-Python: >=3.7
Description-Content-Type: text/markdown
