Instagram - Webscraper

On seeing this very particular link (pasted on the repo), I was curious to scrape them data and see what was it all about. The page opened to 100 other page and each one to 10 other pages with 25 x 5 = 125 accounts. That totals upto 125k accounts present in this peculiar indexing.

The scrapper.py scrapped all them accounts and stored in a CSV which then was accessed by main.py to hit the accounts and get the basic profile details such as postCount, followerCount and followingCount.

Above is the screenshot of the terminal with the preview of the scrapped data

My calculations told me that the entire computation will take 15 hours. So I had to kill it after scrapping ~500 accounts. I dont have that much time.

Viv: The final aim is to eventually access the follower and following name list for each individual. Process could be sped up using spider splits.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
chromedriver_linux64		chromedriver_linux64
README.md		README.md
main.py		main.py
nameList.csv		nameList.csv
scrapper.py		scrapper.py
terminal.png		terminal.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Instagram - Webscraper

About

Releases

Packages

Contributors 2

Languages

TonyJacb/Instagram-Webscraper

Folders and files

Latest commit

History

Repository files navigation

Instagram - Webscraper

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages