Skip to content

ShinnoT/cryptocrawl-pipeline

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Cryptocurrency Crawl Data Pipeline

This project extracts cryptocurrency data from Yahoo Finance, processes the data, and stores the results in a SQLite database. It is a simple ETL pipeline implemented using Python and the Bonobo ETL framework.

cryptocrawl-pipeline - portfolio site


Table of Contents


About

The script will scrape crypto data from the target website, process the data, and store the results in a SQLite database located at database/crypto.db.

You can view the database using any SQLite database client.


Installation

Before running the project, you need to install the required Python libraries. To do this, navigate to the project directory and run the following command:

pip install -r requirements.txt

NOTE:

from collections import Iterable has been deprecated but is still in use in the bonobo package -- to fix for local usage, modify the import statement in the file bonobo\config\processors.py to from collections.abc import Iterable


Usage

To run the ETL pipeline, execute the main script from the project directory:

python scripts/main.py

License

This project is licensed under the terms of the MIT license.


Made with ❤️ by Shinno Taguchi

About

A bonobo ETL data pipeline that web crawls and loads cryptocurrency data to an SQLite database. 🐒

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages