Skip to content
#

data-comparison

Here are 16 public repositories matching this topic...

GenEC (Generic Extraction & Comparison) is a Python-based tool designed for extracting structured data from source and reference files, then comparing their contents based on defined rules. It allows customization through YAML-based configuration files and supports both command-line and programmatic usage.

  • Updated Oct 14, 2025
  • Python

The goal of this project is to build FastAPI-based web service that processes large datasets using Polars and Pandas, and comparison of the two packages performance in loading, cleaning and transforming large datasets are reviewed. The API is tested via an interactive `/docs` interface to reveal results of the comparison project.

  • Updated Jul 5, 2025
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the data-comparison topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-comparison topic, visit your repo's landing page and select "manage topics."

Learn more