File tree Expand file tree Collapse file tree 2 files changed +23
-0
lines changed Expand file tree Collapse file tree 2 files changed +23
-0
lines changed Original file line number Diff line number Diff line change @@ -10,4 +10,8 @@ Polars is an alternative to pandas that is designed to have better performance.
10
10
directory with the same name.
11
11
1 . ` polars_versus_pandas_benchmarks.ipynb ` : Jupyter notebook that compares the
12
12
performance of polars and pandas on a variety of operations.
13
+ 1 . ` create_csv_data.py ` : Python script to generate one or more large CSV files
14
+ for benchmarking.
15
+ 1 . ` create_csv_data.slurm ` : Slurm script to run ` create_csv_data.py ` on a
16
+ cluster.
13
17
1 . ` data ` : Directory containing the data used in the notebook.
Original file line number Diff line number Diff line change
1
+ #! /usr/bin/env -S bash -l
2
+ # SBATCH --account=lpt2_sysadmin
3
+ # SBATCH --nodes=1
4
+ # SBATCH --ntasks=1
5
+ # SBATCH --cpus-per-task=1
6
+ # SBATCH --mem=2G
7
+ # SBATCH --time=01:00:00
8
+ # SBATCH --mail-user=geertjan.bex@uhasselt.be
9
+ # SBATCH --mail-type=FAIL,END
10
+
11
+ module purge
12
+ module load Python/3.11.3-GCCcore-12.3.0
13
+
14
+ # This should generate a file of approximately 6 GB
15
+ python ./create_csv_data.py \
16
+ --files 1 \
17
+ --cols 100 \
18
+ --rows 2500000 \
19
+ large_data
You can’t perform that action at this time.
0 commit comments