Conversation

@khou2020 (Member) commented Nov 28, 2022

This PR adds data chunking and compression features. By using additional metadata and manipulating the space between data objects' file offsets, these features can be implemented without breaking the NetCDF file format specification.

More information about the design and implementation is available in: K. Hou, Q. Kang, S. Lee, A. Agrawal, A. Choudhary, and W. Liao, "Supporting Data Compression in PnetCDF," International Conference on Big Data, 2021.

An example program is available in ./examples/C/chunk_compress.c
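
For readers without the source tree at hand, here is a minimal sketch of the entry point, assuming the nc_chunking info hint that appears in the reproducer later in this thread (the helper name is illustrative, and error checking is omitted):

#include <mpi.h>
#include <pnetcdf.h>

/* Minimal sketch: create a file with chunking enabled via an MPI_Info
 * hint. The hint name "nc_chunking" is taken from the reproducer
 * posted later in this thread. */
int create_chunked_file(MPI_Comm comm, const char *path, int *ncidp)
{
    MPI_Info info;
    int err;

    MPI_Info_create(&info);
    MPI_Info_set(info, "nc_chunking", "enable");

    err = ncmpi_create(comm, path, NC_CLOBBER, info, ncidp);
    MPI_Info_free(&info);
    return err;
}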

@wkliao changed the title from "Add chunking and compression driver" to "New features: chunking and compression" on Dec 4, 2022
@wkliao force-pushed the master branch 2 times, most recently from 051cdc1 to f1db1d6 on May 23, 2024
@wkliao force-pushed the master branch 3 times, most recently from 9c403de to 29e55b9 on November 11, 2024
@gsjaardema

This sounds very interesting and is in line with some work we are planning to do this fiscal year (the plan was to use the HDF5 compression under netCDF). If this is planned to go into production in the short term, we would include it in that work as well.

@wkliao (Member) commented Jul 3, 2025

Since you mentioned HDF5, I wonder if there is anything missing from HDF5 that this PnetCDF PR provides.

@gsjaardema

No, we basically want to give users some options. Some prefer HDF5-based files and some prefer the native format. It also gives us flexibility: if we run into problems with one format, we can try the other and see whether it has the same issue.

Dealing with the compression filter plugins on a netCDF-4 file is somewhat problematic, since a reader needs to know, at read time, which filter was used to write the file and must have that filter available. That is the main reason we have delayed implementing some of the compression filters: we cannot guarantee that a reader of the file will have the correct filter, and given a raw HDF5 file, how do we even know which filter is needed to read it?
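
For the discovery part of that problem, netCDF-C does provide a filter-inquiry call that lets a reader at least learn the required filter ID before attempting to decode. A minimal sketch, assuming netCDF-C's existing nc_inq_var_filter (not part of this PR; the error code returned when no filter is present varies across netCDF-C versions):

#include <stdio.h>
#include <netcdf.h>
#include <netcdf_filter.h>

/* Report the HDF5 filter id a variable was written with, so a reader
 * can check whether the matching filter plugin is available. */
static void report_filter(int ncid, int varid)
{
    unsigned int id;
    size_t nparams;
    int err = nc_inq_var_filter(ncid, varid, &id, &nparams, NULL);

    if (err == NC_NOERR)
        printf("varid %d: filter id %u, %zu parameters\n", varid, id, nparams);
    else
        printf("varid %d: no filter reported (err %d)\n", varid, err);
}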

@wkliao (Member) commented Jul 3, 2025

That is indeed a challenge.

Another thing I learned is that users may want to use different compression parameters for different variables. Is this supported in your case?

@gsjaardema commented Jul 3, 2025

Another thing I learned is that users may want to use different compression parameters for different variables. Is this supported in your case?

We have not investigated that yet; currently we use the same parameters for all datasets. We do have some integer and some double-precision floating-point datasets, so different algorithms might help there, but we haven't looked into it yet.

@wkliao (Member) commented Jul 3, 2025

If you plan to give this PR a try, I very much welcome your feedback.

FYI, PnetCDF also supports compression/decompression in its nonblocking APIs, which allow multiple variables to be compressed/decompressed in a single call to ncmpi_wait_all. More information and timing results can be found in our 2021 paper mentioned above.
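
A minimal sketch of that pattern, assuming two already-defined variables (the ids, counts, and buffers below are illustrative):

#include <pnetcdf.h>

void write_two_vars(int ncid, int ivarid, int dvarid,
                    MPI_Offset start, MPI_Offset count,
                    const int *ibuf, const double *dbuf)
{
    int req[2], st[2];

    /* post nonblocking writes; no I/O happens yet */
    ncmpi_iput_vara_int(ncid, ivarid, &start, &count, ibuf, &req[0]);
    ncmpi_iput_vara_double(ncid, dvarid, &start, &count, dbuf, &req[1]);

    /* with this PR, both variables are compressed and flushed in this
     * single collective call */
    ncmpi_wait_all(ncid, 2, req, st);
}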

@dqwu commented Sep 5, 2025

@wkliao
Have you tested this PR with --enable-sz --with-sz=/sz/install/path?

It looks like the SZ headers are installed under /sz/install/path/include/sz rather than directly in /sz/install/path/include.

Because of that, the following check fails to find sz.h:

if test "x${have_sz}" = xyes; then
   AC_CHECK_HEADERS([sz.h], [], [have_sz=no])
fi

A possible fix would be to adjust the include path:

if test "x${SZ_INSTALL}" != x ; then
   CPPFLAGS+=" -I${SZ_INSTALL}/include/sz"
   ...
fi

@wkliao (Member) commented Sep 5, 2025

For such an unusual installation layout, I suggest setting the CPPFLAGS environment variable to the SZ include path when running configure. Please let me know if that works.

@dqwu commented Sep 5, 2025

For such an unusual installation layout, I suggest setting the CPPFLAGS environment variable to the SZ include path when running configure. Please let me know if that works.

Actually, that is the default installation layout of SZ.

Setting the CPPFLAGS environment variable to the SZ include path before calling configure works:
export CPPFLAGS="-I /sz/install/path/include/sz"
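
Combined with the configure flags mentioned above, the full workaround sequence is then (same placeholder paths as before):

export CPPFLAGS="-I /sz/install/path/include/sz"
./configure --enable-sz --with-sz=/sz/install/path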

@dqwu commented Sep 8, 2025

@wkliao
The following example runs correctly with the PnetCDF master branch, but hangs at the ncmpi_wait_all call when tested with this PR. Could you please take a look?

#include <mpi.h>
#include <pnetcdf.h>
#include <stdio.h>

#define DIM_LEN 8

int main(int argc, char **argv)
{
    int ncid, dimid, varid, rank;
    int vals[DIM_LEN] = {-1, -2, -3, -4, -5, -6, -7, -8};
    MPI_Offset start, count;
    MPI_Info info;
    int req;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    MPI_Info_create(&info);
    MPI_Info_set(info, "nc_chunking", "enable");

    ncmpi_create(MPI_COMM_WORLD, "test.nc", NC_CLOBBER, info, &ncid);
    MPI_Info_free(&info);

    ncmpi_def_dim(ncid, "x", DIM_LEN, &dimid);
    ncmpi_def_var(ncid, "var", NC_INT, 1, &dimid, &varid);

    ncmpi_enddef(ncid);

    if (rank == 0)
    {
        start = 0;
        count = DIM_LEN;
        ncmpi_iput_vara_int(ncid, varid, &start, &count, vals, &req);
    }
    else
        req = NC_REQ_NULL;
        
    printf("rank = %d, before ncmpi_wait_all\n", rank); fflush(stdout);
    ncmpi_wait_all(ncid, 1, &req, NULL);
    printf("rank = %d, after ncmpi_wait_all\n", rank); fflush(stdout);

    ncmpi_close(ncid);

    MPI_Finalize();

    return 0;
}

Notes:

  • The hang does not occur if nc_chunking is set to "disable".
  • The hang does not occur if ncmpi_wait_all is called with NC_REQ_ALL, e.g.:
    ncmpi_wait_all(ncid, NC_REQ_ALL, NULL, NULL);

@wkliao (Member) commented Sep 9, 2025

Hi, @dqwu

Thanks. I was able to reproduce the error. I pushed a fix. Please let me know if it fixes the problem.

@dqwu commented Sep 10, 2025

Hi, @dqwu

Thanks. I was able to reproduce the error. I pushed a fix. Please let me know if it fixes the problem.

The problem has been fixed in the latest feature branch, thanks.
