```bash
python3 -m pip install git+https://github.com/jpata/SparseDistance.git@v0.1
```
(or just copy the files from the repo to your project)
Efficiently generate sparse graph adjacency matrices using TensorFlow, including gradient propagation and minibatching, for graph sizes of 100k+ elements in subquadratic time.
The following images show the input set on the left and the learned graph structure (edges) on the right, for a toy clustering problem with approximately 5000 input elements per graph.
Below, we show the learned distance matrix on the left and the scaling of the training time on the right.
Here's how it works:
- Input: a set of elements with features, `shape=(N_batch, N_elem, N_feat)`, possibly in minibatches for efficient training (e.g. a minibatch may consist of several sets/graphs padded to the same size)
- Output: a sparse adjacency matrix for each input set, `shape=(N_batch, N_elem, N_elem)`, the elements of which can be differentiated with respect to the input
- Hyperparameters: bin size M, number of neighbors K, LSH codebook size (maximum number of bins) L
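As a concrete illustration of these shapes, here is a minimal sketch; the constructor and its arguments are taken from the full example further below, with `max_num_bins` playing the role of L, `bin_size` of M, and `num_neighbors` of K, and the specific values chosen only for illustration:

```python
import numpy as np
from sparsedistance.models import SparseHashedNNDistance

# a minibatch of 4 sets/graphs, each padded to 2000 elements with 16 features
X = np.random.randn(4, 2000, 16).astype(np.float32)

dm_layer = SparseHashedNNDistance(max_num_bins=100, bin_size=200, num_neighbors=8)

# sparse adjacency matrix with logical shape (4, 2000, 2000)
dm = dm_layer(X)
```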
The input data is divided into equal-sized bins with a locality sensitive hashing (LSH) scheme based on random rotations. In each bin, we run a dense k-nearest-neighbors algorithm and update the final sparse adjacency matrix. The generated graph consists of `N_elem/bin_size` disjoint subgraphs.
The maximum input size is determined by the pre-generated LSH codebook size. Since the bin size is much smaller than the input size, the k-nearest-neighbors evaluation is efficient.
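To make the binning step concrete, here is a minimal, self-contained sketch of LSH binning via random projections. This is not the library's internal implementation: `lsh_bin_ids` and all names are illustrative, and the equal-sized-bin constraint described above is omitted for brevity.

```python
import tensorflow as tf

def lsh_bin_ids(points, num_hash_bits=4):
    """Hash each point to a bin id via random projections (illustrative sketch).

    points: float32 tensor of shape (N_elem, N_feat)
    returns: int32 tensor of shape (N_elem,), with values in [0, 2**num_hash_bits)
    """
    n_feat = points.shape[-1]
    # random projection directions play the role of the random rotation
    projections = tf.random.normal((n_feat, num_hash_bits), seed=42)
    # the sign of each projected coordinate contributes one bit of the hash code
    bits = tf.cast(tf.matmul(points, projections) > 0, tf.int32)
    powers = tf.pow(2, tf.range(num_hash_bits))
    return tf.reduce_sum(bits * powers, axis=-1)

points = tf.random.normal((1000, 32))
bin_ids = lsh_bin_ids(points)  # nearby points tend to land in the same bin
```

Since each bin then runs a dense k-nearest-neighbors search over only `bin_size` elements, the total cost scales roughly as `N_elem * bin_size` rather than `N_elem^2`, which is where the subquadratic scaling comes from.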
The input features to the hashing and kNN can themselves be learnable, so that the binning and graph construction adapt to the problem via gradient descent, as the following example demonstrates:
```python
import tensorflow as tf
import numpy as np

from sparsedistance.models import SparseHashedNNDistance
from sparsedistance.utils import sparse_dense_matmult_batch

num_batches = 10
num_points_per_batch = 1000
num_features = 32

# random input sets, shape (N_batch, N_elem, N_feat)
X = np.array(np.random.randn(num_batches, num_points_per_batch, num_features), dtype=np.float32)
# per-element targets, defined for completeness but not used below
y = np.array(np.random.randn(num_batches, num_points_per_batch), dtype=np.float32)

# show that we can take a gradient of stuff with respect to the distance matrix values (but not indices!)
dense_transform = tf.keras.layers.Dense(128)
dm_layer = SparseHashedNNDistance(max_num_bins=200, bin_size=500, num_neighbors=5)

with tf.GradientTape(persistent=True) as g:
    X_transformed = dense_transform(X)
    dm = dm_layer(X_transformed)
    ret = sparse_dense_matmult_batch(dm, X)
    # reduce the output to a single scalar, just for demonstration purposes
    ret = tf.reduce_sum(ret)

grad = g.gradient(ret, dense_transform.weights)
```
Features:
- Works on a modest GPU (e.g. 2060S) or a CPU
- Uses only native TF 2.x operations, no compilation needed
- Fast evaluation and efficient memory use
- TF graph mode for easy deployment (see the sketch after this list)
- TF eager mode for debugging
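For example, the distance-matrix construction can be wrapped in a `tf.function` so that it runs as a compiled TF graph. A minimal sketch, reusing the `dm_layer` and `dense_transform` objects from the example above (`build_graph` is an illustrative name):

```python
@tf.function
def build_graph(X):
    # traced once on the first call, then executed in graph mode
    return dm_layer(dense_transform(X))

dm = build_graph(tf.constant(X))
```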
Based on the Reformer [1] (LSH approach and description) and GravNet [2] (kNN graph construction) papers.

[1] N. Kitaev, Ł. Kaiser, A. Levskaya, "Reformer: The Efficient Transformer", ICLR 2020, arXiv:2001.04451
[2] S. R. Qasim, J. Kieseler, Y. Iiyama, M. Pierini, "Learning representations of irregular particle-detector geometry with distance-weighted graph networks", Eur. Phys. J. C 79 (2019) 608, arXiv:1902.07987
If you use this code academically, please cite this repository as follows:
- Joosep Pata. (2020, October 22). jpata/SparseDistance v0.1 (Version v0.1). Zenodo. http://doi.org/10.5281/zenodo.4117570