Skip to content
Open
Changes from 1 commit
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
5 changes: 5 additions & 0 deletions data-pipeline/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -58,6 +58,11 @@ The ClinVar pipelines must be run on clusters with an appropriate version of VEP
./deployctl dataproc-cluster start <cluster-name>
```

NOTE: Starting with `gnomad version 4.1`, it is highly recommended to specify a
custom OS image that was built for that gnomAD version. To see these images, run
`gcloud compute images list --filter="family=dataproc-custom-image"`. You can then
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Running this tells me 0 images were found:

❯ gcloud compute images list --filter="family=dataproc-custom-image"
Listed 0 items.

My gcloud config's project is set to exac-gnomad, maybe that's the reason?

pass the desired image name to the above `start` command using the `--image` flag.

- Run a pipeline.

A list of all pipelines can be seen with `./deployctl data-pipeline run --help`.
Expand Down