-
Notifications
You must be signed in to change notification settings - Fork 22
Description
Hi,
I have the following problem with graphtyper v2.7.7. I recalled SVs from a number of individuals. I started by using svimmer to merge SVs from these individuals in a first step, used the output to recall SVs from the raw cram files by chromosome and concatenated the subfiles by using the graphtyper vcf_concatenate -sv command. Now, as a final step, I want to merge the files to create an aggregated VCF file. During that processing, I get the following error:
<error> FATAL ERROR - SNP-HWE: Current genotype configuration (%d %d %d ) includes a negative count-1359751756 332284134 1018685522
Do you know how to fix that error? I have no idea where the negative count comes from.
Help would be appreciated.
Edit:
I checked all the genotypes in the used VCF files. No negative values are present. One suspicion I have, is that the negative value could be caused by an integer overflow. My suspicion is caused by the size of the numbers reported in the error message. Both the first and last number are ints with 10 positions. The error message is emitted by the function p_hwe_excess_het (https://github.com/DecodeGenetics/graphtyper/blob/49643915ed69a20d408d5758afdb62dbd88c4d33/src/utilities/snp_hwe.cpp#L19C1-L32C24), which takes three inputs in int (I assume 32bit) format. If the first value would exceed 2,147,483,647, it would flip to negative values and emit the error message.
My main question would be, why the value could be so high?
As additional information: I am merging 20 VCF files with 100 Individuals each.
Edit2:
I checked the VCF files and found no genotype stats (number of het, homRef, homAlt) either below 0 or over 100 (which is the expected number of individuals).