By my count, 420372 images are generated according to this sampling scheme and SOA/captions. Is that correct?