Clostridium botulinum Group I and Clostridium sporogenes are closely related bacteria responsible for foodborne, infant and wound botulism. A comparative genomic study with 556 highly diverse strains of C. botulinum Group I and C. sporogenes (including 417 newly sequenced strains) has been carried out to characterise the genetic diversity and spread of these bacteria and their neurotoxin genes. Core genome single-nucleotide polymorphism (SNP) analysis revealed two major lineages; C. botulinum Group I (most strains possessed botulinum neurotoxin gene(s) of types A, B and/or F) and C. sporogenes (some strains possessed a type B botulinum neurotoxin gene). Both lineages contained strains responsible for foodborne, infant and wound botulism. A new C. sporogenes cluster was identified that included five strains with a gene encoding botulinum neurotoxin sub-type B1. There was significant evidence of horizontal transfer of botulinum neurotoxin genes between distantly related bacteria. Population structure/diversity have been characterised, and novel associations discovered between whole genome lineage, botulinum neurotoxin sub-type variant, epidemiological links to foodborne, infant and wound botulism, and geographic origin. The impact of genomic and physiological variability on the botulism risk has been assessed. The genome sequences are a valuable resource for future research (e.g., pathogen biology, evolution of C. botulinum and its neurotoxin genes, improved pathogen detection and discrimination), and support enhanced risk assessments and the prevention of botulism.