To calculate $p_{ik}$, we grouped the 229 reference genomes into subgroups based on information gathered from [26,27] (see Table 1). We assume that, for each gene, $p_{ik}$ is identical for all genomes within a subgroup. We then take $p_{ik}$ to be the ratio of the number of genomes in the subgroup that have an orthologous gene to the total number of genomes in the subgroup.
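As an illustration only, the ratio described above can be sketched in a few lines of Python. The function name `estimate_pik`, the input layout (a genome-to-subgroup mapping and, for each gene, the set of genomes carrying an ortholog), and the toy genome and gene labels are all assumptions made for this sketch; they are not taken from the original analysis or from Table 1.

```python
from collections import defaultdict


def estimate_pik(subgroup_of, has_ortholog):
    """Estimate the per-gene, per-subgroup ortholog frequency.

    subgroup_of  -- dict mapping genome id -> subgroup label (hypothetical layout)
    has_ortholog -- dict mapping gene id -> set of genome ids with an ortholog
    Returns a dict keyed by (gene, subgroup) holding the ratio.
    """
    # Total number of genomes in each subgroup (the denominator).
    sizes = defaultdict(int)
    for group in subgroup_of.values():
        sizes[group] += 1

    pik = {}
    for gene, genomes in has_ortholog.items():
        # Genomes per subgroup that carry an ortholog of this gene (the numerator).
        counts = defaultdict(int)
        for genome in genomes:
            counts[subgroup_of[genome]] += 1
        # Subgroups with no ortholog-carrying genome get a ratio of 0.
        for group, size in sizes.items():
            pik[(gene, group)] = counts[group] / size
    return pik


# Toy data, invented for illustration: four genomes in two subgroups.
subgroup_of = {"g1": "A", "g2": "A", "g3": "A", "g4": "B"}
has_ortholog = {"geneX": {"g1", "g2", "g4"}, "geneY": {"g3"}}
pik = estimate_pik(subgroup_of, has_ortholog)
```

In this toy input, two of the three genomes in subgroup A carry an ortholog of `geneX`, so the ratio for that gene and subgroup is 2/3. A subgroup in which no genome carries the ortholog receives a ratio of exactly zero.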