The protein sequences used in this study are available from public sources: SARS-CoV-2 sequences: https://viralzone.expasy.org/89966, NCBI "non-redundant" protein database, version 5: ftp://ftp.ncbi.nlm.nih.gov/blast/db/FASTA, Protein reference sequences: ftp://ftp.ncbi.nlm.nih.gov/refseq/release/viral. The workflow and accompanying Python scripts is available as a Snakefile for use with Snakemake under https://gitlab.com/svenrahmann/corona.