Briefly, reads containing adaptor sequences or low-complexity regions were removed from the dataset.
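As a minimal sketch of this kind of filtering, the Python example below discards any read that contains an adaptor sequence or whose base composition has low Shannon entropy. It is illustrative only and not the pipeline used here. The adaptor string (a commonly used Illumina adaptor prefix), the entropy cutoff `MIN_ENTROPY`, and the function names are assumptions, since the text does not name the tool, adaptor, or complexity criterion. Scoring the whole read rather than windows within it is also a simplification.

```python
import math
from collections import Counter

# Assumption: a commonly used Illumina adaptor prefix; the text does not name the adaptor.
ADAPTOR = "AGATCGGAAGAGC"
# Assumption: hypothetical cutoff in bits (the maximum for four bases is 2.0).
MIN_ENTROPY = 1.5


def shannon_entropy(seq):
    """Shannon entropy (bits) of the base composition of seq."""
    counts = Counter(seq)
    total = len(seq)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def keep_read(seq, adaptor=ADAPTOR, min_entropy=MIN_ENTROPY):
    """Return True if the read has no adaptor match and is not low-complexity."""
    if not seq:
        return False
    seq = seq.upper()
    if adaptor in seq:
        return False  # read contains the adaptor sequence
    return shannon_entropy(seq) >= min_entropy


def filter_reads(reads):
    """Keep only the reads that pass both checks, preserving input order."""
    return [r for r in reads if keep_read(r)]


if __name__ == "__main__":
    example = [
        "ACGTACGTTGCAAGCT",       # balanced composition, no adaptor
        "AAAAAAAAAAAAAAAA",       # homopolymer, low complexity
        "ACGTAGATCGGAAGAGCACGT",  # contains the adaptor
    ]
    print(filter_reads(example))
```

In practice, dedicated read-processing tools handle partial adaptor matches, sequencing errors, and windowed complexity scores, none of which this exact-match sketch attempts.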