Wrangling a de novo sequencing benchmark
In any machine learning study, high-quality data for training and validating the model is critical. This paper describes the result of an iterative process of data wrangling and quality control, which ultimately produced a benchmark dataset for de novo peptide sequencing from mass spectrometry data.