Abstract:Streptomyces is a genus of diverse actinomycetes known for producing antibiotics, showcasing high value for research and development. However, the limited number of Streptomyces species included in commercial databases restricts the application of matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) in the identification of Streptomyces.Objective To establish a MALDI-TOF MS database for the identification of Streptomyces based on the mining of whole genome sequencing data and test its accuracy.Methods By searching against a genome database, we selected all the genome-sequenced type strains of Streptomyces. Subsequently, the sequences of 12 ribosomal proteins were extracted from their genome sequences and theoretical MALDI mass/charge (m/z) values were calculated. Accordingly, a database was established based on the combination of 12 theoretically calculated m/z values, which can be used as the identification criteria for Streptomyces species. Finally, type strains were randomly collected for MALDI-TOF MS testing. To test the accuracy of the database construction method and comparison results, we matched the experimental m/z values with the theoretically calculated m/z values of the species in the database.Results We established a MALDI-TOF MS database including 615 Streptomyces species. The mass spectra of the tested strains were accurately matched with those of corresponding species in the database. In addition, we proposed a method for the identification of Streptomyces by database comparison.Conclusion The database established in this study laid a foundation for identifying Streptomyces based on MALDI-TOF MS.