I was facing a familiar bottleneck. I had Staphylococcus aureus genomes and needed answers—sequence types, spa types, SCCmec cassettes, resistance, and virulence genes. But getting that data meant wrangling half a dozen tools with conflicting dependencies and output formats. The irony was sharp: I was using cutting-edge genomic data, but my workflow felt stuck in the dark ages. For MRSA, time matters. An outbreak doesn't wait for you to debug a script.
That frustration sparked StaphScope. I built the tool I wished existed: one optimized solely for S. aureus, installable with a single command, that would take assembled genomes and return a complete, unified profile in minutes—with epidemiological context included. The key was a curated global lineage database integrated into the pipeline. Now, when you run StaphScope, it doesn't just spit out "ST8, t008, SCCmec IV". It tells you: "This is the USA300 clone, commonly PVL-positive."
The first time I ran it on 24 genomes, I barely had time for coffee. In under 14 minutes, it was done. An 8-10x speed gain over generalist pipelines. We had turned fragmented data into a coherent biological narrative in the time it takes to watch an episode of a TV show.
Our new paper in BMC Genomics confirms StaphScope's accuracy against reference strains like USA300 and N315. But more importantly, analyzing a diverse set of isolates revealed insights—dominant clones, PVL distribution, and even potential plasmid transfer between lineages.
We also built StaphScope Web , bringing the same power to clinicians and lab managers through a simple drag-and-drop interface. Democratizing access was a core goal.
I invite you to try it. Whether you use the command-line tool or the web interface, I hope StaphScope gives you back your most valuable resource: time to think about the biology, not the bioinformatics.
Here's to turning fragmented data into coherent narratives, one S. aureus genome at a time.