Abstract
High-resolution MS-based proteomics generates large amounts of data, even in the standard LC–tandem MS configuration. Adding an ion mobility dimension vastly increases the acquired data volume, challenging both analytical processing pipelines and especially data exploration by scientists. This has necessitated data aggregation, effectively discarding much of the information present in these rich datasets. Taking trapped ion mobility spectrometry (TIMS) on a quadrupole TOF (Q-TOF) platform as an example, we developed an efficient indexing scheme that represents all data points as detector arrival times on scales of minutes (LC), milliseconds (TIMS), and microseconds (TOF). In our open-source AlphaTims package, data are indexed, accessed, and visualized by a combination of tools of the scientific Python ecosystem. We interpret unprocessed data as a sparse four-dimensional matrix and use just-in-time compilation to machine code with Numba, accelerating our computational procedures by several orders of magnitude while keeping to familiar indexing and slicing notations. For samples with more than six billion detector events, a modern laptop can load and index raw data in about a minute. Loading is even faster when AlphaTims has already saved indexed data in an HDF5 file, a portable scientific standard used in extremely large-scale data acquisition. Subsequently, data accession along any dimension and interactive visualization happens in milliseconds. We have found AlphaTims to be a key enabling tool to explore high-dimensional LC-TIMS-Q-TOF data and have made it freely available as an open-source Python package with a stand-alone graphical user interface at https://github.com/MannLabs/alphatims or as part of the AlphaPept ‘ecosystem’.
Highlights
Efficient indexing of billions of five-dimensional data points in seconds. Fast accession of arbitrary LC-trapped ion mobility spectrometry (TIMS)-QTOF data slices in milliseconds. Easy and fast visualization of LC-TIMS-QTOF data along multiple axes. Freely available graphical user interface (GUI), command-line interface (CLI), and Python module on Windows, Linux, and macOS
Taking trapped ion mobility spectrometry (TIMS) on a quadrupole TOF (Q-TOF) platform as an example, we developed an efficient indexing scheme that represents all data points as detector arrival times on scales of minutes (LC), milliseconds (TIMS), and microseconds (TOF)
To better explain the indexing procedure at the heart of AlphaTims, we shortly summarize the data structures used in the vendor’s software in their TIMS data format
Summary
MS, a leading analytical method in clinical and (bio) chemical research, is no exception This issue is compounded when coupling MS with other techniques such as LC and ion mobility spectrometry [1], which allow separating analytes efficiently in scientific domains such as proteomics, lipidomics, and metabolomics [2,3,4]. A detector at the end of the TOF accelerator discretizes continuous ion arrival times into TOF peaks of a few hundred picoseconds wide This combination of analytical techniques, in brief LC-TIMS-QTOF, has received much attention since the introduction of the timsTOF Pro instrument (Bruker Daltonics).
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have