Modern biology generates huge quantities of complex data
Data is being generated at a rapidly accelerating rate within the life-sciences industry. The best-documented example is the explosion of data within genomics where, fuelled by advances in sequencing hardware, up to 40 exabytes of sequencing data are projected to be generated yearly by 2025. However, genomic sequencing represents only one aspect of modern biology. What of all the data that is being, or could be, captured in other areas of biological research and development?
Presented with an abundance of data streams, even the experienced laboratory scientist often faces challenges in handling large, complex datasets across multiple disparate software packages:
- Manual manipulation and transfer of complex datasets from machines into generalist analysis packages such as Excel.
- No or poor data provenance (the traceable history of the data).
- Vendor software packages that are overly complex and focused on a specific niche. This makes data integration across instruments challenging and creates a technical hurdle for laboratory scientists, whose workflow is disrupted by a range of isolated software packages.
A new way of doing Biology: Synthace
To address these problems, Synthace developed Antha, an operating system for biology, built by scientists for scientists, that provides end-to-end data integration and automation across the whole spectrum of the life-sciences industry. Antha works across multiple hardware platforms, running the design, execution and analysis of the experiment.
The data can then be rapidly passed to the platform's powerful visualisation and analysis tools, so that new insights can be obtained quickly. This raw, flexible power is harnessed through an easy-to-use interface, allowing even the infrequent user to rapidly become an expert.
Bioprocessing Case Study: Ambr 250™ data visualisation powered by Synthace
Bioprocess development is a labour- and data-intensive aspect of the biopharmaceutical industry, and is increasingly identified as a bottleneck to the delivery of breakthrough therapeutics. By using Synthace, a single bioprocess engineer can design, execute, analyse and "visualise a 12-reactor run in just three days, when it otherwise could take two weeks: increasing productivity 5x".
The commonly used Sartorius Ambr 250™ bioprocess optimisation system produces large quantities of on-line data, up to 50,000,000 data points per experiment, while its robotic sampling arm allows high-frequency off-line bioreactor sampling. The resulting data files, often >1 GB in size, are unmanageable in common off-the-shelf solutions. By leveraging Synthace, this data can be collected, cleaned, structured (e.g. off-line and on-line data linked), versioned and displayed in a dynamic visualisation, or provided in a format suitable for further interrogation via advanced analysis methods (e.g. Python-based machine learning packages).
Within the bioprocessing field, advanced multivariate data analysis (MVDA) has become a routine part of many workflows as a result of the increased use of Process Analytical Technology (PAT). Likewise, advanced data analytics, in particular machine learning, has begun to find applications in all areas of research and development. By structuring the data for the user, Synthace solves one of the challenges in modern data science: obtaining clean and structured datasets for advanced machine learning applications.
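To give a feel for what "off-line and on-line data linked" can mean in practice, here is a minimal Python sketch that pairs each off-line sample with the most recent on-line reading taken shortly before it. It is an illustration only and not Synthace's implementation: the record types, field names (`time_h`, `ph`, `do_percent`, `viable_cells_per_ml`), the `link_offline_to_online` function, the tolerance value and the example numbers are all hypothetical.

```python
from bisect import bisect_right
from dataclasses import dataclass


@dataclass
class OnlineReading:
    """Hypothetical on-line sensor record (field names are assumptions)."""
    time_h: float       # hours since the start of the run
    ph: float
    do_percent: float   # dissolved oxygen, % saturation


@dataclass
class OfflineSample:
    """Hypothetical off-line sample record (field names are assumptions)."""
    time_h: float
    viable_cells_per_ml: float


def link_offline_to_online(online, offline, tolerance_h=0.1):
    """Pair each off-line sample with the latest on-line reading at or before it.

    Returns a list of (sample, reading) tuples. The reading is None when no
    on-line reading exists within tolerance_h hours before the sample.
    """
    online_sorted = sorted(online, key=lambda r: r.time_h)
    times = [r.time_h for r in online_sorted]
    linked = []
    for sample in sorted(offline, key=lambda s: s.time_h):
        # Index of the last on-line reading with time_h <= sample.time_h.
        idx = bisect_right(times, sample.time_h) - 1
        reading = None
        if idx >= 0 and sample.time_h - times[idx] <= tolerance_h:
            reading = online_sorted[idx]
        linked.append((sample, reading))
    return linked


# Made-up example data, for illustration only.
online = [
    OnlineReading(0.0, 7.00, 98.0),
    OnlineReading(0.5, 6.98, 91.0),
    OnlineReading(1.0, 6.95, 85.0),
]
offline = [
    OfflineSample(0.52, 1.2e6),  # shortly after the 0.5 h reading
    OfflineSample(3.0, 4.8e6),   # no on-line reading within tolerance
]

for sample, reading in link_offline_to_online(online, offline):
    print(sample.time_h, reading.ph if reading else None)
```

The binary search keeps the lookup cheap even when the on-line stream holds millions of points, and returning None for unmatched samples makes gaps in the sensor record explicit rather than silently filling them.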
What we integrate with
Ready to Revolutionise your Workflow?
The example given here illustrates a simple proof-of-concept case for easy integration of Synthace into your bioprocess workflow. With applications rapidly coming online for more advanced bioprocessing solutions, construct assembly and assay development, along with drivers for more pieces of lab hardware, the opportunities for transforming your lab’s productivity are endless.