Data Analytics Activities

CAMS Data Analytics 

The insights and discussions from the Data Analytics State of the Nation event have significantly contributed to shaping the CAMS Data Analytics thematic area. CAMS has identified several gaps between industry and academia. To address these challenges, the event working group has proposed the establishment of two steering committees to drive both short-term and long-term projects:

  1. Data Analytics Teaching and Training (DATT) Steering Committee
  2. Data Analytics Network and Infrastructure (DANI) Steering Committee

DANI - Data Analytics Webinar Series

The next webinar will be hosted on 20th June at 2pm. Our next speaker will be Dr Jorem Posma. To register your interest, please sign up here.

DATT Steering Committee

Data Analytics Teaching and Training (DATT)

What have we achieved in 2024?

1) Invited partners and members to form the committee, ensuring a balanced mix of academics and industry professionals.

2) Held discussions to identify focus areas and tasks, and determined the best approach to address them.

3) Distributed a survey to gather insights into the current state of data analytics in the industry, identify opportunities and skills needed, and understand where we can help tackle these challenges.

 

The Committee

 

 

Dr Lucy M. Morgan

Senior Scientist in Analytical Chemistry (and Data Science)

Pfizer Ltd.

Lucy has 6 years' experience teaching chemistry and data analytics in academia and 3 years' experience training in analytical chemistry and data science in the pharmaceutical industry. Her background is in batteries and pharmaceuticals, with skills in molecular dynamics and DFT modelling, predictive modelling (LC, Raman, CCS, UV), and coding in Python.

 

 

Allyson McIntyre 

Principal Scientist 

AstraZeneca 

Allyson works in and around the area of data analytics and is keen to ensure we have strong links between industry & academia to enable improved training and guidance in this area. She will help bring an industrial perspective to what we do and to the skills we need to improve, and will work alongside other members of DATT to deliver improvements in this area.

 

 

Diane C. Turner PhD FRSC

Chair of Trustees of the Analytical Chemistry Trust Fund; Director & Senior Consultant, Anthias Consulting Ltd

Through my role as a consultant and trainer in analytical sciences across most industries globally, I teach how to use software from many manufacturers for data analysis, alongside how to plan, collect and use the data that is needed for a project through to data analytics. I have been using chemometrics for most of my career, including for my PhD in disease diagnosis. I am using this knowledge and experience within DATT to look at how to improve teaching and training in data analytics.

 

 

Claire White

Chemist

Selden Research Ltd

With over 17 years' experience working in industry laboratories, I have helped train and mentor numerous students and new employees in the field of chemistry, including the analysis of data. This background equips me with practical insights on how to enhance teaching and training in data analytics, helping bridge the gap between academia and industry.

 

 

 

Data Analytics Webinar Series

Missed our first webinar in the Data Analytics series? No worries! The recording is now available, allowing you to revisit Kate's presentation and our panel discussion.
Access the recording here. Don’t miss out!
 

Q&A summary

How did Kate come about her current role and what funding mechanisms were used?

  • Kate explained that she has worked with a foot in industry for about 10 years, initially at the Quadram Institute and the Institute of Food Research. She collaborated closely with Oxford Instruments on an Innovate UK project, which led to her dual role. She continued working as a consultant via QIB Extra for Oxford Instruments until COVID-19 caused an abrupt end. Later, she took on a consultancy with Mestrelab, which eventually supported her fully as a scientific director and in a role at the University of East Anglia.

How good does analytical data need to be for AI to be successful?

  • Kate emphasized the importance of robust statistics to handle errors in large datasets. She mentioned using trimmed means and medians instead of actual means or standard deviations to avoid the influence of outlying values. Additionally, she highlighted the importance of understanding and eyeballing the data to spot errors. 
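As a rough illustration of the point about robust statistics, the sketch below compares a plain mean with a trimmed mean and a median on a small set of replicate readings containing one gross error. The readings, the 10% trimming proportion, and the helper name `trimmed_mean` are assumptions made for this example only; they do not come from the webinar.

```python
import statistics


def trimmed_mean(values, proportion_to_cut=0.1):
    """Mean after discarding the lowest and highest fraction of the values."""
    if not 0 <= proportion_to_cut < 0.5:
        raise ValueError("proportion_to_cut must be in [0, 0.5)")
    ordered = sorted(values)
    k = int(len(ordered) * proportion_to_cut)  # points dropped at each end
    kept = ordered[k:len(ordered) - k] if k > 0 else ordered
    return statistics.fmean(kept)


# Hypothetical replicate readings; 55.0 stands in for a gross error.
readings = [10.1, 9.9, 10.0, 10.2, 9.8, 10.0, 10.1, 9.9, 10.0, 55.0]

plain_mean = statistics.fmean(readings)      # pulled upwards by the outlier
robust_mean = trimmed_mean(readings, 0.1)    # drops one value at each end
median = statistics.median(readings)         # unaffected by the outlier

print(f"mean={plain_mean:.3f} trimmed={robust_mean:.3f} median={median:.3f}")
```

With a single outlier among ten readings, the plain mean moves well away from the bulk of the data, while the trimmed mean and the median stay close to it.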

How can we overcome the reluctance to adopt machine learning in industry?

  • Kate suggested that explainable AI is crucial for industry adoption. She mentioned the importance of independent test data and proving the model's performance on completely novel substances. She also noted that the complexity of models, which makes them effective, can be a challenge in unravelling their workings. 
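One way to set up independent test data of the kind described above is to hold out whole substances, so that nothing in the test set was seen during training. This is a minimal sketch under stated assumptions: the record layout (a dict with a `substance` key), the function name `split_by_substance`, and the 25% test fraction are all hypothetical.

```python
import random


def split_by_substance(samples, test_fraction=0.25, seed=0):
    """Split samples so that no substance appears in both train and test."""
    substances = sorted({s["substance"] for s in samples})
    rng = random.Random(seed)  # fixed seed keeps the split reproducible
    rng.shuffle(substances)
    n_test = max(1, round(len(substances) * test_fraction))
    test_substances = set(substances[:n_test])
    train = [s for s in samples if s["substance"] not in test_substances]
    test = [s for s in samples if s["substance"] in test_substances]
    return train, test


# Hypothetical dataset: four substances, three replicate measurements each.
samples = [
    {"substance": name, "replicate": i}
    for name in ("A", "B", "C", "D")
    for i in range(3)
]

train, test = split_by_substance(samples)
train_names = {s["substance"] for s in train}
test_names = {s["substance"] for s in test}
print("train:", sorted(train_names), "test:", sorted(test_names))
```

A purely random split of individual measurements would place replicates of the same substance on both sides, which tends to overstate how well a model handles novel substances.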

Sustainability of AI methods in industry:

  • Kate clarified that her work does not involve generative AI, which is computationally expensive. For her models, predictions are instantaneous and not an issue for sustainability. The training phase is computationally intensive, but they have made progress in streamlining it by reorganizing training data to reduce redundancy. 

Deciding on variable input for FTIR spectrum in machine learning:

  • Kate advised against including noisy data in neural networks. She recommended cutting out regions of the spectral baseline that are not informative. She mentioned that random forests can work with raw data, but it depends on the specific application and the amount of work one wants to do. 
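The advice about cutting out uninformative baseline regions can be sketched as a simple mask over the wavenumber axis. The region boundaries, the coarse wavenumber grid, and the function name `keep_regions` are illustrative assumptions; which regions are informative depends on the application.

```python
def keep_regions(wavenumbers, intensities, regions):
    """Keep only points whose wavenumber falls inside a (low, high) region."""
    kept = [
        (w, y)
        for w, y in zip(wavenumbers, intensities)
        if any(low <= w <= high for low, high in regions)
    ]
    return [w for w, _ in kept], [y for _, y in kept]


# Hypothetical coarse spectrum: 600 to 4000 cm^-1 in steps of 200.
wavenumbers = list(range(600, 4001, 200))
intensities = [0.0] * len(wavenumbers)  # placeholder absorbance values

# Assumed informative regions, chosen for illustration only.
regions = [(800, 1800), (2800, 3000)]

kept_w, kept_y = keep_regions(wavenumbers, intensities, regions)
print(f"kept {len(kept_w)} of {len(wavenumbers)} points")
```

The reduced vectors would then be passed to the model in place of the full spectrum.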

Skills crucial for analytics professionals:

  • Kate highlighted the importance of experience and practice, noting that there are plenty of data repositories and free platforms like Python for learning. She emphasized the need for a numerate background and the ability to think in multidimensional spaces. 

Addressing data openness and privacy concerns:

  • Kate mentioned that UEA is committed to data sharing, especially for publicly funded projects, and aims to place collected data in the public domain after publication. She acknowledged the challenges of combining data from different sources and the proprietary nature of some business data. David added that instrument vendors might be more willing to share data, while software vendors might find it difficult due to the proprietary value of their datasets.

DANI Steering Committee

Data Analytics Network and Infrastructure (DANI)

What have we achieved in 2024?

1) Invited partners and members to form the committee, ensuring a balanced mix of academics and industry professionals.

2) Held discussions to review focus areas and confirm committee members.

3) Developed a regular webinar series on Data Analytics.

4) Created a clear landing page on the CAMS website to provide resources on good data and infrastructure practices.

 

The Committee

 

Dr Drupad Trivedi 

Lecturer in Analytical and Measurement Sciences

The University of Manchester | Analytical Chemistry, Chemometrics, Metabolomics

Dr Drupad Trivedi is a CAMS lecturer and data analytics MSI co-chair. His research expertise spans mass spectrometry techniques, data analytics, and point-of-use sensor development for health and disease monitoring as well as prediction. His current research focuses on translating laboratory assays into wearable sensor technologies, utilizing data-driven approaches to decode complex signals. With nearly a decade of research and leadership experience, he has built a multidisciplinary research program focused on signal processing, data analysis, and big data modelling in analytical research. His work has been strengthened through active international collaborations, industry consultancy and academic collaborations, contributing to the advancement of analytical measurement sciences.

 

Rebecca Ingle

Lecturer

University College London

Rebecca's research involves the development and application of advanced spectroscopic techniques to problems in molecular photochemistry and new applications in the analytical sciences. Many of her experiments involve dealing with large, multidimensional datasets and often require extensive post-processing and statistical analysis. She is particularly interested in how better standardisation of experimental techniques and analysis methods can improve the value of data for the scientific community.

 

 

Martin Strachon

Digital Scientist, Analytical R&D

Pfizer

In my role within analytical R&D, I focus on streamlining laboratory workflows to enhance data management and analysis, while also enabling further data analytics. This expertise contributes to the steering committee's understanding of scientists' needs and software standards in an industry setting.

 

 

 

Chiara Giorio

Professor of Atmospheric Chemistry

Yusuf Hamied Department of Chemistry, University of Cambridge

My expertise is in multivariate statistical analysis, chemometrics, mass spectrometry and source apportionment.

 

 

Alex Henderson

Senior Technical Specialist (Data Systems Architect)

The University of Manchester