C-GAPS: Cellular to Global Assessment of Phytoplankton Stoichiometry 

JGI Seedcorn funding 2024-25: Nicola Wiseman & Léo Gorman

Introduction

The oceans play a key role in keeping the earth cool by absorbing carbon dioxide from the atmosphere into the surface waters. This carbon dioxide is either slowly transported to the deep ocean by currents or taken up by microscopic marine algae called phytoplankton and used for photosynthesis. Like plants on land, phytoplankton need sunlight and nutrients to be able to perform photosynthesis, which gives them the energy they need to grow. As sunlight and carbon dioxide are abundantly available in the surface ocean, the rate of phytoplankton photosynthesis is limited by the nutrients that are available, particularly nitrogen and phosphorus (Moore et al., 2013). The ratio of nutrients to carbon in phytoplankton is referred to as “stoichiometry” and can be used to understand the efficiency of carbon storage in the global ocean.  

What were the aims of the seed corn project?

Previously, stoichiometry was assumed to be relatively stable throughout the ocean, known as the Redfield Ratio (Redfield, 1934). This ratio was used to simplify climate models and heavily controls the amount of carbon storage in the ocean. However, field observations and inverse modelling studies have shown that stoichiometry varies significantly with latitude and nutrient availability (Martiny et al., 2013; Galbraith & Martiny, 2015; Weber and Deutsch, 2010). Laboratory culture studies have tested the responses of individual species to changing environmental conditions, but this level of species-specific detail is not represented in climate models, which instead focus on a small number (n  = 3 or 4) of “functional groups” which aim to summarize diverse species by their traits or distributions within the ocean (Le Quéré et al., 2005). To further our understanding of ocean carbon storage and phytoplankton stoichiometric response, we ask the following questions: 

  • How do different phytoplankton species vary their stoichiometry based on environmental conditions?  
  • What does the response to temperature, light, or individual nutrients look like?  
  • What are the primary drivers of these responses?  
  • Does grouping phytoplankton by function groups underestimate their unique stoichiometry responses? 
  • Which drivers are key for informing current state-of-the-art climate models? 

Methods

We used data from Tanioka & Matsumoto, 2020, which is a compilation of laboratory experiments that looked at the stoichiometric (carbon:nitrogen and carbon:phosphorus) response of individual plankton species to different perturbations of environmental conditions (temperature, light intensity, photoperiod, nitrogen limitation, phosphorus limitation, and iron limitation).  

We needed to select a model which could account for the fact that there are similarities between species and functional groups. Variance components models, a type of multi-level model, are able to quantify how much of the variation in stoichiometry can be attributed to differences between species, and differences between functional groups. These models can also be extended to assess: 

  1. What is the association between stoichiometry and explanatory variables (such as light), accounting for the fact that there are similarities between samples from the same species? 
  1. Does the association between stoichiometry and these explanatory variables vary across species? (i.e. can stoichiometry in different species be more or less sensitive to light). 

A range of models were compared both in terms of their interpretability and their predictive power. Data cleaning, visualisation, and modelling were carried out in R with the package “brms” used for fitting the Bayesian models. 

What was achieved? 

With the model, we first focused on the carbon:nitrogen stoichiometry response to the explanatory variables. Categorizing the phytoplankton by genus provided the most effective predictive power, followed by the combined four phytoplankton with eight subgroups model.  

The greatest variance for the fixed effects was due to nitrogen limitation, which had a positive effect (where the distribution was greatly positive, such that nitrogen limitation is associated with higher carbon:nitrogen ratios). This is a logical primary driver in variation as if there is less nitrogen available, it will be more difficult for phytoplankton to assimilate nitrogen into their cells. The next strongest effect was phosphorus limitation, followed by iron limitation. Interestingly, these are acting in opposite directions where phosphorus limitation is associated with a positive effect, while iron limitation is associated with a negative effect. Increased nitrogen and phosphorus limitation have previously been proposed to be positively associated with increased C:N ratios (Tanioka & Matsumoto, 2020), where proposed mechanisms involving the changing allocations to phosphorus and nitrogen rich cell components. Iron, however, is associated with protein cofactors as part of electron transport for photosynthesis. This could then potentially impact production of carbon-rich carbohydrates by limiting photosynthetic efficiency, thereby reducing the carbon:nitrate ratio. 

Graph showing the fixed effects of the explanatory variables (nitrogen limitation, iron limitation, phosphorus limitation, temperature, photoperiod and light) carbon:nitrogen stoichiometry for marine phytoplankton. 
Figure 1. The fixed effects of the explanatory variables on carbon:nitrogen stoichiometry for marine phytoplankton. 

Next Steps

These are just the preliminary results from the first iteration of the model. From here, the effects of each explanatory variable on specific phytoplankton groups can also be examined, which could highlight potential variations in biogeography and phytoplankton adaptation. For example, are globally adapted genera less susceptible to temperature effects than genera found within the polar regions? One advantage of the model we are using is that we can incorporate categorical variables of phytoplankton traits like motility and size to investigate the impacts of trait ecology on cellular nutrient uptake. This is the next step planned to incorporate into the dataset. Additionally, we will apply this approach to carbon:phosphorus stoichiometry as well. This project is the initial step to start investigating the response of phytoplankton to a changing climate and will be used as a proof of concept for continued efforts to disentangle the variability in phytoplankton stoichiometry to ultimately be used to improve the biogeochemical models used for future climate prediction on multi-century timescales. 


Contact Details

The code for this project can be found at https://github.com/nicola-wiseman/C-GAPS. Nicola can be contacted via email a nicola.wiseman@bristol.ac.uk

Comparing two devices for measuring body temperature across the menstrual cycle  

JGI Seedcorn funding 2024-25: Sarah Koerner & Louise Millard

Sarah Koerner sitting at a table facing the participant. Sarah is holding an Oura Ring and showing it to the participant.
Study researcher Sarah Koerner (PhD student) showing device to participant during participant’s first visit to study centre. Note: This is a posed image with a non-participant who provided consent to be photographed. 

Body temperature changes across the menstrual cycle with a small rise indicating ovulation. New wearable technologies make it possible to track these changes continuously during the user’s sleep. This type of data can provide valuable insights into menstrual health patterns and cycle variability, offering researchers ways to study reproductive health in real-world settings. 

What are the aims of the project?  

This study investigates two distinct wearable devices: the OvuSense sensor, which is worn vaginally, and the Oura Ring a smart ring worn on the finger. Both devices are worn during sleep and measure body temperature to detect ovulation. The aims of this project are to (1) assess how similar the temperature measurements are between the two devices and (2) to explore participants experiences of wearing the devices for a research study.  

On the left is an image of the OvuSense sensor shown next to a ruler and tampon. On the right is an image of a hand wearing a silver coloured ring (Oura Ring) on the index finger.
OvuSense sensor shown next to a ruler and tampons for size comparison, and the Oura Ring worn on the index finger.  

What has been achieved so far?  

We spent a considerable amount of time refining the study design to ensure it effectively addressed our research questions while being sensitive to the intimate nature of vaginal temperature measurements. One particular aspect that needed careful consideration was the duration of data collection. We had initially planned to collect data over the course of a single menstrual cycle. However, this approach would mean that participants were wearing the devices for different durations, which could make their feelings towards wearing the devices less comparable. We would have also needed to exclude those with longer cycles due to budget and time constraints of the project. Therefore, to ensure inclusivity and consistency, we instead opted for a fixed 40-night data collection period.  

We also paid close attention to the language used in our study materials. We aimed to make all participant communications inclusive, and in particular used gender-inclusive language throughout (e.g. our study requires participants to have a vagina rather than identify as a women).   

The design of our study is shown in the image below. Each participant is asked to wear both the OvuSense and Oura Ring devices during their sleep for 40 consecutive nights. They also complete questionnaires at the start and end of the data collection period. The first questionnaire collects clinical and demographic information, while the second questionnaire asks about their experiences using each device. 

A diagram showing a graphic of a questionnaire paper with an arrow pointing to a graphic of a person with a thermometer graphic at their pelvis and their hand to represent temperature collection. An arrow then points to another questionnaire paper and another arrow points to a line graph to represent data collection.
Diagram illustrating the design of our study. 

After receiving ethical approval, we began recruitment through university networks and successfully enrolled our target of fifteen participants. We have finished collecting data for seven participants and are in the process of collecting data for three participants, with five due to start in the next couple months.  

To ensure good research practice, we have been developing a detailed data analysis plan before beginning any actual analysis. This pre-specified plan outlines how the data will be processed and analysed, helping to reduce bias and ensure transparency. We will compare nightly temperature readings from both devices to assess how closely they compare. Additionally, we will analyse questionnaire responses to evaluate how acceptable participants found each device, for example, how willing they would be to participate in future research studies using these devices. 

Future plans for the project 

Data collection is on track to conclude by November 2025. The next steps include analysing the collected temperature and questionnaire data and then writing up our findings for publication.  

Our findings will help inform appropriate use of these technologies in future studies on reproductive disorders, menstrual health and fertility. 


Contact Details

Blog post written by Miss Sarah Koerner sarah.koerner@bristol.ac.uk and Dr Louise AC Millard louise.millard@bristol.ac.uk

Study researchers: Miss Sarah Koerner, Dr Louise AC Millard (PI), Professor Tom R Gaunt, Dr Kayleigh Easey, Dr Robyn Wootton, Professor Jon Heron. 

Meet the Ask-JGI team – Adrianna, Fahd, Yujie & Huw

The new Ask-JGI helpdesk cohort started in September 2024 and have been busy answering queries from researchers across the university! We introduced half of the team in our January blog. Meet the other half of the team below:

Adrianna Jezierska (she/her) – Ask-JGI PhD Student

Headshot of Adrianna Jezierska
Adrianna Jezierska, PhD candidate in in the School of Business

I’m a PhD student at the University of Bristol Business School. My project focuses on social media influencers and their vegan content on YouTube. Using language derived from video transcripts, I analyse to what extent they legitimise veganism so that it becomes popular and desirable in society. Whilst most organisation and management scholars have developed theories based on qualitative data, resulting in small datasets and case study approaches, in my work, I highlight the role of computational social sciences and big data in helping social scientists answer their research questions.

Coming from a social science background, I was initially hesitant about joining the Ask-JGI team. However, this decision has turned out to be the most rewarding and challenging experience. Being part of the team is a continuous learning journey. The questions we receive span various disciplines, often pushing us out of our comfort zones. The most exciting part of the job is the opportunity to communicate with other researchers and receive their positive feedback. On the other hand, we constantly collaborate with other team members and learn from each other, which makes it a very supportive environment. I’m pleased to see more queries from social scientists and humanities researchers. The growing popularity of computational approaches and the shift towards interdisciplinary research is a trend that I find inspiring and exciting

Fahd Abdelazim (he/him) – Ask-JGI PhD Student

Headshot of Fahd Abdelazim
Fahd Abdelazim, PhD student on the Interactive AI CDT in the School of Computer Science

I am a PhD student in the Interactive Artificial Intelligence CDT, specializing in model understanding for Vision-Language models. My research focuses on introducing improvements to Vision-Language models that allow for better linking of specific ideas or attributes to physical items, in order to help models recognize and understand the properties of objects in images.

I first heard of the Ask-JGI team through fellow PhD students, and it was recommended to me as a way to apply data science skills to real-world applications. Joining the Ask-JGI helpdesk has been a unique experience where I’ve been able to delve into various domains and learn about topics that I would otherwise not have had the chance to learn about. The team truly values cross-functional collaboration and encourages tackling new challenges and learning on the job.

Working at Ask JGI is incredibly rewarding. I enjoy the diversity of challenges presented by each query which gives me the chance to improve as a data scientist and gain a better understanding of how data science can help improve academic research. I really enjoy the collaborative spirit within the team. The Ask-JGI team are from many different disciplines and interacting with them allows for interesting exchanges of ideas and problem-solving approaches. This allows me to grow not just as a data scientist but as a researcher as well.

Yujie Dai (she/her) – Ask-JGI PhD Student

Headshot of Yujie Dai
Yujie Dai, PhD student in the Digital Health and Care CDT

I am a PhD student in the Digital Health and Care CDT, specializing in population health data science. My research focuses on leveraging large-scale real-world health data to address critical challenges in infectious diseases. Specifically, I utilize explainable AI (XAI) techniques to characterize and diagnose diseases, aiming to bridge the gap between data science and public health.

 My journey with Ask-JGI began with a recommendation from a friend who was previously part of the team. They spoke highly of the collaborative and dynamic environment, and I was intrigued by the opportunity to apply my skills in real-world research settings. Joining Ask-JGI is an extension of my academic and research pursuits. I was drawn to the idea of supporting researchers across diverse disciplines, helping them navigate technical challenges in their projects, and learning from their different perspectives. The chance to engage with cutting-edge problems and contribute to solutions beyond the scope of my own research was exciting.

There’s so much to love about being part of Ask JGI. I love the variety of work. Each question I encounter presents a new challenge, whether it’s developing a data analysis pipeline, troubleshooting code, or brainstorming creative solutions for a computational problem. The variety keeps me constantly learning and growing as a data scientist. I also love the collaborative atmosphere. Working closely with researchers from different fields gives me diverse ways of thinking and problem-solving. It’s an opportunity to not only apply my skills but also to know more about the scientific community.

Huw Day (he/him) – Ask-JGI Lead

Headshot of Huw Day
Huw Day, JGI Data Scientist

I am a JGI Data Scientist with a background in mathematics, working on a variety of data science projects with researchers across the university using a variety of data science methodologies and techniques. I also help run the Data Ethics Club.

As Ask-JGI Lead, I am responsible for recruiting, training and the general managing of the Ask-JGI team. They’re a fantastic group and I consider myself really lucky to be able to work with them. I support some of the general queries and I’m also responsible for talking with researchers interested in costing out data science support in grant applications.

To me, the Ask-JGI helpdesk is based on the idea that any researcher who wants to do data science should be empowered to do so. Whilst we often do the data science for people, I think the most rewarding outputs from our helpdesk is when we empower researchers to do data science themselves, guiding and validating their work. It’s also a wonderful opportunity for myself and the rest of the helpdesk to learn about research across the university.


All University of Bristol researchers (including PhDs) are entitled to a day of free data science support from the Ask-JGI helpdesk. Just email ask-jgi@bristol.ac.uk with your query and one of our team will get back to you to see how we can support you.

If you’re a PhD student interested in joining the Ask-JGI team, we will do recruiting for the next academic year in summer of 2025 so keep an eye on the JGI mailing list for when we have our recruiting call. We recruit a new cohort every year but do not accept speculative applications outside of the recruiting call.

Meet the Research Data Advocate team

We are delighted to announce a new pilot training scheme led by our newly-appointed JGI Research Data Science Advocates. This is a new way to take part in training in a low-stress, collaborative and supportive environment, and at the same time form a community of data scientists in your area. 

The pilot will run JGI training events over a whole week in Schools, supported by a local Data Science Advocate. They will run sessions to support a cohort to undertake the training together, over the course of a week. The formal training takes only around 2-3 hours to complete, but it is anticipated that this format will allow deeper learning and more useful application to research.  

To take part in the pilot (which is aimed at relatively inexperienced coders within a discipline), please email to jgi-training@bristol.ac.uk. If your school doesn’t have a volunteer, you would be welcomed at a research-adjacent community. Bios for our Advocates are below and even if you don’t need this particular training, they would love to include you in an ongoing data science community, so please get in touch. 

Ruolin Wu

Headshot of Ruolin Wu

I am a PhD student of paleobiology diving into the mysteries of evolutionary history. Armed with code, fossils, and molecular data, I craft stories about topological and temporal pattern of animals and plants. Outside of academia, I like climbing, handcrafts, succulents and ferns of any kind.

Zhiyuan Xu

Headshot of Zhiyuan Xu

I am a 1st year PhD student focusing on data science and artificial intelligence, with a particular focus on large language models and their applications. My background includes experience in machine learning, data-driven research, and interdisciplinary collaboration to address complex problems.

Bryony Clifton

Headshot of Bryony Clifton

I’m a PhD student in Biochemistry, studying the molecular details underpinning neurotransmission. My project focuses on identifying the biological role for an uncharacterised intramembrane protease found in the human brain. During my PhD, I have become aware of the importance of developing tools to present complex datasets in a clear and informative way. I am excited to begin my role with the JGI where I can support others to build these skills too.

Catherine Upex

Headshot of Catherine Upex

I’m Catherine and I’m a first year PhD student based in the medical school. I’m using data science and AI to understand the shape and movement patterns of the heart over different disease states. I’m also currently working on a mini-project using AI protein folding tools, like AlphaFold, and computer simulations to uncover interactions between synthetic cannabinoids and the hERG potassium channel and its relation to arrythmia risk.

Kaan Deniz

Headshot of Kaan Deniz

Aerospace Engineer who has intensive industrial experience in numerical modelling with a MSc degree from the University of Bristol/ Aerospace Engineering.  Current PhD student in Aerospace Engineering at the University of Bristol. Research focus is numerical modelling of composite manufacturing processes. 

Boy Li

Headshot of Boy Li

I study how to synergize domain-specific knowledge with data-driven deep learning models to extract information from remote sensing imagery.

Vaishnudebi Dutta

Headshot of Vaishnudebi Dutta

I am an Engineering Mathematics PhD student working on model and data-driven design of combination therapies for non-small cell lung cancer. Beyond my research, I serve as the School of Engineering Mathematics and Technology (SEMT) PhD Student Representative, advocating for and supporting the academic community. I also hold a key position as the PhD Representative for the Bristol Cancer Research Network where I get the opportunity to share research updates to Clinicians, and others in the network. Additionally, I manage the network’s official X (formerly Twitter) presence, helping to disseminate research developments and maintain engagement with the broader scientific community.

Zhengzhe Peng

Headshot of Zhengzhe Peng

I am a PhD student with a diverse background in computer science, business, and over a year of IT work experience. My research applies advanced data science methods, with a focus on AI, to explore real-world challenges. I am dedicated to expanding my knowledge in these fields and eager to help others who are new to data science, working together to advance and explore new possibilities in this ever-evolving domain.

Winfred Gatua

Headshot of Winfred Gatua

Winfred Gatua is a PhD Fellow at the University of Bristol, specializing in Molecular Genetics and Life Course Epidemiology. Her research focuses on the triangulation of evidence between Mendelian randomization and randomized controlled trials for complex diseases. She holds an MSc in Bioinformatics, a Postgraduate Diploma in Health Research Methods, and a BSc in Biomedical Science and Technology. Transitioning from wet lab biomedical sciences to dry lab bioinformatics, Winfred is a self-taught coder passionate about open science, automation, and reproducible research in genetics. Beyond research, Winfred is dedicated to capacity building, particularly in increasing computational and data literacy among non-computer science researchers. Since 2021, she has been a volunteer instructor with The Carpentries, securing funding, hosting and instructing carpentries lessons that equip researchers with essential skills in data analysis, open science, reproducible research and best practices in scientific computing in different institutions across the globe.

The Royal Statistical Society Annual Conference 2024

The Royal Statistical Society meets annually for their internationally attended conference. It serves as the UK’s annual showcase for statistics and data science. This year they met in Brighton for a conference attended by over 600 attendees from around the world, including JGI Data Scientist Dr Huw Day.

The conference had over 250 presentations, including contributed talks, rapid-fire talks, and poster presentations. At any one time, there could be as many as 6 different talks going on, so it was impossible to go to everything but below are some of Huw’s highlights of the conference.

Pre-empting misunderstandings is part of trustworthy communication

From left to right; Dr Huw Day, Professor Sir David Spiegelhalter and Dr Simon Day
From left to right; Dr Huw Day, Professor Sir David Spiegelhalter and Dr Simon Day (RSS Fellow and Huw’s dad) at the RSS International Conference 2024.

As part of a session on communicating data to the public, Professor Sir David Spiegelhalter talked about his experiences trying to pre-bunk misinformation when displaying data.

Data in June 2021 showed that the majority of COVID deaths are in the vaccinated group. The Brazilian president President Jair Bolsonaro used this data to support claims that Covid vaccines are killing people. Spiegelhalter and his colleague Anthony Masters tried explaining why this wasn’t a sign the vaccine was bad in an article in The Observer “Why most people who now die with Covid in England have had a vaccination”.

Consider the following analogy: most car passengers who die in car accidents are wearing seatbelts. Intuitively, we understand that just because these two variables are associated, it doesn’t mean that one causes the other. Having a story like that means you don’t have to talk about base rates, stratification or even start to use numbers in your explanations.

We should try to make the caveats clearer of data before we present them. We should be upfront from what you can and can’t conclude from the data.

Spiegelhalter pointed to an academic paper: “Transparent communication of evidence does not undermine public trust in evidence” where participants were shown either persuasive or balanced messages about the benefits of Covid vaccines and nuclear power. It’s perhaps not surprising to read that those who already had positive opinions about either topic continued to have positive views after reading either messages. Far more interesting is that the paper concluded that “balanced messages were consistently perceived as more trustworthy among those with negative or neutral prior beliefs about the message content.”

Whilst we should pre-empt misconceptions and caveats, being balanced and more measured might prove to be an antidote to those who are overly sceptical. Standard overly positive messaging is actively reducing trust in groups with more sceptical views.

Digital Twins of the Human Heart fueled Synthetic 3D Image Generation

Digital twins are a digital replica/simulator of something from the real world. Typically it includes some sort of virtual model which is informed by real world data.

Dr Dirk Husmeiser at the University of Glasgow has been exploring the application of digital twins of the human heart and other organs to investigate behaviour of the heart during heart attacks, as well as trying to use ultrasound to measure blood flow to estimate pulmonary blood pressure (blood pressure in the lungs). Usually, measuring pulmonary blood pressure is an extremely invasive procedure, so using ultrasound methods has clear utility.

One of the issues of building a digital twin is having data about what you’re looking at. In this case, the data looks like MRI scans of the human heart, taken at several “slices”. Because of limitations in existing data, Dr Vinny Davies and Dr Andrew Elliot, (both colleagues of Husmeiser at the University of Glasgow)have been attempting to develop methods of making synthetic 3D models of the human heart, based on their existing data. They broke the problem down into several steps, working to generate synthetic versions of the slices of the heart (which are 2D images) first.

The researchers were using a method called Generative Adversarial Networks (GANs), where two systems compete against each other. The generator system generates the synthetic model and the discriminator system tries to distinguish between real and synthetic images. You can read more about using GANs for synthetic data generation in a recent JGI blog about Chakaya Nyamvula’s JGI placement.

Slide on “Generating Deep Fake Left Ventricle Images for Improved Statistical Emulation”.
A slide from Dr Vinny Davies and Dr Andrew Elliot’s talk on “Generating Deep Fake Left Ventricle Images for Improved Statistical Emulation”. The slide depicts how progressive GANs work, where the generator learns how to generate smaller, less detailed images first and gradually improves until it can reproduce 2D slices of MRIs of the human heart.

Because the job of the generator is far harder than that of the discriminator (consider the task of reproducing a famous painting, versus spotting the difference between an original painting and a version drawn by an amateur), it’s important to find ways to make the generator’s job easier early on, and the discriminator’s job harder so that the two can improve together.

The researchers used a method called a Progressive GAN. Initially they gave the generator the task of drawing a lower resolution version of the image. This is easier and so the generator did easier. Once the generator could do this well, they then used the lower resolution versions as the new starting point and gradually improved the correlation. Consider trying to replicate a low resolution image – all you have to do is colour in a few squares in a convincing way. This then naturally makes the discriminator job’s harder, as it’s tasked with telling the difference between two, extremely low resolution images. This allows the two systems to gradually improve in proficiency.

The work is ongoing and the researchers at Glasgow are looking for a PhD student to get involved with the project!

Data Hazards

On the last day of the conference, Huw alongside Dr Nina Di Cara from the School of Psychology at the University of Bristol presented to participants about the Data Hazards project.

Participants (including Hadley Wickam, keynote speaker and author of the famous R package tidyverse) were introduced to the project, shown examples of how it has been used and then shown an example project where they were invited to take part in discussions about which different data hazards might apply and how you might go about mitigating for those hazards. They also discussed the importance of focussing on which hazards are most relevant and prominent.

Dr Huw Day (left) and Dr Nina Di Cara in front of a screen that says 'Data Hazards Workshop'
Dr Huw Day (left) and Dr Nina Di Cara (right) about to give their Data Hazards workshop talk at the RSS International Conference 2024.

All  the participants left with their own set of the Data Hazard labels and a new way to think about communicating hazards of data science projects, as well as invites to this term’s first session of Data Ethics Club.