Using Machine Learning to Explore the Impact of Physical Activity on Cancer Development in Golden Retrievers

Most of us know someone who has been impacted by cancer. Unfortunately, cancer is also a leading cause of death among our pet dogs.
Using Machine Learning to Explore the Impact of Physical Activity on Cancer Development in Golden Retrievers
Like

Share this post

Choose a social network to share with, or copy the URL to share elsewhere

This is a representation of how your post may appear on social media. The actual post will vary between social networks

Explore the Research

BioMed Central
BioMed Central BioMed Central

Physical activity predictors of cancer in Golden Retrievers: it’s about frequency and intensity, not type - Veterinary Oncology

Background Canine cancer is a leading cause of canine deaths, often resulting from complex interactions between germline-risk genetics, somatic mutations, and environmental exposures. To help identify major dietary, genetic, and environmental exposure risk factors for canine cancer, Morris Animal Foundation launched the Golden Retriever Lifetime Study, the first prospective longitudinal study in veterinary medicine. We hypothesized that responses from the physical activity section of the GRLS annual questionnaire could be used to develop a BiMM forest model that accurately classifies which Golden Retrievers develop cancer within the first seven years of the study. Furthermore, we expected that the most important predictors of cancer development would be the frequency and duration of the physical activity, with more rigorous activities—such as swimming—would be the most important predictors of cancer development. Methods Activity and lifestyle questionnaire data for 3,044 purebred Golden Retrievers enrolled in the Golden Retriever Lifetime Study were obtained from Morris Animal Foundation. Two BiMM forest models were developed to predict the development of cancer: the “Years 0–7” model using consistently asked questions over the seven years, and the “Years 3–7” model, which incorporated additional questions about the pace and duration of physical activity starting in study year 3. Results Of the enrolled dogs, 277 were diagnosed with cancer. The “Years 3–7” model achieved the best performance, with overall accuracy of 80.7%, a F1 score of 74.9% and a fair ROC AUC of 0.763. Key predictors of cancer development included year in study, frequency, pace, duration, and the frequency of warm and cold weather swimming. After Golden Retrievers were diagnosed with cancer, owners reported an 8–10% increase in exercise frequency and a 15.6% to 68.88% increase in cold weather swimming whereas warm weather swimming decreased by 2.0% to 13.9%. Similar declines in the pace and duration were also observed. The surface type where the exercise took place and the specific types of physical activity were lower in importance. Conclusions Including pace and duration of the physical activity improved model performance, highlighting these predictors as high importance alongside the frequency of the physical activity. Future prospective studies should seek to determine specific physical activity guidelines for dogs, focusing on frequency, duration, and pace to potentially reduce cancer risk.

How many dogs will die from cancer?

Approximately 1 in 4 dogs will die from cancer. For certain breeds, such as Golden Retrievers, that risk increases to about 1 in 2.

Can we prevent this high cancer rate?

That is a multifaceted question, and there is no single or simple answer.  Canine cancer develops due to complex interactions involving genetic factors, DNA mutations, environmental exposures, and lifestyle choices made by the owners. 

This study focuses on one small—but important—aspect of the lifestyle choices of the owners, the physical activity or exercise that they provide to their Golden Retrievers.

Why did we focus on Golden Retrievers?

A better question is, are you familiar with the Golden Retriever Lifetime Study (GRLS)?  Launched in 2012 by the Morris Animal Foundation (MAF), GRLS was the first prospective longitudinal study in veterinary medicine. More than 3,000 Golden Retrievers have been enrolled.

Each year, the enrolled Golden Retrievers receive a yearly physical by a veterinarian and have samples such as blood and feces collected for researchers.  Additionally, each year the owners and veterinarians fill out detailed surveys. These surveys and medical records provide a rich source of "big data" that offers researchers the ability to helps researchers explore links between lifestyle, environment, and disease—like cancer.

This incredible dataset had never been analyzed using machine learning, until now.

Who was involved in this project?

As a veterinarian trained in data science and analytics, I believe it’s vital to involve veterinary students in research that uses machine learning, artificial intelligence, and other data-driven methods. One of my goals is to help graduate veterinarians and veterinary students understand the value in learning these techniques and how the techniques can help enhance veterinary care within their veterinary clinics and on a wider-population level.

With this in mind, I recruited Dennis, a second year veterinary student to work on this project. He played a key role in developing the study aims and hypotheses, and received a MAF Veterinary Student Scholarship to support his role in the study.

Why did we focus on physical activity?

The GRLS provides access to numerous datasets derived from the owner surveys and medical records of the enrolled dogs, while we explored the datasets and the canine cancer literature, we noticed a gap: few—if any—studies had explored how exercise might influence cancer risk in pet dogs. 

We saw this as an opportunity to investigate this area.  Specifically, our hypothesis was that the frequency and duration of activity—especially more intense forms like swimming—would be the strongest predictors of cancer development.

What did we do?

After cleaning, combining and preparing the relevant datasets, we used a Binary Mixed Model (BiMM) Forest, an analysis approach that combines generalized linear mixed models (GLMMs), a classical statistical model, with the random forest machine learning algorithm.  This BiMM Forest model classified Golden Retrievers as either having or not having a cancer diagnosis.  The model made these classification decisions by looking at "predictors", or in other words, all the different types of physical activity as well as specific information about the physical activity (e.g. frequency of the physical activity).

What did we learn?

Our findings partially supported the hypothesis. We determined frequency, pace, and duration of physical activity were more important predictors of cancer development than the type of activity or the surface/location where the activity occurred.

Interestingly, dogs diagnosed with cancer tended to have lower activity levels before their diagnosis. After their diagnosis,  owners began to increase the frequency of the physical activity, but the duration and pace were decreased.  Want to dive deeper into our results?  Check out our newly published study in BMC Veterinary Oncology.

Please sign in or register for FREE

If you are a registered user on Research Communities by Springer Nature, please sign in

Follow the Topic

Veterinary Clinical Medicine
Life Sciences > Biological Sciences > Veterinary Science > Veterinary Clinical Medicine
Veterinary Epidemiology
Life Sciences > Biological Sciences > Zoology > Animal Science > Veterinary Epidemiology
Data Science
Mathematics and Computing > Computer Science > Artificial Intelligence > Data Science
Cancer Epidemiology
Life Sciences > Biological Sciences > Cancer Biology > Cancer Epidemiology
Machine Learning
Mathematics and Computing > Statistics > Statistics and Computing > Machine Learning
  • BMC Veterinary Oncology BMC Veterinary Oncology

    This journal will aim to cover all aspects of cancer research and clinical management in veterinary medicine. It will focus on the diagnosis, treatment, and prevention of cancer in various animal species, including but not limited to dogs, cats, horses, food animals, wildlife and exotic/zoo animals.