Behind the Paper: Ensemble ML and SHAP for hybrid polypropylene composites

We asked if tensile strength and stiffness of hybrid PP can be forecast from small, costly datasets. Here’s why we used an ensemble + SHAP workflow, what we learned, and how it suggests practical composition windows for real parts.

Published in Materials

Like

Share this post

Choose a social network to share with, or copy the URL to share elsewhere

This is a representation of how your post may appear on social media. The actual post will vary between social networks

Explore the Research

SpringerLink
SpringerLink SpringerLink

Proof-of-concept ensemble machine-learning forecasting of tensile properties in hybrid polypropylene composites reinforced with flax, basalt, and rice husk powder - Discover Materials

Accurate forecasting of tensile properties is important for efficient design of eco-friendly composites. We present a proof-of-concept ensemble workflow to predict tensile strength and tensile modulus of hybrid polypropylene (PP) composites reinforced with long flax fiber bundles, basalt fibers (BF), and rice husk powder (RHP). A lab-scale dataset (n = 65) was generated under standardized testing. Preprocessing (Savitzky–Golay denoising, feature standardization) preceded Optuna-tuned support vector regression (SVR) and XGBoost, whose predictions were combined via a stacked linear meta-learner. Under ten-fold cross-validation, the ensemble achieved R2 = 0.881 (RMSE = 0.639 GPa) for modulus and R2 = 0.907 (RMSE = 1.569 MPa) for strength. The framework is a computationally efficient complement to simulation-based analyses for early-stage screening within the explored domain. This is a proof-of-concept study based on a small, single-lab dataset (n = 65) without external validation; future work will enlarge the dataset across laboratories, extend to additional properties, and incorporate independent validation. Within the explored domain, the workflow yields actionable composition windows (e.g., 30–35 wt% BF, 4–6 wt% RHP, 3–5 flax plies) that balance stiffness and strength.

What was the problem?
Polypropylene (PP) is everywhere—from appliance housings to automotive trim—but tuning its mechanical performance with eco-friendly reinforcements is still a balancing act. We explored a hybrid system of long flax fiber bundles (LFF), basalt fibers (BF), and rice husk powder (RHP). Each ingredient pulls properties in different directions: BF boosts stiffness, flax can raise strength at modest loadings, and RHP helps processability and sustainability but can soften the matrix if overused. Running a full factorial campaign would be slow and costly, so we asked: could machine learning help us map the design space with only a compact Box–Behnken plan?

Why ensembles, not a single model?
Small materials datasets are noisy and rarely linear. Single learners often overfit, especially when variables interact. We therefore stacked two strong but complementary base learners and tuned capacity carefully with cross-validation. Stacking reduces variance by letting models “vote,” while still capturing non-linear trends. We also kept a strict separation of training, validation, and a 20% hold-out set so our reported skill reflects true generalization inside the tested route.

How did we keep it honest?
Before modeling, we treated the data like we would treat a specimen: inspect it. Histograms, KDEs and boxplots ensured every factor level was represented and flagged one clear modulus outlier (>10 GPa). Pair plots helped us see raw tendencies (%BF and %RHP versus tensile strength/modulus) without hiding behind a model. During training, we used k-fold cross-validation and capped complexity (depth, estimators, regularization) to avoid optimistic results.

Why SHAP (and friends)?
A prediction is only useful if engineers can act on it. SHAP values provide per-feature, per-sample attributions for tree models, telling us how much each variable nudged a prediction up or down. We paired SHAP with permutation importance (model-agnostic), partial dependence (PDP) and accumulated local effects (ALE) to see both global and local behavior. The takeaway was intuitive and actionable: BF dominates modulus gains, but strength benefits from a balanced trio—too much of any one component quickly flattens returns. These insights align with micromechanics expectations about stiffness transfer and embrittlement at high rigid-filler content.

What did we learn for design?
The ensemble reproduced measured trends on cross-validation and on the unseen hold-out set. From the SHAP/PDP landscape we highlighted composition windows where tensile strength improves ~2× over neat PP while modulus climbs strongly: moderate BF with supportive flax plies and controlled RHP. In practice, this narrows down trial-and-error—teams can start within these windows, then fine-tune around processing realities such as fiber length retention or porosity.

Limits you should care about
Our goal wasn’t a universal model for “any” PP composite. We fixed processing (extrusion → hot press → injection molding) to isolate composition effects. That means the model explains variance due to %BF, %RHP, and flax ply count within this route; jumping to a very different line or tool will change the microstructure, and models should be retrained or updated. Also, small materials datasets make uncertainty quantification valuable; future work could add conformal prediction intervals so designers see both a forecast and its confidence band.

Why this matters beyond our system
The workflow—compact DOE, capacity-controlled ensembles, and transparent explanations—generalizes to many data-limited materials problems: bio-fillers in thermoplastics, multi-phase binders, or even cementitious systems. The key is to pair domain priors with honest diagnostics and keep explanations close to mechanisms engineers trust.

A note on sustainability
Flax and rice-husk powder are renewable or waste-derived. Being able to forecast properties with fewer physical trials lowers energy, labor, and scrap. In other words, better data practices are also greener practices.

Where to read the paper
The research article is open access in Discover Materials: DOI 10.1007/s43939-025-00406-4. A shareable link is available via Springer Nature’s initiative: rdcu.be/eQUMB.

Please sign in or register for FREE

If you are a registered user on Research Communities by Springer Nature, please sign in

Follow the Topic

Composites
Physical Sciences > Chemistry > Materials Chemistry > Composites
Composites
Physical Sciences > Materials Science > Structural Materials > Composites
Materials Characterization Technique
Physical Sciences > Materials Science > Materials Characterization Technique
Materials Chemistry
Physical Sciences > Chemistry > Materials Chemistry
Structural Materials
Physical Sciences > Materials Science > Structural Materials
Plant Materials
Physical Sciences > Chemistry > Materials Chemistry > Plant Materials

Related Collections

With Collections, you can get published faster and increase your visibility.

Exclusive Papers of Editorial Board Members

Note that articles in this collection are by Editorial Board Members only.

Publishing Model: Open Access

Deadline: Ongoing

Advanced Functional Materials: Disordered Systems, Nanoparticles, and Emerging Application in Energy, Photonics, and Security

Disordered systems, nanostructured materials, and functional composites play a pivotal role in advancing next-generation technologies. With growing demands for high-performance materials in energy, photonics, and security applications, understanding and harnessing the physical properties of complex and disordered systems is crucial. Research in nanoparticle design and synthesis, light scattering phenomena, and thermo-optical conversion mechanisms offers promising solutions for efficient radiative cooling, energy harvesting, and heat management. Additionally, the development of anticounterfeiting technologies and physical unclonable functions (PUFs) is key to ensuring secure identification and authentication in a digitized world.

This Collection invites contributions from researchers, engineers, and material scientists to share breakthroughs, methodologies, and practical innovations in these areas. We welcome submissions related, but not limited to, disordered systems, functional nanoparticles, advanced scattering techniques, novel radiative cooling materials, thermo-optical conversion devices, anticounterfeiting technologies, and PUF-based systems.

This Collection supports and amplifies research related to SDG 7 and SDG 9.

Keywords: Disorderd system, nanoparticles, functional materials, scattering, radiative cooling, thermooptical conversion, anticounterfeiting, physical unclonable function (PUF)

Publishing Model: Open Access

Deadline: Jun 30, 2026