Predicting selective herbicide activity with machine learning

Author

Bailey Montefiore

The agrochemical industry is facing growing challenges around resistance, stringent regulations, and pressures to reduce the time and cost of development. Teams are being asked to develop better active compounds faster and cheaper.

We recently presented a poster at SCI’s Innovation in Crop Protection, showing a case study that uses deep learning imputation to select active molecules more accurately. This allows agrochemical teams to make better resourcing decisions by prioritising which molecules to submit to costly screening experiments.

You can view the full study here or download the poster pdf.

The need for selective weed control

Selectivity was key in this project. Compounds needed to effectively control a broadleaf weed species while protecting corn and soybean crops. The need to balance multiple activities makes compound optimisation a more complicated process.

Deep learning imputation: Optimising compounds with machine learning

In this work, we applied Cerella to train machine learning models on physicochemical properties, in vitro assay data, and bioactivity data spanning the development pipeline.

Unlike traditional QSAR methods that can struggle with sparse datasets, Cerella’s deep learning approach can handle missing data by learning the relationships between experimental endpoints.
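To make the idea of learning relationships between endpoints concrete, here is a deliberately simple sketch. It is not Cerella's algorithm, which is a deep learning method; it only illustrates the principle with a straight-line fit between two endpoints. The endpoint names (`related_species`, `target_species`) and all values are hypothetical.

```python
from statistics import mean


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = a * x + b."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    a = sxy / sxx
    return a, my - a * mx


def impute_from_endpoint(rows, source, target):
    """Fill missing `target` values from `source`, using rows where both were measured."""
    paired = [(r[source], r[target]) for r in rows
              if r[source] is not None and r[target] is not None]
    a, b = fit_line([p[0] for p in paired], [p[1] for p in paired])
    filled = []
    for r in rows:
        r = dict(r)  # copy so the input rows are left unchanged
        if r[target] is None and r[source] is not None:
            r[target] = a * r[source] + b
        filled.append(r)
    return filled


# Hypothetical sparse data: None marks an experiment that was never run.
rows = [
    {"related_species": 1.0, "target_species": 3.0},
    {"related_species": 2.0, "target_species": 5.0},
    {"related_species": 3.0, "target_species": 7.0},
    {"related_species": 4.0, "target_species": None},
]
filled = impute_from_endpoint(rows, "related_species", "target_species")
```

A real imputation model would learn from many endpoints at once and capture non-linear relationships, but the core idea is the same: measured values in one column carry information about the gaps in another.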

Incorporating multi-parameter optimisation

We combined Cerella’s predictions with multi-parameter optimisation to create a comprehensive scoring profile. Our approach evaluated compounds across three dimensions: activity against the target broadleaf weed species, safety to corn crops, and safety to soybean crops.

This time-saving approach allows chemists to identify compounds that meet all success criteria simultaneously, rather than optimising individual properties in isolation.
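As an illustration of how three criteria can be folded into one score, the sketch below uses linear desirability functions combined by multiplication. This is one common way to build a multi-parameter score, not necessarily the scoring profile used in the study. The function names, the choice of percentage scales, and every threshold are assumptions made for the example.

```python
def desirability(value, low, high):
    """Linear ramp: 0 at or below `low`, 1 at or above `high`."""
    if value <= low:
        return 0.0
    if value >= high:
        return 1.0
    return (value - low) / (high - low)


def mpo_score(weed_control, corn_injury, soybean_injury):
    """Combine three criteria into a single score between 0 and 1.

    All inputs are assumed to be percentages. The thresholds are
    hypothetical and are not taken from the study.
    """
    d_weed = desirability(weed_control, 50.0, 90.0)         # higher control is better
    d_corn = 1.0 - desirability(corn_injury, 10.0, 30.0)    # lower crop injury is better
    d_soy = 1.0 - desirability(soybean_injury, 10.0, 30.0)  # lower crop injury is better
    # Multiplying means a failure on any single criterion pulls the score to zero.
    return d_weed * d_corn * d_soy


selective = mpo_score(weed_control=95.0, corn_injury=5.0, soybean_injury=5.0)
damages_corn = mpo_score(weed_control=95.0, corn_injury=40.0, soybean_injury=5.0)
```

The multiplicative form reflects the selectivity requirement: a compound that controls the weed well but injures corn scores zero, however strong its activity.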

Better performance: Deep learning vs traditional QSAR

When compared to Random Forest QSAR models, Cerella’s deep learning approach showed:

Better correlation between predicted and measured scores

Higher discrimination power with an area under the curve (AUC) of 0.91 versus 0.84 for Random Forest

Robust uncertainty estimates that enable confident elimination of poor compounds
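For readers less familiar with AUC, it can be read as the probability that a randomly chosen active compound is scored above a randomly chosen inactive one. The sketch below computes it directly from that definition. The labels and scores are made up for illustration and are unrelated to the study's data.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via pairwise comparison of actives and inactives.

    Each (active, inactive) pair counts 1 if the active is scored higher,
    0.5 if the scores tie, and 0 otherwise.
    """
    pos = [s for l, s in zip(labels, scores) if l]
    neg = [s for l, s in zip(labels, scores) if not l]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical example: 1 = meets the success criteria, 0 = does not.
labels = [1, 0, 1, 0]
scores = [0.9, 0.8, 0.3, 0.1]
auc = roc_auc(labels, scores)
```

An AUC of 0.5 corresponds to random ranking and 1.0 to perfect separation, which is the scale on which the 0.91 and 0.84 figures above sit.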

Cerella’s most confident low-scoring predictions also corresponded to genuinely poor experimental outcomes. This means teams can confidently deprioritise compounds that will likely fail and avoid wasting valuable time and cost.
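One way to turn uncertainty estimates into a deprioritisation rule is sketched below: a compound is dropped only if even an optimistic estimate of its score stays below the success threshold. The rule, the threshold, the multiplier `k`, and the compound data are all assumptions for illustration rather than the procedure used in the study.

```python
def confidently_poor(predicted, std, threshold, k=2.0):
    """True when an optimistic estimate (prediction + k * std) is still below the threshold."""
    return predicted + k * std < threshold


def triage(compounds, threshold=0.3, k=2.0):
    """Split (name, predicted_score, std) tuples into compounds to keep and to drop."""
    keep, drop = [], []
    for name, predicted, std in compounds:
        if confidently_poor(predicted, std, threshold, k):
            drop.append(name)
        else:
            keep.append(name)
    return keep, drop


# Hypothetical predictions: (compound, predicted score, uncertainty).
compounds = [
    ("A", 0.05, 0.05),  # low score, low uncertainty: safe to drop
    ("B", 0.10, 0.20),  # low score, high uncertainty: keep for testing
    ("C", 0.80, 0.10),  # high score: keep
]
keep, drop = triage(compounds)
```

Compound B shows why the uncertainty matters: its predicted score is low, but the model is not confident enough to rule it out, so it stays in the screening queue.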

Cross-species machine learning

One interesting finding was Cerella’s ability to leverage data from related broadleaf species to predict activity against the target species. Related species data proved more informative than even preliminary screening data, highlighting the value of cross-species learning in this approach.

Practical impact

This machine learning approach using Cerella offers a way to reduce testing costs by eliminating poor compounds early with confidence. It enables better compound prioritisation through accurate predictions and mitigates risk through uncertainty-aware decision making.
