Artificial Intelligence (AI) Systems Conduct Research in Chemistry Autonomously  

Scientists have successfully integrated latest AI tools (e.g. GPT-4) with automation to develop ‘systems’ capable of autonomously designing, planning and performing complex chemical experiments. ‘Coscientist’ and ‘ChemCrow’ are two such AI-based systems developed recently that display emergent capabilities. Driven by GPT-4 (the latest version of generative AI of OpenAI), Coscientist demonstrated advanced reasoning and experimental design capabilities. ChemCrow effectively automated a set of tasks and executed discovery and synthesis of chemical agents. ‘Coscientist’ and ‘ChemCrow’ offer a new way of conducting research synergistically in partnership with machines and can come handy in executing experimental tasks in automated robotic laboratories.  

Generative AI Is about creation or generation of new contents by a computer programme. Google Translate that came into being 17 years ago in 2007 is an example of generative artificial intelligence (AI). It generates translations (output) from a give language (input). OpenAI’s ChatGPT , Microsoft’s Copilot, Google Bard, Meta (formerly Facebook) ’s Llama , Elon Musk’s Grok etc are some of important AI tools currently available.  

ChatGPT, launched last year on 30 November 2022, has become very popular. It is said to have acquired 1 million users within 5 days and 100 million monthly users within two months. ChatGPT is based on a large language model (LLM). The key principle is language modelling i.e. pre-training the model with the data so that the model predicts what comes next in the sentences when prompted. A language model (LM) thus makes a probabilistic prediction of the next word in a natural language given preceding one(s). When based of neural network, it is called ‘neural network language model’ in which case data is processed in a way like in the human brain. A large language model (LLM) is a large-scale model that can perform a variety of natural language processing tasks for general-purpose language understanding and generation. Transformer is neural network architecture used to build ChatGPT. The name ‘GPT’ is acronym for ‘Generative pre-trained Transformer’. OpenAI uses transformer-based large language models.  

GPT-4, ChatGPT’s fourth version, was released on 13 March 2023. Unlike earlier versions which accept only text inputs, GPT-4 accepts both image and texts inputs (hence the prefix Chat is not used for fourth version). It is a large multimodal model. GPT-4 Turbo, launched on 06 November 2023, is an improved and more powerful version of GPT-4.  

Coscientist is made up of five interacting modules: planner, web searcher, code execution, documentation and automation. These modules exchange messages with each other for web and documentation search, code execution and performance of experiments. The interaction is through four commands – ‘GOOGLE’, ‘PYTHON’, ‘DOCUMENTATION’ and ‘EXPERIMENT’.  

The planner module is the main module. It is driven by GPT-4 and is tasked with planning. Based on simple pain text prompt from the user, the planner issues necessary commands to other modules to collect knowledge. The web searcher module which also is a LLM is invoked by the GOOGLE command to search internet and related sub-actions for effective planning. The code execution module performs code execution through PYTHON command. This module does not use any LLM. Documentation module acts through DOCUMENTATION command to retrieve and summarise necessary documentation. Based on this, the planner module invokes EXPERIMENT command to the automation module for performance of experiments.  

On appropriate prompt, Coscientist synthesised painkillers paracetamol and aspirin and the organic molecules nitroaniline and phenolphthalein and many other known molecules correctly. The planner module could optimise reactions for the best reaction yields.  

In another study, an LLM chemistry agent ChemCrow autonomously planned and synthesised an insect repellent, three organocatalysts, and guided the discovery of a novel chromophore. ChemCrow was effective in automating diverse chemical tasks.  

The two non-organic, artificial intelligent systems, Coscientists and ChemCrow display the emergent capabilities of autonomous planning and executing chemical tasks for synthesis of known molecules and discovery of novel molecules. They have advanced reasoning, problem solving and experimental design capabilities which can come handy in chemical research.  

Such AI agent systems can be utilised by non-experts for executing routine tasks in chemistry thus reducing cost and efforts. They also have potential to fasten discovery of new molecules  

*** 

References:  

  1. Boiko, D.A., et al 2023. Autonomous chemical research with large language models. Nature 624, 570–578. Published: 20 December 2023. DOI: https://doi.org/10.1038/s41586-023-06792-0  
  2. Carnegie Mellon University 2023 News – CMU-Designed Artificially Intelligent Coscientist Automates Scientific Discovery. Posted 20 December 2023. Available at https://www.cmu.edu/news/stories/archives/2023/december/cmu-designed-artificially-intelligent-coscientist-automates-scientific-discovery  
  3. Bran AM, et al 2023. ChemCrow: Augmenting large-language models with chemistry tools. arXiv:2304.05376v5. DOI: https://doi.org/10.48550/arXiv.2304.05376 

*** 

Introductory lectures on AI:

***

Latest

Brain-Computer Interfaces (BCI): Towards Humans’ Merger with AI 

The ongoing clinical trials of Brain-Computer Interfaces (BCIs) such...

Tumour Treating Fields (TTFields) approved for Pancreatic cancer

Cancer cells have electrically charged parts hence are influenced...

Scientific European invites Co-founder

Scientific European (SCIEU) invites you to join as a Co-Founder and investor, with both...

Future Circular Collider (FCC): CERN Council reviews Feasibility Study

The quest for the answers to the open questions (such as, which...

Chernobyl Fungi as Shield Against Cosmic Rays for Deep-Space Missions 

In 1986, the 4th unit of Chernobyl Nuclear Power Plant in Ukraine...

Myopia Control in Children: Essilor Stellest Eyeglass Lenses Authorised  

Myopia (or near-sightedness) in children is a highly prevalent...

Newsletter

Don't miss

New Nanofiber Dressing for Efficient Wound Healing

Recent studies have developed new wound dressings which accelerate...

MHRA Approves Moderna’s mRNA COVID-19 Vaccine

Medicines and Healthcare products Regulatory Agency (MHRA), the regulator...

Remembering Professor Peter Higgs of Higgs boson fame 

British theoretical physicist Professor Peter Higgs, renowned for predicting...

What caused the Mysterious Seismic Waves Recorded in September 2023 

In September 2023, uniform single frequency seismic waves were...

A Novel Method Which Could Help Forecast Earthquake Aftershocks

A novel artificial intelligence approach could help predict location...

Correcting Genetic Conditions in Unborn Babies

Study shows promise for treating genetic disease in a...
Umesh Prasad
Umesh Prasad
Umesh Prasad is a researcher-communicator who excels at synthesizing peer-reviewed primary studies into concise, insightful, and well-sourced public articles. A specialist in knowledge translation, he is driven by a mission to make science inclusive for non-English speaking audiences. Toward this goal, he founded “Scientific European,” this innovative, multilingual, open-access digital platform. By addressing a critical gap in global science dissemination, Prasad acts as a key knowledge curator whose work represents a sophisticated new era of scholarly journalism, bringing the latest research to the doorstep of common people in their native languages.

Brain-Computer Interfaces (BCI): Towards Humans’ Merger with AI 

The ongoing clinical trials of Brain-Computer Interfaces (BCIs) such as Neuralink’s “Telepathy” implant involve establishing communication links between the brains of participants who have unmet medical needs due...

Tumour Treating Fields (TTFields) approved for Pancreatic cancer

Cancer cells have electrically charged parts hence are influenced by electric fields. Application of alternating electric fields (TTFields) to solid tumours selectively target and...

Scientific European invites Co-founder

Scientific European (SCIEU) invites you to join as a Co-Founder and investor, with both strategic investment and active contribution in shaping its future direction.  Scientific European is an England-based media outlet providing multilingual...

LEAVE A REPLY

Please enter your comment!
Please enter your name here

For security, use of Google's reCAPTCHA service is required which is subject to the Google Privacy Policy and Terms of Use.

I agree to these terms.