Skip to main content
ukiyo journal - 日本と世界をつなぐ新しいニュースメディア Logo
  • All Articles
  • 🗒️ Register
  • 🔑 Login
    • 日本語
    • 中文
    • Español
    • Français
    • 한국어
    • Deutsch
    • ภาษาไทย
    • हिंदी
Cookie Usage

We use cookies to improve our services and optimize user experience. Privacy Policy and Cookie Policy for more information.

Cookie Settings

You can configure detailed settings for cookie usage.

Essential Cookies

Cookies necessary for basic site functionality. These cannot be disabled.

Analytics Cookies

Cookies used to analyze site usage and improve our services.

Marketing Cookies

Cookies used to display personalized advertisements.

Functional Cookies

Cookies that provide functionality such as user settings and language selection.

Transforming the Future of Pharmaceutical Development! AI Mastering "Molecular Language": The Potential of Chemical LLM in Collaboration with Rhodium

Transforming the Future of Pharmaceutical Development! AI Mastering "Molecular Language": The Potential of Chemical LLM in Collaboration with Rhodium

2025年08月16日 00:56

1|Key Points of the News――"Treating Molecules as 'Language'" in Drug Discovery AI

Southwest Research Institute (SwRI) has announced a large language model (LLM) specialized in the field of chemistry called "GAMES." GAMES is designed to understand and generate the industry-standard SMILES (Simplified Molecular Input Line Entry System), which represents molecular structures as short strings, aiming to accelerate the drug discovery process, including virtual screening. The release date is set for August 14, 2025, and it has been reported by Phys.org.swri.org


GAMES is designed to work in conjunction with SwRI's molecular docking software "Rhodium." While Rhodium handles three-dimensional docking and property visualization, GAMES extends the "text side" of the search space, providing a more diverse and valid range of SMILES candidates.swri.org


2|Technical Details――Lightweight Fine-Tuning with LoRA/QLoRA, Reducing Invalid SMILES

According to SwRI, GAMES is efficiently fine-tuned using LoRA and QLoRA, reducing the hardware and energy load required for learning and inference. Experiments have shown an increase in the generation rate of valid SMILES and a decrease in invalid outputs. Although still in the early stages, it is said that integrating it into Rhodium's workflow could accelerate the generalization approach in drug design.swri.org


The background of the technical choice lies in the recent trend of treating chemistry as "strings" rather than "graphs," adapting the language capabilities of general-purpose LLMs to chemical language. Research on repurposing and expanding LLMs into chemical language models (CLM), such as SmileyLlama and SMILES-Mamba, has been increasing since 2024.


Furthermore, reports indicate that SMILES embeddings using LLaMA-based LLMs can match or surpass competing models in molecular property prediction. The direction of GAMES is positioned as an extension of this context.


3|Why "LLM Speaking SMILES" Speeds Up Drug Discovery

Drug discovery is a search problem to find "potential drug" candidates from a vast chemical space. Since SMILES can represent molecules as a continuous string, it naturally maps to the strong areas of LLMs (generation and transformation of token sequences). The vast number of candidates generated as text are passed to structure-based methods like Rhodium, where they are narrowed down through docking, property prediction, and filtering. GAMES aims to improve overall throughput by making this preliminary "candidate expansion" smarter and faster.swri.org


SwRI lists future directions such as ranking compound libraries based on "drug-likeness" using GAMES and systematically exploring the chemical landscape. This involves shortening the loop between generative models and property/safety (ADMET) evaluations, potentially alleviating laboratory bottlenecks.swri.org


4|Reactions on Social Media――Both Expectations and Skepticism Visualized

On the day of the announcement, SwRI's official X account announced the "Chemical LLM GAMES to Accelerate Drug Discovery." While there were positive responses from the community, such as "AI application in drug discovery is very welcome," the overall thread also included reactions demanding specific benchmarks and gains in practical use.


Meanwhile, in general discussions on chemistry and LLMs, skepticism is repeatedly presented on Reddit's chemistry and AI subreddits, stating that "(general-purpose) LLMs are not good at SMILES conversion or machine-readable structuring." In one thread, an expert in cheminformatics shared the opinion that "LLMs are weak with SMILES." GAMES claims to address this pain point by increasing the "validity rate" with domain-specific data and fine-tuning.swri.org


Additionally, on Hacker News, there is a recent positive view that "LLM/AGI can be a significant boost in fields that heavily use parallel experiments and simulations, like drug discovery." However, there are also counterarguments pointing out the costs of parallelizing and automating wet experiments and physical constraints. The community's "temperature" is currently at a stage of balancing expectations and reality.


5|Risks and Limitations――"Correct SMILES" Alone Is Not Enough

Even if an LLM can arrange the correct vocabulary, it is a separate issue whether it is "synthesizable" and free from "toxicity or metabolic pitfalls." While GAMES' explanation touches on reducing invalid SMILES (improving grammatical validity), it suggests that synthesizability (SA) and ADMET's "real-world applicability" should still be ensured through downstream evaluation and verification.swri.org


Furthermore, in the safety of LLMs in the chemical domain, vulnerabilities such as "jailbreak" through SMILES or procedural expressions have been pointed out. When releasing and operating the model, design and governance to prevent the dissemination and misuse of dangerous substance synthesis information are essential.


6|Context of Surrounding Research――Bridging "General LLM to Chemical Language"

Since 2024, methods for adapting general LLMs to chemical language (such as SFT, DPO, and self-supervised pre-training) have emerged one after another, with reports of effectiveness in molecular property prediction and guided generation. GAMES can be seen as an example of advancing this trend into an industry-oriented implementation (in conjunction with Rhodium).


On the other hand, past research teaches that evaluation design that withstands practical use—data partitioning to avoid leaks, canonicalization, deduplication, and realistic baseline comparisons—is essential. Attention should be paid to the level of disclosure of evaluation metrics and data that SwRI will release in the next stage.


7|What Constitutes "Winning"――KPIs from a Practical Perspective

  • Balancing Valid SMILES Rate and Novelty: Not just "validity rate," but also chemical diversity and ease of synthesis (SA score) should be noted.

  • Improvement of Downstream Tasks: Distribution of docking scores in Rhodium, actual hit rate in follow-up experiments, and reduction of false positives.

  • Computational Efficiency: Training and inference costs (GPU time/power) and throughput improvement with LoRA/QLoRA.swri.org

  • Safety: Suppression and detection of dangerous chemical information output, log and audit systems.


8|Editorial Perspective――The Reality of the Strategy to "Cut Chemistry with Language"

The significance of GAMES lies in treating chemistry as text, bringing the ecosystem of general-purpose LLMs (lightweight fine-tuning, prompt design, tool integration) into drug discovery. Comments that it has already impacted real projects within the research institute likely express confidence beyond a proof of concept. However, its true value will only be confirmed when experimental results —measured and reproducible hit compounds—are demonstrated. What is awaited is the publication of benchmarks, external verification, and open evaluation protocols.swri.org


Reference Articles

Chemical LLM Developed to Accelerate Drug Discovery
Source: https://phys.org/news/2025-08-chemistry-llm-faster-drug-discovery.html

Powered by Froala Editor

← Back to Article List

Contact |  Terms of Service |  Privacy Policy |  Cookie Policy |  Cookie Settings

© Copyright ukiyo journal - 日本と世界をつなぐ新しいニュースメディア All rights reserved.