A Pilot Study on Using Large Language Models to Simplify Machine Learning for Mass Spectrometry-Based Classification of Erectile Dysfunction Drug Analogues

Choi Eunwoo; Oh Han Bin

doi:10.5478/MSL.2025.16.4.111

오늘 하루 그만보기

P-ISSN2233-4203
E-ISSN2093-8950
ESCI, SCOPUS, KCI

Home

Browse Articles

Article Detail

Home > Article Detail

P-ISSN 2233-4203
E-ISSN 2093-8950

e-Submission

Vol.16, No.4

PDF Citation

A Pilot Study on Using Large Language Models to Simplify Machine Learning for Mass Spectrometry-Based Classification of Erectile Dysfunction Drug Analogues

Mass Spectrometry Letters / Mass Spectrometry Letters, (P)2233-4203; (E)2093-8950

2025, v.16 no.4, pp.111-117

https://doi.org/10.5478/MSL.2025.16.4.111

Choi Eunwoo (Sogang University)
Oh Han Bin (Sogang University)

Choi, E., & Oh, H. B. (2025). A Pilot Study on Using Large Language Models to Simplify Machine Learning for Mass Spectrometry-Based Classification of Erectile Dysfunction Drug Analogues. , 16(4), 111-117, https://doi.org/10.5478/MSL.2025.16.4.111

copy

Downloaded
Viewed

PDF Download

Abstract

The recent emergence of large language models (LLMs) has transformed the process of machine learning (ML) model development, markedly reducing the need for advanced coding expertise and enabling domain scientists to directly con- struct computational workflows through natural language prompts. In this study, we demonstrate the application of Google Gemini Pro 2.5 for developing classification models of erectile dysfunction (ED) drug analogues using tandem mass spectromet- ric (MS/MS) data. The dataset consisted of 149 compounds, including sildenafil, vardenafil, tadalafil analogues, and structurally unrelated compounds, represented as binary barcode spectra (m/z 50–800) derived from fragment ion intensities. Through step- wise prompting, the LLM generated executable Python code for data preprocessing, model construction, hyperparameter optimi- zation, and ensemble learning using random forest, artificial neural networks (ANN), and support vector machines (SVM). The resulting models achieved high classification performance comparable to that of a manually programmed ANN reported in our previous work, while requiring markedly less programming effort. Beyond reproducing classification accuracy, this study high- lights the efficiency, accessibility, and reproducibility of LLM-assisted ML workflows, underscoring their potential to democra- tize computational methods in mass spectrometry and analytical chemistry.

keywords: machine learning, large language model (LLM), Erectile dysfunction drugs, mass spectrometry, forensic analysis

Received: 2025-09-30

Revised: 2025-12-12

Accepted: 2025-12-17

Published: 2025-12-31

Downloaded
Viewed

0KCI Citations
0WOS Citations

PDF Download

Recommanded Articles

상단으로 이동

Indexed By

Sitemap

Article Detail

A Pilot Study on Using Large Language Models to Simplify Machine Learning for Mass Spectrometry-Based Classification of Erectile Dysfunction Drug Analogues

Abstract

Other articles from this issue

Recommanded Articles

About the Journal

Notice

View Articles

Ethical Guideline

Mass Spectrometry Letters