Sirui Li

Lecturer, School of Information Technology, Murdoch University

Natural language processing

Knowledge graph

Data analysis

Machine learning

Large language model

Graph neural networks

Multi-modal models

Conference paper Open access

FUZZY LOGIC-BASED MOBILE PHASE OPTIMIZATION AND DIFFERENTIAL ANALYSIS OF LIQUID CHROMATOGRAPHY-TANDEM MASS SPECTROMETRY DETECTION OF COENZYME Q10 AND 25-HYDROXYVITAMIN D

by Hongkun Liu, Jianli Cui, Kok Wai Wong, Sirui Li, Xi Huang and Yujie Li

Date presented 07/2025

ICMLC & ICWAPR 2025, 12/07/2025–15/07/2025, Bali, Indonesia

Liquid chromatography-tandem mass spectrometry (LC-MS/MS) serves as a key tool for the test of lipophilic substances in laboratory medicine and is widely employed in the analysis of coenzyme Q10 (CoQ10) and 25-hydroxyvitamin D (25OHD). In this paper, fuzzy concept was applied to improve the LC-MS/MS methods used for CoQ10 and 25OHD detection. The focus was placed on selecting the optimal mobile phase for CoQ10 analysis and examining the differences between LC-MS/MS and chemiluminescence immunoassay (CLIA) methods for 25(OH)D measurement. Through screening various organic phase combinations and employing fuzzy inference, the optimal mobile phase ratio for CoQ10 test is determined to be methanol and isopropanol at a ratio of 8:2. Additionally, fuzzy logic was employed to analyze the variations in 25OHD concentrations across different sexes and age groups. The results showed that women aged 30–40 exhibited greater differences in 25(OH)D levels compared to other groups. This study shows that the use of fuzzy concepts can enhance the adaptability and accuracy of LC-MS/MS detection, offering a novel approach to the analysis of lipophilic substances.

Conference paper Open access

DocSpiral: A Platform for Integrated Assistive Document Annotation through Human-in-the-Spiral

by Qiang Sun, Sirui Li, Tingting Bi, Du Huynh, Mark Reynolds, Yuanyi Luo and Wei Liu

Date presented 2025

, 267 - 274

The 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025), 27/07/2025–01/08/2025, Vienna, Austria

Acquiring structured data from domain-specific, image-based documents—such as scanned reports—is crucial for many downstream tasks but remains challenging due to document variability. Many of these documents exist as images rather than as machine-readable text, which requires human annotation to train automated extraction systems. We present DocSpiral, the first Human-in-the-Spiral assistive document annotation platform , designed to address the challenge of extracting structured information from domain-specific, image-based document collections. Our spiral design establishes an iterative cycle in which human annotations train models that progressively require less manual intervention. DocSpiral integrates document format normalization, comprehensive annotation interfaces, evaluation metrics dashboard, and API endpoints for the development of AI / ML models into a unified workflow. Experiments demonstrate that our framework reduces annotation time by at least 41% while showing consistent performance gains across three iterations during model training. By making this annotation platform freely accessible, we aim to lower barriers to AI/ML models development in document processing, facilitating the adoption of large language models in image-based, document-intensive fields such as geoscience and healthcare. The system is freely available at: https://app.ai4wa.com.

Conference paper Open access

Enhancing Question Answering through Effective Candidate Answer Selection and Mitigation of Incomplete Knowledge Graphs and over-smoothing in Graph Convolutional Networks

by Sirui Li, Kok Wai Wong, Dengya Zhu and Lance Chun Che Fung

Published 2024

The IEEE World Congress on Computational Intelligence, 30/06/2024–05/07/2024, Yokohama, Japan

—Question answering over knowledge graphs (KGQA) seeks to automatically answer natural language questions by retrieving triples within the knowledge graph (KG). In the context of multi-hop KGQA, reasoning across multiple edges of the KG becomes crucial for obtaining answers. Existing methods align with either the path-searching-based mainstream, emphasizing structural KG analysis, or the subgraph-based mainstream, focusing on semantic KG embeddings. Both streams have two primary challenges: (1) KG incompleteness, where path searching or subgraph construction faces limitations in the absence of links between entities; (2) candidate answer selection, wherein most approaches employ pre-defined searching sizes or heuristics. Many recent studies incorporate Graph Convolutional Network (GCN) to encode KGs, yet they overlook the potential over-smoothing issue inherent in GCNs. The over-smoothing problem arises from the tendency of closely connected nodes to exhibit similar embeddings within the deep convolutional architecture of GCNs. To address these challenges, this paper proposes a two-stage framework named ComPath, leveraging insights from both mainstreams. ComPath utilizes GCN to tackle KG incompleteness and introduces a path analyser to mitigate the over-smoothing issue associated with GCN. Candidate answers are selected using semantic similarity. The ablation studies and comparative experiments on the three KGQA benchmark datasets shown that the proposed ComPath performed better than the other KGQAs.

Conference paper Open access

Improving API Caveats Accessibility by Mining API Caveats Knowledge Graph

by Hongwei Li, Sirui Li, Jiamou Sun, Zhenchang Xing, Xin Peng, Mingwei Liu and Xuejiao Zhao

Date presented 09/2023

The International Conference on Software Maintenance and Evolution (ICSME), 23/09/2018–29/09/2018, Madrid, Spain

API documentation provides important knowledge about the functionality and usage of APIs. In this paper, we focus on API caveats that developers should be aware of in order to avoid unintended use of an API. Our formative study of Stack Overflow questions suggests that API caveats are often scattered in multiple API documents, and are buried in lengthy textual descriptions. These characteristics make the API caveats less discoverable. When developers fail to notice API caveats, it is very likely to cause some unexpected programming errors. In this paper, we propose natural language processing(NLP) techniques to extract ten subcategories of API caveat sentences from API documentation and link these sentences to API entities in an API caveats knowledge graph. The API caveats knowledge graph can support information retrieval based or entity-centric search of API caveats. As a proof-of-concept, we construct an API caveats knowledge graph for Android APIs from the API documentation on the Android Developers website. We study the abundance of different subcategories of API caveats and use a sampling method to manually evaluate the quality of the API caveats knowledge graph. We also conduct a user study to validate whether and how the API caveats knowledge graph may improve the accessibility of API caveats in API documentation.

Sirui Li

Lecturer, School of Information Technology, Murdoch University

Output list