Peer Review of “Machine Learning for Risk Group Identification and User Data Collection in a Herpes Simplex Virus Patient Registry: Algorithm Development and Validation Study”

doi:10.2196/28922

Peer-Review Report

José Alberto Benítez Andrades, PhD

Related ArticlesPreprint: https://preprints.jmir.org/preprint/25560
Authors' Response to Peer-Review Reports: https://med.jmirx.org/2021/2/e28917/
Published Article: https://med.jmirx.org/2021/2/e25560/

JMIRx Med 2021;2(2):e28922

doi:10.2196/28922

Keywords

data collection; herpes simplex; registry; machine learning; risk assessment; artificial intelligence; predictor; risk

This is a peer-review report submitted for the paper “Machine Learning for Risk Group Identification and User Data Collection in a Herpes Simplex Virus Patient Registry: Algorithm Development and Validation Study.”

General Comments

The authors of this research [1] discuss a platform containing a random forest classifier applied to the medical reports of patients suffering from the herpes virus. The manuscript describes an introduction to the proposed topic, the problem the authors intend to solve, the solution, and a discussion. Although the research seems interesting, the manuscript has some weaknesses that the authors must resolve.

Specific Comments

Major Comments

Authors should read the authors’ guidelines at https://www.jmir.org/content/author-instructions. I suggest that they adapt their manuscript to the templates offered by JMIR; the title does not match the format proposed by the journal, the appendices do not have a caption, the tables can go in the manuscript, etc.
In relation to the content of the manuscript, there is no exhaustive bibliographic review in which existing studies applied to a classification problem such as the one the authors present are mentioned. Because of this, the justification for the development they propose is quite weak and can be improved upon.
Authors indicate that they separated the data sets by train_test_split; however, there is no clear description of the content of these two data sets. It is not known whether the classes are balanced or not, and no data preprocessing was done to ensure that the generated model is optimal for any type of data. Authors should indicate if they have done a cross-validation when training their model or not. If not, I recommend that they do it.
It would be enlightening to show the matrix of confusion as well as to indicate in a table a comparison of the measures of precision and accuracy on random forest with different hyperparameters.
To search for the best hyperparameters, I suggest using GridSearchCV or similar.
Finally, it is necessary to make a comparison between the proposed model and others that already exist.
Authors are requested to upload their code and the models to a repository to guarantee their reproducibility.

I thank the authors for their work in improving this manuscript. They have responded correctly to all my suggestions, and I consider that the manuscript has improved in quality and can be considered for publication in this journal.

Conflicts of Interest

None declared.

Surodina S, Lam C, Grbich S, Milne-Ives M, van Velthoven M, Meinert E. Machine Learning for Risk Group Identification and User Data Collection in a Herpes Simplex Virus Patient Registry: Algorithm Development and Validation Study. JMIRx 2021 Jun 10;2(2):e25560 [FREE Full text] [CrossRef]

Edited by G Eysenbach; This is a non–peer-reviewed article. submitted 18.03.21; accepted 18.03.21; published 11.06.21

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIRx Med, is properly cited. The complete bibliographic information, a link to the original publication on https://med.jmirx.org/, as well as this copyright and license information must be included.

Citation

Please cite as:

Benítez Andrades JA
Peer Review of “Machine Learning for Risk Group Identification and User Data Collection in a Herpes Simplex Virus Patient Registry: Algorithm Development and Validation Study”
JMIRx Med 2021;2(2):e28922
doi: 10.2196/28922 PMCID: 10414425

This paper is in the following e-collection/theme issue:

Peer Review of “Machine Learning for Risk Group Identification and User Data Collection in a Herpes Simplex Virus Patient Registry: Algorithm Development and Validation Study”