<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.0 20040830//EN" "http://dtd.nlm.nih.gov/publishing/2.0/journalpublishing.dtd">
<article article-type="article-commentary" dtd-version="2.0" xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">JMIRxMed</journal-id>
      <journal-id journal-id-type="nlm-ta">JMIRx Med</journal-id>
      <journal-title>JMIRx Med</journal-title>
      <issn pub-type="epub">2563-6316</issn>
      <publisher>
        <publisher-name>JMIR Publications</publisher-name>
        <publisher-loc>Toronto, Canada</publisher-loc>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="publisher-id">v2i3e31547</article-id>
      <article-id pub-id-type="pmid"/>
      <article-id pub-id-type="doi">10.2196/31547</article-id>
      <article-categories>
        <subj-group subj-group-type="heading">
          <subject>Peer-Review Report</subject>
        </subj-group>
        <subj-group subj-group-type="article-type">
          <subject>Peer-Review Report</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>Peer Review of “Finding Potential Adverse Events in the Unstructured Text of Electronic Health Care Records: Development of the Shakespeare Method”</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="editor">
          <name>
            <surname>Meinert</surname>
            <given-names>Edward</given-names>
          </name>
        </contrib>
      </contrib-group>
      <contrib-group>
        <contrib id="contrib1" contrib-type="author" corresp="no">
          <collab>Anonymous</collab>
        </contrib>
      </contrib-group>
      <pub-date pub-type="collection">
        <season>Jul-Sep</season>
        <year>2021</year>
      </pub-date>
      <pub-date pub-type="epub">
        <day>11</day>
        <month>8</month>
        <year>2021</year>
      </pub-date>
      <volume>2</volume>
      <issue>3</issue>
      <elocation-id>e31547</elocation-id>
      <history>
        <date date-type="received">
          <day>24</day>
          <month>6</month>
          <year>2021</year>
        </date>
        <date date-type="accepted">
          <day>24</day>
          <month>6</month>
          <year>2021</year>
        </date>
      </history>
      <copyright-statement>© Anonymous. Originally published in JMIRx Med (https://med.jmirx.org), 11.08.2021.</copyright-statement>
      <copyright-year>2021</copyright-year>
      <license license-type="open-access" xlink:href="https://creativecommons.org/licenses/by/4.0/">
        <p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIRx Med, is properly cited. The complete bibliographic information, a link to the original publication on https://med.jmirx.org/, as well as this copyright and license information must be included.</p>
      </license>
      <self-uri xlink:href="https://med.jmirx.org/2021/3/e31547" xlink:type="simple"/>
      <related-article related-article-type="companion" id="preprint21249239" ext-link-type="doi" xlink:href="ttps://doi.org/10.1101/2021.01.05.21249239" vol="2" page="21249239" xlink:title="Preprint (medRxiv):" xlink:type="simple">https://www.medrxiv.org/content/10.1101/2021.01.05.21249239v1</related-article>
      <related-article related-article-type="companion" id="preprint27017" ext-link-type="doi" xlink:href="https://doi.org/10.2196/preprints.27017" vol="2" page="e27017" xlink:title="Preprint (JMIR Preprints):" xlink:type="simple">https://preprints.jmir.org/preprint/27017</related-article>
      <related-article related-article-type="companion" id="v2i3e31568" ext-link-type="doi" xlink:href="10.2196/31568" vol="2" page="e31568" xlink:title="Authors' Response to Peer-Review Reports:" xlink:type="simple">https://med.jmirx.org/2021/3/e31568/</related-article>
      <related-article related-article-type="companion" id="v2i3e27017" ext-link-type="doi" xlink:href="10.2196/27017" vol="2" page="e27017" xlink:title="Published Article:" xlink:type="simple">https://med.jmirx.org/2021/3/e27017/</related-article>
    </article-meta>
  </front>
  <body>
    <p>
      <italic>This is a peer-review report submitted for the paper “Finding Potential Adverse Events in the Unstructured Text of Electronic Health Care Records: Development of the Shakespeare Method”</italic>
    </p>
    <sec>
      <title>Round 1 Review</title>
      <sec>
        <title>General Comments</title>
        <p>This study [<xref ref-type="bibr" rid="ref1">1</xref>] is trying to develop a new method to identify attributed and unattributed potential adverse events (AEs) using the unstructured text of electronic health records (EHRs).</p>
        <list list-type="order">
          <list-item>
            <p>After reading the manuscript, I feel the title does not match the study contents. First, the title seems to repeat a fact that is already self-evident.</p>
          </list-item>
          <list-item>
            <p>The core of the so-called Shakespeare method is still the latent Dirichlet allocation (LDA) method; I cannot see that any novel methods have been developed.</p>
          </list-item>
          <list-item>
            <p>There is no related literature review, as many studies have used LDA methods in EHR data. To really find any AE in unstructured text, natural language processing (NLP) is indispensable.</p>
          </list-item>
          <list-item>
            <p>What is the difference between the so-called “Shakespeare method” and LDA topic modeling?</p>
          </list-item>
          <list-item>
            <p>What are the three parts in the following statement:</p>
            <p>The Shakespeare method has three parts:</p>
            <list list-type="bullet">
              <list-item>
                <p>Convert each document into a vector of n-gram frequencies.</p>
              </list-item>
              <list-item>
                <p>Create two groups of vectors: target and comparison.</p>
              </list-item>
              <list-item>
                <p>Trim the n-gram vectors in the target group to those that are significant for the target group.</p>
              </list-item>
              <list-item>
                <p>Apply topic analysis to the trimmed target group vectors.</p>
              </list-item>
              <list-item>
                <p>Interpret the original documents with topic scores of interest.</p>
              </list-item>
            </list>
          </list-item>
          <list-item>
            <p>The description of the method is hard to understand. As stated, “Crucially, events can be described in text but not necessarily attributed to being medical care AEs [14,25,41]; we wanted to develop an unstructured method that would identify them.” What is this unstructured method?</p>
          </list-item>
        </list>
      </sec>
    </sec>
    <sec>
      <title>Round 2 Review</title>
      <sec>
        <title>General Comments</title>
        <p>This revision provided more details of the Shakespeare method. However, it seems the authors do not quite understand the alternative method: NLP. This may lead to mistaken conclusions. The questions below need reconsideration.</p>
      </sec>
      <sec>
        <title>Specific Comments</title>
        <sec>
          <title>Major Comments</title>
          <list list-type="order">
            <list-item>
              <p>It is claimed that “Many methods for finding AEs in text rely on predefining possible AEs before searching for prespecified words and phrases or manual labeling (standardization) by investigators.” The dictionary method in the NLP tool could extract most terms, for example, included in the Unified Medical Language System, which can be limited to a “disorder” semantic group as a potential transfusion AE (PTAE) group.</p>
            </list-item>
            <list-item>
              <p>The PTAE terms identified through the Shakespeare method actually are a mixture of reasons for transfusion, consequences of the reasons for transfusion, or alternate reasons for PTAEs. The Shakespeare method is not able to identify specific AEs with a causal relationship with transfusion. Then, what is the difference between this method and the NLP dictionary method?</p>
            </list-item>
            <list-item>
              <p>It is advisable to include potential use scenarios of the method (eg, will more manual reviews be needed for the results?).</p>
            </list-item>
          </list>
        </sec>
      </sec>
    </sec>
  </body>
  <back>
    <app-group/>
    <glossary>
      <title>Abbreviations</title>
      <def-list>
        <def-item>
          <term id="abb1">AE</term>
          <def>
            <p>adverse event</p>
          </def>
        </def-item>
        <def-item>
          <term id="abb2">EHR</term>
          <def>
            <p>electronic health record</p>
          </def>
        </def-item>
        <def-item>
          <term id="abb3">LDA</term>
          <def>
            <p>latent Dirichlet allocation</p>
          </def>
        </def-item>
        <def-item>
          <term id="abb4">NLP</term>
          <def>
            <p>natural language processing</p>
          </def>
        </def-item>
        <def-item>
          <term id="abb5">PTAE</term>
          <def>
            <p>potential transfusion adverse event</p>
          </def>
        </def-item>
      </def-list>
    </glossary>
    <fn-group>
      <fn fn-type="conflict">
        <p>None declared.</p>
      </fn>
    </fn-group>
    <ref-list>
      <ref id="ref1">
        <label>1</label>
        <nlm-citation citation-type="journal">
          <person-group person-group-type="author">
            <name name-style="western">
              <surname>Bright</surname>
              <given-names>RA</given-names>
            </name>
            <name name-style="western">
              <surname>Dowdy</surname>
              <given-names>K</given-names>
            </name>
            <name name-style="western">
              <surname>Rankin</surname>
              <given-names>SK</given-names>
            </name>
            <name name-style="western">
              <surname>Blok</surname>
              <given-names>SV</given-names>
            </name>
            <name name-style="western">
              <surname>Palmer</surname>
              <given-names>LAM</given-names>
            </name>
            <name name-style="western">
              <surname>Bright</surname>
              <given-names>SJ</given-names>
            </name>
          </person-group>
          <article-title>Finding Potential Adverse Events in the Unstructured Text of Electronic Health Care Records: Development of the Shakespeare Method</article-title>
          <source>JMIRx Med</source>
          <year>2021</year>
          <month>8</month>
          <day>11</day>
          <volume>2</volume>
          <issue>3</issue>
          <fpage>e27017</fpage>
          <comment>
            <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://med.jmirx.org/2021/3/e27017/">https://med.jmirx.org/2021/3/e27017/</ext-link>
          </comment>
          <pub-id pub-id-type="doi">10.2196/27017</pub-id>
        </nlm-citation>
      </ref>
    </ref-list>
  </back>
</article>
