<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD 2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">EXCLI J</journal-id>
      <journal-title>EXCLI Journal</journal-title>
      <issn pub-type="epub">1611-2156</issn>
      <publisher>
        <publisher-name>Leibniz Research Centre for Working Environment and Human Factors</publisher-name>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="publisher-id">2022-5602</article-id>
      <article-id pub-id-type="doi">10.17179/excli2022-5602</article-id>
      <article-id pub-id-type="pii">Doc84</article-id>
      <article-categories>
        <subj-group subj-group-type="heading">
          <subject>Original article</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>PARP1pred: a web server for screening the bioactivity of inhibitors against DNA repair enzyme PARP-1</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <name>
            <surname>Lerksuthirat</surname>
            <given-names>Tassanee</given-names>
          </name>
          <xref ref-type="corresp" rid="COR1">&#x0002a;</xref>
          <xref ref-type="aff" rid="A1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Chitphuk</surname>
            <given-names>Sermsiri</given-names>
          </name>
          <xref ref-type="aff" rid="A1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Stitchantrakul</surname>
            <given-names>Wasana</given-names>
          </name>
          <xref ref-type="aff" rid="A1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Dejsuphong</surname>
            <given-names>Donniphat</given-names>
          </name>
          <xref ref-type="aff" rid="A2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Malik</surname>
            <given-names>Aijaz Ahmad</given-names>
          </name>
          <xref ref-type="aff" rid="A3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Nantasenamat</surname>
            <given-names>Chanin</given-names>
          </name>
          <xref ref-type="aff" rid="A4">4</xref>
        </contrib>
      </contrib-group>
      <aff id="A1">
        <label>1</label>Research Center, Faculty of Medicine Ramathibodi Hospital, Mahidol University, Bangkok 10400, Thailand</aff>
      <aff id="A2">
        <label>2</label>Program in Translational Medicine, Chakri Naruebodindra Medical Institute, Faculty of Medicine Ramathibodi Hospital, Mahidol University, Samut Prakan 10540, Thailand</aff>
      <aff id="A3">
        <label>3</label>Center of Excellence in Computational Molecular Biology, Faculty of Medicine, Chulalongkorn University, Bangkok 10330, Thailand</aff>
      <aff id="A4">
        <label>4</label>Streamlit Open Source, Snowflake Inc., USA</aff>
      <author-notes>
        <corresp id="COR1">*To whom correspondence should be addressed: Tassanee Lerksuthirat, Research Center, Faculty of Medicine Ramathibodi Hospital, Mahidol University, Bangkok 10400, Thailand, E-mail: <email>tassanee.ler@mahidol.ac.th</email></corresp>
      </author-notes>
      <pub-date pub-type="epub">
        <day>05</day>
        <month>01</month>
        <year>2023</year>
      </pub-date>
      <pub-date pub-type="collection">
        <year>2023</year>
      </pub-date>
      <volume>22</volume>
      <fpage>84</fpage>
      <lpage>107</lpage>
      <history>
        <date date-type="received">
          <day>14</day>
          <month>11</month>
          <year>2022</year>
        </date>
        <date date-type="accepted">
          <day>23</day>
          <month>12</month>
          <year>2022</year>
        </date>
      </history>
      <permissions>
        <copyright-statement>Copyright &#xA9; 2023 Lerksuthirat et al.</copyright-statement>
        <copyright-year>2023</copyright-year>
        <license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/">
          <p>This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (http://creativecommons.org/licenses/by/4.0/) You are free to copy, distribute and transmit the work, provided the original author and source are credited.</p>
        </license>
      </permissions>
      <self-uri xlink:href="https://www.excli.de/vol22/excli2022-5602.pdf">This article is available from https://www.excli.de/vol22/excli2022-5602.pdf</self-uri>
      <abstract><p>Cancer is the leading cause of death worldwide, resulting in the mortality of more than 10 million people in 2020, according to Global Cancer Statistics 2020. A potential cancer therapy involves targeting the DNA repair process by inhibiting PARP-1. In this study, classification models were constructed using a non-redundant set of 2018 PARP-1 inhibitors. Briefly, compounds were described by 12 fingerprint types and built using the random forest algorithm concomitant with various sampling approaches. Results indicated that PubChem with an oversampling approach yielded the best performance, with a Matthews correlation coefficient &#x3E; 0.7 while also affording interpretable molecular features. Moreover, feature importance, as determined from the Gini index, revealed that the aromatic&#x2F;cyclic&#x2F;heterocyclic moiety, nitrogen-containing fingerprints, and the ether&#x2F;aldehyde&#x2F;alcohol moiety were important for PARP-1 inhibition. Finally, our predictive model was deployed as a web application called PARP1pred and is publicly available at https:&#x2F;&#x2F;parp1pred.streamlitapp.com, allowing users to predict the biological activity of query compounds using their SMILES notation as the input. It is anticipated that the model described herein will aid in the discovery of effective PARP-1 inhibitors.</p></abstract>
      <kwd-group>
        <kwd>PARP-1</kwd>
        <kwd>DNA repair</kwd>
        <kwd>machine learning</kwd>
        <kwd>QSAR</kwd>
        <kwd>webserver</kwd>
        <kwd>cheminformatics</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec sec-type="intro">
      <title>Introduction</title><p>Precision medicine is becoming increasingly important in treating many cancers because it can reduce side effects compared with conventional therapies (Baudino, 2015[<xref ref-type="bibr" rid="R5">5</xref>]). Several clinical trials have shown evidence of success, especially targeting DNA repair (Brown et al., 2017[<xref ref-type="bibr" rid="R8">8</xref>]). For example, an ovarian phase 2 clinical trial, in which platinum-sensitive patients were given the PARP-1 inhibitor olaparib as a maintenance treatment, showed an improvement in progression-free survival (Ledermann et al., 2012[<xref ref-type="bibr" rid="R36">36</xref>]). In a phase 3 OlympiA clinical trial, in which olaparib was administered as an adjuvant to BRCA1&#x2F;2-mutated breast cancer patients following completion of local treatment and neoadjuvant or adjuvant chemotherapy, the treatment group exhibited significantly longer survival, free of invasive or distant disease, than the placebo group (Tutt et al., 2021[<xref ref-type="bibr" rid="R74">74</xref>]). Moreover, the phase 2 TOPARP-A trial showed that patients who had metastatic prostate cancer, who were no longer responding to standard treatments, and who had defects in DNA-repair genes, had a high response rate toward olaparib (Mateo et al., 2015[<xref ref-type="bibr" rid="R42">42</xref>]).</p><p>DNA repair is a critical cellular process that ensures the integrity of the genome, allowing the parental cell to pass genetic information on to the progeny cell. Defective DNA repair causes accumulation of genetic mutations, thus leading to carcinogenesis. However, retaining some DNA repair activities is also important for cancer survival, especially when cells are under genotoxic stress (such as radio- and chemotherapy) (Helleday et al., 2008[<xref ref-type="bibr" rid="R26">26</xref>]). DNA double-strand break (DSB) lesions are the most toxic form of DNA damage, which, if left unrepaired, result in cell death (Shibata and Jeggo, 2014[<xref ref-type="bibr" rid="R68">68</xref>]). Therefore, drugs are of interest if their mode of action leads to the accumulation of DSBs (Srivastava and Raghavan, 2015[<xref ref-type="bibr" rid="R70">70</xref>]). </p><p>Poly (ADP-ribose) polymerase (PARP) is an enzyme that catalyzes the ADP-ribosylation of a specific protein, resulting in the covalent binding of a single ADP-ribose unit or polymers of ADP-ribose units (Gupte et al., 2017[<xref ref-type="bibr" rid="R24">24</xref>]). In humans, there are 17 members of the family, although only three (PARP-1, PARP-2, and PARP-3) are involved in DNA repair (Beck et al., 2014[<xref ref-type="bibr" rid="R6">6</xref>]). Among the three, PARP-1 (EC 2.4.2.30) was identified in 1963 and is the most extensively investigated DNA repair enzyme (Gupte et al., 2017[<xref ref-type="bibr" rid="R24">24</xref>]). By inhibiting PARP-1, DSB accumulation was induced in cancer cells deficient in <italic>BRCA1&#x2F;2</italic>, indicating that PARP-1 is a druggable target (Mateo et al., 2019[<xref ref-type="bibr" rid="R43">43</xref>]). Olaparib was the first well-known PARP-1 inhibitor, and it has been used as a targeted therapy to treat ovarian, breast, prostate, and pancreatic cancer patients with <italic>BRCA1&#x2F;2</italic> mutations (de Bono et al., 2020[<xref ref-type="bibr" rid="R14">14</xref>]; Fong et al., 2009[<xref ref-type="bibr" rid="R20">20</xref>]; Golan et al., 2019[<xref ref-type="bibr" rid="R23">23</xref>]; Kim et al., 2015[<xref ref-type="bibr" rid="R33">33</xref>]). Recently, five more PARP-1 inhibitors, rucaparib (Balasubramaniam et al., 2017[<xref ref-type="bibr" rid="R3">3</xref>]), niraparib (Mirza et al., 2016[<xref ref-type="bibr" rid="R47">47</xref>]; Scott, 2017[<xref ref-type="bibr" rid="R67">67</xref>]), talazoparib (Hoy, 2018[<xref ref-type="bibr" rid="R27">27</xref>]), fluzoparib (Li et al., 2021[<xref ref-type="bibr" rid="R38">38</xref>]), and pamiparib (Xu et al., 2021[<xref ref-type="bibr" rid="R79">79</xref>]) have been approved by the Food and Drug Administration (FDA). However, access to targeted therapy has been restricted in certain countries, particularly middle- and low-income countries, because of a lack of affordability or the capability to develop domestic pharmaceutical technology, which poses a threat to health security (Fundytus et al., 2021[<xref ref-type="bibr" rid="R21">21</xref>]; Ocran Mattila et al., 2021[<xref ref-type="bibr" rid="R53">53</xref>]). As a result, accelerating drug discovery in such countries is an important factor to minimize such risk.</p><p>The computational-aided drug design (CADD) approach significantly reduces the time and cost associated with drug discovery (Nantasenamat and Prachayasittikul, 2015[<xref ref-type="bibr" rid="R51">51</xref>]). With the availability of public bioactivity databases such as BindingDB (Gilson et al., 2016[<xref ref-type="bibr" rid="R22">22</xref>]), PubChem (Kim et al., 2016[<xref ref-type="bibr" rid="R34">34</xref>]), GtoPdb (Armstrong et al., 2020[<xref ref-type="bibr" rid="R2">2</xref>]), and ChEMBL (Mendez et al., 2019[<xref ref-type="bibr" rid="R44">44</xref>]), we can retrieve the bioactivity data and analyze the relationship between the chemical structures of compounds and their biological activities, termed the quantitative structure-activity relationship (QSAR) (Carracedo-Reboredo et al., 2021[<xref ref-type="bibr" rid="R10">10</xref>]; Nantasenamat and Prachayasittikul, 2015[<xref ref-type="bibr" rid="R51">51</xref>]). Developing a QSAR model involves two main steps: 1) molecular structure description; and 2) multivariate analysis to correlate molecular descriptors with observed biological activities (Nantasenamat et al., 2009[<xref ref-type="bibr" rid="R50">50</xref>]). The first step is to define chemical structures as numerical representations of their physicochemical properties. The second step employs statistical methods to establish the relationship between the independent variables (e.g., molecular descriptors) and the dependent variables (e.g., biological activities). As a result, the QSAR model is used to predict the effects of molecular descriptor changes on biological activities, as shown by the design of inhibitors against a variety of targets, such as antiviral (Malik et al., 2020[<xref ref-type="bibr" rid="R40">40</xref>]; Worachartcheewan et al., 2014[<xref ref-type="bibr" rid="R78">78</xref>]), anti-inflammatory (Kanan et al., 2021[<xref ref-type="bibr" rid="R31">31</xref>]), and anticancer (Nantasenamat et al., 2014[<xref ref-type="bibr" rid="R52">52</xref>]; Schaduangrat et al., 2021[<xref ref-type="bibr" rid="R66">66</xref>]). We constructed predictive models for drug discovery using a biological dataset of PARP-1 inhibitors.</p><p>Many studies have investigated <italic>in silico</italic> screening of PARP-1 inhibitors, including QSAR, molecular modeling, molecular docking, molecular dynamics simulation (MD), and proteochemometric modeling (Abbasi-Radmoghaddam et al., 2021[<xref ref-type="bibr" rid="R1">1</xref>]; Cortes-Ciriano et al., 2015[<xref ref-type="bibr" rid="R12">12</xref>]; Halder et al., 2015[<xref ref-type="bibr" rid="R25">25</xref>]; Li et al., 2016[<xref ref-type="bibr" rid="R37">37</xref>]; Revathi et al., 2021[<xref ref-type="bibr" rid="R61">61</xref>]). Halder and colleagues (2015[<xref ref-type="bibr" rid="R25">25</xref>]) used comparative <italic>in silico</italic> studies, including 2D-QSAR, kernel-based partial least square (KPLS) analysis, pharmacophore search engine (PHASE) pharmacophore mapping, molecular docking, molecular mechanics with generalized Born and surface area solvation (MM-GBSA) analysis, and Gaussian-based 3D-QSAR analyses on docked poses to explore the structure-activity relationship of PARP-1 inhibitors (Halder et al., 2015[<xref ref-type="bibr" rid="R25">25</xref>]). They used 254 compounds targeting PARP-1 from Merck Research Laboratories to conduct the analysis. They found that polar interactions play an important role to leverage the activity of PARP-1. Moreover, the positive ionizable feature of ligands also plays a key role to differentiate between highly active and inactive compounds. Revathi and colleagues (2021[<xref ref-type="bibr" rid="R61">61</xref>]) used 71 compounds that were phthalazinone and 4-carboxamide benzimidazole derivatives to develop ligand-based pharmacophores (Revathi et al., 2021[<xref ref-type="bibr" rid="R61">61</xref>]). They used Pharmacophore Alignment and Scoring Engine to identify the pharmacophore sites and later developed the ADHRR.1031 pharmacophore hypothesis as a 3D-QSAR model. Furthermore, the model was validated using 1,000,000 ligands from various databases and analyzed through virtual screening. The docking analysis revealed the importance of hydrogen bonding between Gly863 and Ser904 of PARP-1 with ligands. Additionally, hydrogen bond formation with Ser864 and &#x3C0;-&#x3C0; interaction with His862, Arg878, and His909 were also observed in the docking analysis. Sahin and Durdagi (2021[<xref ref-type="bibr" rid="R64">64</xref>]) aimed to identify novel piperazine-based PARP-1 inhibitors (Sahin and Durdagi, 2021[<xref ref-type="bibr" rid="R64">64</xref>]). They used text mining to search for molecules containing piperazine as a main scaffold from the Specs-SC database. The sorted molecules were then analyzed by molecular docking, in which the ten highest docking scores were further subjected to molecular dynamics (MD) to calculate the free binding energy using the molecular mechanics&#x2F;generalized born surface area method. They identified molecule-1388 as a potential candidate compound to selectively inhibit PARP-1. This compound had crucial hydrogen bonds with Gln759 and Met890 and &#x3C0;-&#x3C0; interaction with Tyr889. Abbasi-Radmoghaddam and colleagues (2021[<xref ref-type="bibr" rid="R1">1</xref>]) conducted a QSAR and molecular modeling study that predicted the IC<sub>50</sub> values (the concentration of inhibitor at which the enzymatic activity is reduced by half) of 1H-benzo&#x5B;d&#x5D;immidazole-4-carboxamide derivatives (Abbasi-Radmoghaddam et al., 2021[<xref ref-type="bibr" rid="R1">1</xref>]). They built a QSAR model based on the genetic algorithm-multiple linear regression (GA-MLR) and least squares-support vector machine (LS-SVM) methods. Moreover, they performed molecular docking analysis to reveal the chemical interactions between the substructure in each compound and PARP-1, as well as to calculate the free energy binding. They reported nine compounds, which given the best value of IC<sub>50</sub>, showed an improvement in PARP-1 inhibition of 33.394 &#x25;. Li and colleagues (2016[<xref ref-type="bibr" rid="R37">37</xref>]) used a molecular docking approach to screen compounds from the ZINC database against PARP-1 (Li et al., 2016[<xref ref-type="bibr" rid="R37">37</xref>]). Grid and amber scoring were used to calculate the area under the curve from the receiver operating characteristic. The selected compounds were further analyzed through MD. Finally, they proposed ZINC67913374 as a candidate compound to inhibit PARP-1 activity. Proteochemometry was also performed by Cort&#xE9;s-Ciriano and colleagues (2015[<xref ref-type="bibr" rid="R12">12</xref>]) to develop a model to explore the relationship between PARP inhibitors and various PARP isoforms, including PARP-1 (Cortes-Ciriano et al., 2015[<xref ref-type="bibr" rid="R12">12</xref>]). They used both chemical (Morgan fingerprints) and protein (binding site amino acid (AADescs) and full protein sequence (SeqDescs) descriptors as independent variables, while thermal shift values retrieved from Differential Scanning Fluorimetry (DSF) were treated as dependent variables. The models were built based on random forests, which were then further examined for the confidence intervals to understand the reliability of the predictive performance for either new compounds or PARP isoforms. Altogether, these studies show that computational approaches are useful to identify novel inhibitors of PARP-1.</p><p>In this study, we used Python-based programming to retrieve the biological activities of human PARP-1 from ChEMBL (Mendez et al., 2019[<xref ref-type="bibr" rid="R44">44</xref>]). We extracted a total of 2018 non-redundant compounds with known IC<sub>50</sub> values. All the inhibitors were converted to 12 different molecular descriptors and further built with 12 different machine learning models. Of the 144 models, the PubChem random forest model was chosen, because it was interpretable and it robustly classified substances as active or inactive, as indicated by MCC values &#x3E; 0.7 of the training and CV sets in all three sampling approaches. Additionally, the important chemical fingerprints that contributed to the constructed model were examined. In-depth analysis of the top 20 descriptors demonstrated that aromatic&#x2F;heterocyclic and nitrogen-containing characteristics are important for PARP-1 inhibition. Lastly, a web server was built to make this prediction accessible in the public domain. This will accelerate the discovery of new and diverse inhibitors against PARP-1.</p></sec>
    <sec sec-type="materials|methods">
      <title>Materials and Methods</title><sec><title>Data compilation and curation</title><p>The dataset of PARP-1 (ChEMBL ID: CHEMBL3105) inhibitors was compiled using data from the ChEMBL database, release 29 (Mendez et al., 2019[<xref ref-type="bibr" rid="R44">44</xref>]), which includes an initial set of 5094 bioactivity data points and 3738 compounds. The data were retrieved through a Python-based library (<ext-link ext-link-type="uri" xlink:href="https:&#47;&#47;pypi.org&#47;project&#47;chembl-webresource-client&#47;">https:&#47;&#47;pypi.org&#47;project&#47;chembl-webresource-client&#47;</ext-link>) which enables users to cache all results in the local file system for faster retrieval (Davies et al., 2015[<xref ref-type="bibr" rid="R13">13</xref>]). The IC<sub>50 </sub>values, containing 2815 data points and 2429 compounds, were chosen for further curation. Because the purpose of this study was to create a classification model for PARP-1 inhibition, we defined active as &#x2264; 1 &#xB5;M (n &#x3D; 1720) and inactive as &#x2265; 10 &#xB5;M (n &#x3D; 298). The intermediates with concentrations ranging between 1 and 10 &#xB5;M were discarded (n &#x3D; 334). Finally, we obtained 2018 non-redundant and curated active and inactive compounds for further analysis.</p></sec><sec><title>Molecular descriptor analysis</title><p>The PaDEL-Descriptor software was used to calculate molecular fingerprints for each compound in the dataset (Yap, 2011[<xref ref-type="bibr" rid="R82">82</xref>]). As previously described by Malik and colleagues (2020[<xref ref-type="bibr" rid="R40">40</xref>]), molecular fingerprints are numerical values that represent both qualitative and quantitative chemical structures (Malik et al., 2020[<xref ref-type="bibr" rid="R40">40</xref>]). Thus, they are crucial for QSAR studies. The software computes 12 types of fingerprints which belong to nine classes, namely, Atom Pairs 2D, CDK, CDK extended, CDK graph only, E-state, Klekota-Roth, MACCS, PubChem, and Substructure. Moreover, Atom Pairs 2D, Klekota-Roth, and Substructure are available in two versions. The first version indicates the presence or absence of the descriptors using the values 1 and 0, while the second version indicates the descriptor&#x27;s frequency value. The structures in SMILES format were pre-processed by removing salt, detecting aromaticity, standardizing nitro groups, and standardizing tautomers, before being subjected to molecular fingerprint calculation.</p></sec><sec><title>Data filtering</title><p>During the feature selection process, low variance variables were not useful for the model&#x27;s predictive capability. Therefore, constant and near constant variables were omitted from the selection of fingerprint descriptor sets to reduce model complexity and bias. The constants of the fingerprint descriptors were calculated using a standard deviation (SD) of 0.1 as a cut-off value. Thus, variables with SD values of less than 0.1 were selected for further analysis.</p></sec><sec><title>Data splitting for model construction</title><p>The Kennard-Stone algorithm was used to divide the data into an 80&#x2F;20 ratio (Kennard and Stone, 1969[<xref ref-type="bibr" rid="R32">32</xref>]), of which 80 &#x25; was assigned as an internal set (1614 compounds, active &#x3D; 1380, inactive &#x3D; 234) and the remaining 20 &#x25; was used as an external set (404 compounds, active &#x3D; 340, inactive &#x3D; 64) to validate the model. The internal dataset was further divided into balanced and imbalanced datasets and used as the training dataset, which was subjected to five-fold cross-validation.</p></sec><sec><title>Statistical analysis</title><p>We present chemical descriptors of each molecule according to the previous study by Schaduangrat and colleagues (2021[<xref ref-type="bibr" rid="R66">66</xref>]). Briefly, this uses six common descriptive statistical parameters: minimum (Min), first quartile (Q1), median, mean, third quartile (Q3), and maximum (Max). All the parameters were visualized as a box plot using the seaborn and matplotlib data visualization packages in Python. Lipinski&#x27;s rule-of-five parameters were compared between active and inactive groups using the Mann-Whitney <italic>U</italic> test, with <italic>p</italic> &#x3C; 0.05 indicating a significant difference.</p></sec><sec><title>Multivariate analysis</title><p>Twelve machine learning classification models were constructed from the internal dataset: decision trees, extra trees, Gaussian Naive Bayes, Gaussian process, gradient boosting, K-neighbors, light gradient boosted machine, multi-layer perceptron, quadratic discriminant analysis, random forest, C-support vector, and extreme gradient boosting. The model construction was developed using the scikit-learn library (Pedregosa et al., 2011[<xref ref-type="bibr" rid="R59">59</xref>]) in Python. Each type of model had different characteristics to determine the relationship between the dependent variables and the independent variables. Gradient boosting, random forest, extra trees, light gradient boosted machine, and extreme gradient boosting were grouped as ensemble methods, which generate many models and combine them to get the best model. Multi-layer perceptron was part of the neural network, which was considered a black box model and could not be interpreted. Decision tree was used to learn simple decision rules retrieved from the data features. K-neighbors is a type of instance-based learning in which the classification of certain data is based on most of its nearest neighbors. Support vector machine draws a hyperplane to separate two or more classes in the best possible manner. The Gaussian process uses a Gaussian distribution to fit random points of data, whereas quadratic discriminant analysis estimates the means and covariances from the data and assigns a new observed data point to the class with the greatest likelihood. Lastly, Gaussian Naive Bayes assumes each feature follows Gaussian distribution, calculates the probability from each feature at a given class, and multiplies all the probabilities of each feature.</p></sec><sec><title>Model validation</title><p>We used a variety of statistical parameters to evaluate the performance of the models, including true positives (TP), true negatives (TN), false positives (FP), and false negatives (FN). The model&#x27;s fitness was determined using the following statistical parameters: overall prediction accuracy (Ac), sensitivity (Sn), specificity (Sp), and Matthews correlation coefficient (MCC).</p><p><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="EXCLI-22-84-i-001" ></inline-graphic></p></sec><sec><title>Applicability domain analysis</title><p>To estimate the chemical space in which the model can make reliable and accurate predictions for compounds based on similarity with the compounds on which the model was constructed, we used the PCA bounding box to determine the applicability domain (AD) of compounds from the training (internal) and test (external) sets. Compounds that fall inside the AD of the model are typically predicted reliably.</p></sec><sec><title>Reproducibility research</title><p>The data and code used in the study are deposited on GitHub at <ext-link ext-link-type="uri" xlink:href="https:&#47;&#47;github.com&#47;tlerksuthirat&#47;data&#95;driven&#95;PARP1">https:&#47;&#47;github.com&#47;tlerksuthirat&#47;data&#95;driven&#95;PARP1</ext-link>.</p></sec><sec><title>Development of the PARP-1 web server</title><p>The best predictive model was exported as model.pkl and is used in the deployed web server developed in Python using Streamlit version 1.12.0. Particularly, the Streamlit web app accepts the input SMILES notation of query molecule and converts this into an image file of the 2D chemical structure via rdkit-pypi version 2022.3.5. Subsequently, the SMILES notation is used to compute the PubChem molecular fingerprint using padelpy version 0.1.10. The best machine learning model, which was built using the random forest algorithm with scikit-learn version 1.0.2, is applied on the computed fingerprint of the query molecule where the bioactivity is predicted. The PARP1pred web app is publicly available at <ext-link ext-link-type="uri" xlink:href="https:&#47;&#47;parp1pred.streamlit.app&#47;">https:&#47;&#47;parp1pred.streamlit.app&#47;</ext-link> while the data and code used for building this app is deposited on GitHub at <ext-link ext-link-type="uri" xlink:href="https:&#47;&#47;github.com&#47;dataprofessor&#47;parp1">https:&#47;&#47;github.com&#47;dataprofessor&#47;parp1</ext-link>.</p></sec></sec>
    <sec sec-type="discussion">
      <title>Results and Discussion</title><p>The entire workflow for constructing the model is summarized in Figure 1<xref ref-type="fig" rid="F1">(Fig. 1)</xref>.</p><sec><title>Chemical space analysis</title><p>The aim of performing chemical space analysis between active and inactive compounds is to understand the difference in chemical characteristics between two groups. We first explored the relationship between molecular weight (MW) and the Ghose-Crippen-Viswanadhan octanol-water partition coefficient (LogP), as shown in Figure 2<xref ref-type="fig" rid="F2">(Fig. 2)</xref> (Wildman and Crippen, 1999[<xref ref-type="bibr" rid="R77">77</xref>]). LogP is a lipophilic descriptor that can be used to determine the permeability of molecules to the cell membrane, thereby indicating their drug-likeness molecule (van de Waterbeemd, 2008[<xref ref-type="bibr" rid="R76">76</xref>]). Next, Lipinski&#x27;s rule-of-five (Ro5) descriptors were employed to investigate the difference in chemical features between active and inactive compounds, as shown in Figure 3<xref ref-type="fig" rid="F3">(Fig. 3)</xref>. The Ro5 are composed of four parameters, namely MW (&#x3C; 500 kDa), LogP (&#x3C; 5), the number of H-bond donors (NumHDonors &#x3C; 5), and the number of H-bond acceptors (NumHAcceptors &#x3C; 10) (Lipinski et al., 2001[<xref ref-type="bibr" rid="R39">39</xref>]). If any compounds have values out of range for two parameters, they are likely to have poor absorption or permeability, and thus a higher rate of drug development failure. As illustrated in Figure 2<xref ref-type="fig" rid="F2">(Fig. 2)</xref>, most compounds clustered between 300 and 500 MW with a LogP of 2-4. Moreover, the Ro5 analysis and statistical analysis revealed that most of the active and inactive compounds following the Ro5 as illustrated by the box plots were under the cut-off values (dashed line, Figure 3<xref ref-type="fig" rid="F3">(Fig. 3)</xref>). The Mann-Whitney <italic>U </italic>test found a significant difference in MW, NumHDonors, and NumHAcceptors between active and inactive molecules, but no difference in LogP. Active molecules had a higher MW, NumHDonors, and NumHAcceptors than inactive molecules, as demonstrated by the circle in the boxplot (Figure 3<xref ref-type="fig" rid="F3">(Fig. 3)</xref>). The mean &#xB1; SD of MW in the active and inactive groups were 381.66 &#xB1; 87.93 and 349.35 &#xB1; 119.32, respectively. NumHDonors had a mean &#xB1; SD of 1.80 &#xB1; 0.82 for active molecules and 1.39 &#xB1; 0.79 for inactive molecules, whereas the mean SD for NumHAcceptors was 4.56 &#xB1; 1.53 for active molecules and 4.22 &#xB1; 2.06 for inactive molecules. Between the active and inactive molecules, the logP value was 2.66 &#xB1; 1.17 for active molecules and 2.61 &#xB1; 1.52 for inactive molecules.</p></sec><sec><title>QSAR modeling</title><p>To develop a robust QSAR model, we followed the guidelines of the Organization for Economic Co-operation and Development (OECD, 2014[<xref ref-type="bibr" rid="R54">54</xref>]). Briefly, a robust model should include, at least: 1) a defined endpoint for the dataset; 2) an unambiguous learning algorithm; 3) a defined applicability domain of the QSAR model; 4) appropriate measures of goodness-of-fit, robustness, and predictability; and 5) mechanistic interpretation of the QSAR model. Thus, to develop interpretable QSAR models, the molecular fingerprints indicated in Table 1<xref ref-type="fig" rid="T1">(Tab. 1)</xref> were calculated using the PaDEL-Descriptor software, from which three fingerprints (PubChem, Substructure, and Klekota-Roth) are readily interpretable.</p><p>We constructed 12 machine learning models from 12 molecular fingerprints to determine which model gave the best performance and was the most robust and interpretable. Because our imbalanced data contained more active compounds (n &#x3D; 1720) than inactive compounds (n &#x3D; 298), we compared the models generated from both balanced and imbalanced approaches. Prior to data splitting, we reduced the dimensionality of the data by selecting the fingerprint that rendered SD &#x3C; 0.1. The data were split into external and internal sets in an 80:20 ratio. The internal dataset (n &#x3D; 1614), which contained 1380 active and 234 inactive compounds, was further divided into balanced and imbalanced datasets. For the balanced dataset, the models were created based on two methods: 1) undersampling, which randomly selected the majority class equal to the number of the minority classes; 2) oversampling, which amplified the number of minority classes equal to the number of the majority class. </p><p>For the non-class weight balance of an imbalanced dataset, the data were randomly selected to develop the model without consideration of the ratio between major and minority classes. Figures 4<xref ref-type="fig" rid="F4">(Fig. 4)</xref> and 5<xref ref-type="fig" rid="F5">(Fig. 5)</xref> demonstrate the heat maps of MCC<sub>train</sub>, MCC<sub>CV</sub>, MCC<sub>test</sub>, MCC<sub>train&#x2212;CV</sub>, and MCC<sub>train&#x2212;test</sub> for each fingerprint, machine learning model, and sampling approach. </p><p>Results showed that a balanced oversampling approach yielded the best value-most of the MCC<sub>train</sub> and MCC<sub>CV</sub> values were more than 0.8. Moreover, most of the MCC<sub>test</sub> values of oversampling were more than 0.7. The values of MCC<sub>train&#x2212;CV</sub> in the oversampling group were lower than 0.2, whereas the values of MCC<sub>train&#x2212;test</sub> in both balanced oversampling and imbalanced non-class weight were generally better than balanced undersampling, as the MCC<sub>train&#x2212;test</sub> values of undersampling were mostly greater than 0.3. As a result, we considered the oversampling approach as a good candidate to compare the performance among each model and fingerprint. Figure 4B<xref ref-type="fig" rid="F4">(Fig. 4)</xref> demonstrates that Gaussian Naive Bayes and quadratic discriminant analysis did not yield acceptable MCC values (&#x3C; 0.7) for all fingerprints. We further selected random forest (RF) over other machine learning methods because relevant features were able to be observed and the model was easily interpretable. As mentioned in the Methods section, RF is an ensemble method that has a root node as a starting point and splits into an N number of decision trees to learn the inherent patterns from the input data (Breiman, 2001[<xref ref-type="bibr" rid="R7">7</xref>]). Following a thorough examination of all MCC values for the interpretable fingerprints- PubChem, Substructure, and Klekota-Roth- the result suggested that a model based on PubChem was a good candidate. This was demonstrated by the MCC values for PubChem in the training, cross-validation, and test sets of 1, 0.96, and 0.74, respectively, whereas the MCC values for Substructure and Klekota-Roth in the test set were 0.66 and 0.68, respectively. As a result, the RF model that was developed using the oversampling approach from the PubChem fingerprint was the best option for model interpretation. Furthermore, as indicated in Figure 6<xref ref-type="fig" rid="F6">(Fig. 6)</xref>, the applicability domain was determined using the PubChem fingerprint as the input for PCA analysis. A total of 2018 compounds were split into two subsets, which consisted of internal (80 &#x25;) and external (20 &#x25;) datasets using the Kennard-Stone algorithm (Kennard and Stone, 1969[<xref ref-type="bibr" rid="R32">32</xref>]). The internal set was used as the training dataset, subjected to random sampling, and the predictive model was constructed with five-fold cross-validation. The result showed that the chemical space distribution of the external dataset fits well with the internal dataset, indicating that the applicability domain was well defined for the QSAR-based classification model. </p></sec><sec><title>Mechanistic interpretation of feature importance</title><p>To gain a better understanding of the mechanisms underlying PARP-1 activity and the significance of the features used to develop a PARP-1 activity predictability model using RF, the mean decrease of the Gini index was used to rank the importance of the PubChem feature descriptors. Measuring feature importance in RF can be evaluated by the mean decrease accuracy and the mean decrease in Gini; however, the latter gives more robust results (Calle and Urrea, 2010[<xref ref-type="bibr" rid="R9">9</xref>]). Thus, we selected the top 20 PubChem substructures with the highest Gini index, illustrated in Figure 7<xref ref-type="fig" rid="F7">(Fig. 7)</xref>, and their corresponding substructure descriptions are shown in Table 2<xref ref-type="fig" rid="T2">(Tab. 2)</xref>. We grouped the functional groups of the PubChem fingerprints into four classes: 1) aromatic, cyclic&#x2F;heterocyclic, and ring counts; 2) nitrogen-containing, consisting of hydrazine, amine, imine, and amide; 3) atom counts; and 4) ether, aldehyde, and alcohol. However, some PubChem fingerprints had more than one feature; for example, PubChemFP695 had aldehyde and amine functional groups, and PubChemFP821 had cyclic and amine functional groups.</p></sec><sec><title>Aromatic, cyclic&#x2F;heterocyclic, and ring count functional groups</title><p>The fingerprints belonging to these groups consisted of PubChem191, PubChem734, PubChem797, PubChem821, and PubChem192. PubChem192 was on the lowest rank of the top 20 and it was not specified whether it was aromatic- or heteroatom-containing, but it must have a ring size of six for at least three rings. Thus, the aromatic (PubChemFP734) and cyclic (PubChem191, PubChem797, and PubChem821) moieties overlapped with PubChem192. Based on aromatic, cyclic&#x2F;heterocyclic, and ring counts, PubChem191, PubChem797, and PubChem821 were at the 7<sup>th</sup>, 12<sup>th</sup>, and 16<sup>th</sup> positions of the top 20. Taking a closer look at our post-processing dataset (2018 compounds), there were 35 compounds in total containing all three fingerprints, of which 34 compounds were considered active. Moreover, a total of 33 compounds contained both cyclic (PubChem191, PubChem797, and PubChem821) and aromatic moieties (PubChemFP734), and all of them were active. This meant that aromatic and cyclic&#x2F;heterocyclic functional groups with a ring size equal to six or more than two were the important features of the active compounds. The first generation of PARP-1 inhibitors was designed to mimic the benzamide scaffold of NAD<sup>&#x2B;</sup> (Steffen et al., 2013[<xref ref-type="bibr" rid="R71">71</xref>]). Later the efficacy was improved by using quinazolinone as a scaffold to synthesize PARP-1 inhibitors (Malyuchenko et al., 2015[<xref ref-type="bibr" rid="R41">41</xref>]). Inhibitors derived from those two scaffolds contain both the aromatic and cyclic&#x2F;heterocyclic moieties and play an important role in the NAD<sup>&#x2B;</sup> binding pocket. The aromatic ring forms &#x3C0;-&#x3C0; interactions with the tyrosine residues in the NAD<sup>&#x2B;</sup> binding pocket, and both the aromatic ring and cyclic&#x2F;heterocyclic moieties form hydrophobic interactions with the hydrophobic residues in the NAD<sup>&#x2B;</sup> binding pocket. The crystal structure of human PARP-1 revealed a hydrophobic interaction between the quinazolinone part of the FR257517 inhibitor and the phenyl ring of Tyr907 and a CH-&#x3C0; interaction with C&#x3B2; of Tyr869 (Kinoshita et al., 2004[<xref ref-type="bibr" rid="R35">35</xref>]). Moreover, docking analysis between PARP-1 and tricyclic compounds containing a non-aromatic A-ring demonstrated the fit within the NAD<sup>&#x2B;</sup> binding pocket, even though the non-aromatic A-ring was not flat (Park et al., 2010[<xref ref-type="bibr" rid="R56">56</xref>]). Most of the active compounds reported herein had IC<sub>50</sub> values ranging from 0.013-0.695 &#xB5;M. It should be noted that PubChem191 was in the highest rank among the aromatic, cyclic&#x2F;heterocyclic, and ring counts groups. This could be explained by the nitrogen in the non-aromatic moiety of the inhibitors contributing to hydrogen bonds forming with the glycine in the NAD<sup>&#x2B;</sup> binding pocket. The crystal structure of PARP-1 conjugated with FR257517 revealed three hydrogen bonds, one from the NH of the quinazolinone part of FR257517 to Gly863-C&#x3D;O (Kinoshita et al., 2004[<xref ref-type="bibr" rid="R35">35</xref>]). In addition, cyclic benzamide derivatives increased potency in PARP-1 and led to the optimization of novel PARP-1 inhibitors. Steinhagen and colleagues (2002[<xref ref-type="bibr" rid="R72">72</xref>]) reported that core variations within the cyclohexene moiety of PubChem191 affected the potency of inhibitors (Steinhagen et al., 2002[<xref ref-type="bibr" rid="R72">72</xref>]). Moreover, the study demonstrated that substitution of the 3,6-dihydro-2-thiopyrane subunit yielded a three- to tenfold increase in potency compared with the cyclohexenyl moiety.</p></sec><sec><title>Nitrogen-containing functional groups, including hydrazine, amine, imine, and amide</title><p>This class of functional groups possessed the largest number of fingerprints, including hydrazine (PubChemFP300), amine (PubChemFP358, PubChemFP391, PubChemFP695, PubChemFP540, PubChemFP607, PubChemFP569, PubChemFP821, and PubChemFP611), amide (PubChemFP646), and imine (PubChemFP576). There were two fingerprints in this group, PubChemFP695 and PubChemFP821, also containing aldehyde and cyclic functional groups, respectively.</p><p>PubChemFP300 was in the first rank of important fingerprints based on all features. This is because PubChemFP300 is part of the basic scaffold during PARP-1 inhibitor development (Ferraris, 2010[<xref ref-type="bibr" rid="R19">19</xref>]). Banasik and colleagues (1992[<xref ref-type="bibr" rid="R4">4</xref>]) introduced pthalazine derivatives and analogues as part of the development of PARP-1 inhibitors (Banasik et al., 1992[<xref ref-type="bibr" rid="R4">4</xref>]). Moreover, Xu and colleagues (2014[<xref ref-type="bibr" rid="R80">80</xref>]) synthesized a series of compounds which contained tetraaza phenalen-3-one as a main scaffold to inhibit PARP-1 (Xu et al., 2014[<xref ref-type="bibr" rid="R80">80</xref>]). The compounds sensitized tumor cells to ionizing radiation and temozolomide. Ji and colleagues (2015[<xref ref-type="bibr" rid="R29">29</xref>]) used phthalic hydrazide as a pharmaceutical scaffold to synthesize novel PARP-1 inhibitors (Ji et al., 2015[<xref ref-type="bibr" rid="R29">29</xref>]). Another study produced novel PARP-1 inhibitors by fusing a pyrazolo pyridin-2-one to a non-aromatic heterocycle or carbocycle. These resulted in a vast variety of IC<sub>50</sub> values, ranging from 0.002 to &#x3E;10 &#xB5;M (Moree et al., 2008[<xref ref-type="bibr" rid="R48">48</xref>]).</p><p>As well as PubChemFP300, another four fingerprints were within the top ten important features: PubChemFP358 (3<sup>rd</sup> rank), PubChemFP576 (4<sup>th</sup> rank), PubChemFP391 (8<sup>th</sup> rank), and PubChemFP695 (9<sup>th</sup> rank). PubChemFP358 is part of the benzamide scaffold, thus making it critical for PARP-1 inhibitor synthesis because this scaffold mimics the NAD<sup>&#x2B;</sup> substrate. This scaffold has been maintained through all generations of PARP-1 synthesis (Malyuchenko et al., 2015[<xref ref-type="bibr" rid="R41">41</xref>]). As previously mentioned, the crystal structures revealed that NH in the quinazolinone scaffold of FR257517 forms a hydrogen bond with the Gly863-C&#x3D;O that is required for the inhibitor to remain in the NAD<sup>&#x2B;</sup> binding pocket (Kinoshita et al., 2004[<xref ref-type="bibr" rid="R35">35</xref>]). Moreover, PubChemFP358 is part of the pendant fluorobenzyl group that participates in the adenine-ribose binding pocket within the NAD<sup>&#x2B;</sup> binding site (Pescatore et al., 2010[<xref ref-type="bibr" rid="R60">60</xref>]).</p><p>PubChemFP576 is part of the pyridine and pyrimidine moieties. Moree and colleagues (2008[<xref ref-type="bibr" rid="R48">48</xref>]) fused a pyrazolo pyridin-2-one to a non-aromatic heterocycle or carbocycle to generate novel PARP-1 inhibitors (Moree et al., 2008[<xref ref-type="bibr" rid="R48">48</xref>]). The fused structures were designed based on the observation that pyrazolo pyridin-2-one showed a similar binding mode between chicken PARP-1 (PDB: 1PAX) and the Parke-Davis&#x2F;Pfizer inhibitor. Ferraris and colleagues (2003[<xref ref-type="bibr" rid="R18">18</xref>]) synthesized a series of aza-5&#x5B;<italic>H</italic>&#x5D;-phenanthridine-6-inhibitors where nitrogen atoms were introduced to the 5&#x5B;<italic>H</italic>&#x5D;-phenanthridin-6-one core at different positions to compare the potency (Ferraris et al., 2003[<xref ref-type="bibr" rid="R18">18</xref>]). Moreover, this fingerprint was part of the tetraaza phenalen-3-one (Xu et al., 2014[<xref ref-type="bibr" rid="R80">80</xref>]), 4-benzyl-2<italic>H</italic>-phthalazin-1-one (Menear et al., 2008[<xref ref-type="bibr" rid="R45">45</xref>]), and 4-&#x5B;4&#x27;-fluoro-3&#x27;-(piperazine-1&#x27;-carbonyl)benzyl&#x5D;-2H-phthalazin-1-one cores (Zmuda et al., 2015[<xref ref-type="bibr" rid="R87">87</xref>]). Torrisi and colleagues (2010[<xref ref-type="bibr" rid="R73">73</xref>]) demonstrated that introduction of 3-pyridyl to a hexahydrobenzonaphthyridinone pharmacophore resulted in metabolic stability (Torrisi et al., 2010[<xref ref-type="bibr" rid="R73">73</xref>]).</p><p>PubChemFP391 represents the tertiary amines that Ferraris and colleagues (2003[<xref ref-type="bibr" rid="R17">17</xref>]) added to the partially saturated aza-5&#x5B;<italic>H</italic>&#x5D;-phenanthridine-6-ones to increase aqueous solubility (Ferraris et al., 2003[<xref ref-type="bibr" rid="R17">17</xref>]). Moreover, it is part of the optimal nitrogen substituent of the hexahydrobenzophthyridinone pharmacophore to synthesize diverse ranges of PARP-1 inhibitors that was synthesized by Torrisi and colleagues (2010[<xref ref-type="bibr" rid="R73">73</xref>]). Pescatore and colleagues (2010[<xref ref-type="bibr" rid="R60">60</xref>]) synthesized a series of pyrrolo&#x5B;1,2-a&#x5D;pyrazin-1(2<italic>H</italic>)-one to inhibit PARP-1 (Pescatore et al., 2010[<xref ref-type="bibr" rid="R60">60</xref>]). Additionally, the same study revealed that the pyrrolo&#x5B;1,2-a&#x5D;pyrazin-1(2<italic>H</italic>)-one scaffold exhibited good potency and inhibited <italic>BRCA</italic>-deficient tumor cells. Rhee and colleagues (2009[<xref ref-type="bibr" rid="R62">62</xref>]) used isoquinolinone-based tetracycles as the main scaffold to develop PARP-1 inhibitors (Rhee et al., 2009[<xref ref-type="bibr" rid="R62">62</xref>]). Based on this fingerprint, some of the compounds from this study exhibited an IC<sub>50</sub> lower than 1 &#xB5;M. Zhou and colleagues (2017[<xref ref-type="bibr" rid="R85">85</xref>]) made a group of compounds called fused tetra- or penta-cyclic compounds, in which one part of the ring had a tertiary amine as a spacer to link other substituents, that showed diverse ranges of enzymatic activity (Zhou et al., 2017[<xref ref-type="bibr" rid="R85">85</xref>]).</p><p>PubChemFP695 overlapped with both PubChemFP358 and PubChemFP191, which are important for the NAD<sup>&#x2B;</sup> binding pocket. Moreover, PubChemFP695 was part of tricyclic derivative PARP-1 inhibitor synthesis (Myung-Hwa et al., 2014[<xref ref-type="bibr" rid="R49">49</xref>]), and substituents participated in the adenine-ribose (AD) binding site within the NAD<sup>&#x2B;</sup> binding pocket (Scarpelli et al., 2010[<xref ref-type="bibr" rid="R65">65</xref>]). PubChemFP695 is a component of proline derivatives and contributes to lipophilicity, which is necessary for cell permeability (Dunn et al., 2012[<xref ref-type="bibr" rid="R15">15</xref>]). This was confirmed by introducing the polar carboxylic acid moiety to proline derivatives, resulting in less cell-based activity. Moreover, PubChemFP695 also overlapped with PubChemFP391, making this fingerprint part of the AD binding site.</p><p>Collectively, this suggests that nitrogen-containing fingerprints are important in model construction.</p></sec><sec><title>Ether, aldehyde, and alcohol functional groups</title><p>One fingerprint, PubChem695, which contained both aldehyde and amine functional groups, is categorized in this class and has been discussed previously. The remaining fingerprints falling into this class, PubChem680 (15<sup>th</sup> rank) and PubChem594 (17<sup>th</sup> rank), were not ranked in the top ten important features. Based on our curated dataset (n &#x3D; 2018), few compounds contained these fingerprints: PubChem680, n &#x3D; 714 (18<sup>th</sup> rank); and PubChem594, n &#x3D; 468 (18<sup>th</sup> rank). PubChem680 is composed of alkane and alcohol functional groups and participates in the nicotinamide-ribose (NI) and AD binding sites within the NAD<sup>&#x2B;</sup> binding pocket. The study led by Ferraris and colleagues (2003[<xref ref-type="bibr" rid="R18">18</xref>]) replaced the C&#x3D;O of the amide group from the benzamide scaffold with C-OH, which resulted in IC<sub>50</sub> values ranging from 14-0.042 &#xB5;M (Ferraris et al., 2003[<xref ref-type="bibr" rid="R18">18</xref>]). This suggests that OH could be able to maintain a hydrogen bond within the NAD<sup>&#x2B;</sup> binding pocket. Additionally, this fingerprint served as an o-linked spacer between two distinct pharmacophores, one of which was responsible for the NI binding site and the other for the AD binding site, as demonstrated by Park and colleagues (2010[<xref ref-type="bibr" rid="R56">56</xref>]) via the synthesis of a series of 1,2-dihydro-4H-thiopyrano&#x5B;3,4-c&#x5D;quinolin-5(6H)-one derivatives (Park et al., 2010[<xref ref-type="bibr" rid="R56">56</xref>]). As part of the AD binding site, this fingerprint also overlapped with PubChemFP695, which contributes to aqueous solubility and cellular permeability, as previously mentioned.</p><p>PubChemFP594 is part of the pyran and was found to play roles in both the NI and AD binding sites within the NAD<sup>&#x2B;</sup> binding pocket. Several studies have used pyran as part of the scaffold. For example, introducing a dihydropyran to the A-ring caused the derivatives to be more polar but less potent toward PARP-1 inhibition (Shultz et al., 2013[<xref ref-type="bibr" rid="R69">69</xref>]). Xu and colleagues (2014[<xref ref-type="bibr" rid="R81">81</xref>]) filed the patent on the synthesis of diazabenzo&#x5B;de&#x5D;anthracen-3-one derivatives that contain pyran as part of the tri-cyclic ring (Xu et al., 2014[<xref ref-type="bibr" rid="R81">81</xref>]). All the compounds reported in this study were categorized as active compounds. Conversely, the patent filed by Cheung and colleagues (2015[<xref ref-type="bibr" rid="R11">11</xref>]) revealed mostly inactive compounds against PARP-1 (Cheung et al., 2015[<xref ref-type="bibr" rid="R11">11</xref>]). For the AD binding site, this fingerprint participated in phenyl derivative substituents, as demonstrated by Orvieto and colleagues (2009[<xref ref-type="bibr" rid="R55">55</xref>]) when they introduced methyl groups to the aromatic ether (Orvieto et al., 2009[<xref ref-type="bibr" rid="R55">55</xref>]). They found that this improved the inhibitory effect compared with its parental phenyl. As previously mentioned, PubChemFP594 also functionally overlapped with PubChem680, as part of the o-linked spacer between two binding modes of pharmacophores.</p></sec><sec><title>Structural interpretation</title><p>PARP-1 has three important domains: 1) the DNA binding domain, 2) the catalytic domain, and 3) the nuclear acceptor protein (Ferraris, 2010[<xref ref-type="bibr" rid="R19">19</xref>]). The catalytic domain is subdivided into: 1) the helical domain (HD), and 2) the ADP-ribosyl transferase (ART) domain, as illustrated in Figure 8<xref ref-type="fig" rid="F8">(Fig. 8)</xref> (Patel et al., 2012[<xref ref-type="bibr" rid="R58">58</xref>]). Most of the compounds were synthesized to inhibit the catalytic domain that consists of three subsites: 1) the nicotinamide-ribose binding site (NI), 2) the phosphate binding site (PH), and 3) the adenine-ribose binding site (AD), and the inhibitors were designed to mimic the nicotinamide scaffold of NAD<sup>&#x2B;</sup> (Kinoshita et al., 2004[<xref ref-type="bibr" rid="R35">35</xref>]). Thus, all generations of PARP-1 inhibitors have maintained the basal chemical interaction network between the inhibitors and the key amino acids within the NI binding site (Malyuchenko et al., 2015[<xref ref-type="bibr" rid="R41">41</xref>]). These key amino acids include Gly863 (nitrogen of the &#x3B1;-amine) and Ser904 (oxygen of the R-group) forming hydrogen bonds with either C&#x3D;O or C-OH of inhibitors. The oxygen of the carboxyl group of Gly863 forms a hydrogen bond with either the nitrogen-containing ring of inhibitors or the NH group of the nicotinamide scaffold, whereas the hydrogen of the amino group of Gly863 forms a hydrogen bond with either the C&#x3D;O or C-OH of the inhibitors, as illustrated in Figure 8<xref ref-type="fig" rid="F8">(Fig. 8)</xref>. Additionally, &#x3C0;-&#x3C0; and hydrophobic interactions between the side chain of Tyr896 and Tyr907 in PARP-1 and either the cyclic or aromatic ring of inhibitors contribute to the NI binding site, as shown in Figure 8<xref ref-type="fig" rid="F8">(Fig. 8)</xref>. These interactions were shown by the co-crystallization of chicken PARP-1, which is highly conserved with human PARP-1 (sequence identity and similarity, 79 &#x25; and 89 &#x25;, respectively), with three different inhibitors: 6-amino-benzo&#x5B;de&#x5D;isoquinoline-1,3-dione (4ANI), 3-methoxybenzamide (3MBA), and 8-hydroxy-2-methyl-3-hydro-quinazolin-4-one (NU1025) (Kinoshita et al., 2004[<xref ref-type="bibr" rid="R35">35</xref>]; Ruf et al., 1998[<xref ref-type="bibr" rid="R63">63</xref>]). The importance of the chemical interaction network has been confirmed through site-directed mutagenesis on human PARP-1. Ruf and colleagues (1998[<xref ref-type="bibr" rid="R63">63</xref>]) demonstrated that G863A, Y896N, and Y907N reduced PARP-1 activity to 70 &#x25;, 15 &#x25; and 1.1 &#x25;, respectively, compared with wildtype (Ruf et al., 1998[<xref ref-type="bibr" rid="R63">63</xref>]).</p><p>To improve the potency of PARP-1 inhibitors, because the NI binding site is found in other NAD<sup>&#x2B;</sup> binding proteins, the development of PARP-1 inhibitors was extended to use the AD binding site to increase the selectivity of PARP-1 inhibition. In particular, this helps to differentiate between PARP-1 and PARP-2, which share very high similarity at the active site, and double knockout of PARP-1 and PARP-2 is lethal during embryogenesis (M&#xE9;nissier de Murcia et al., 2003[<xref ref-type="bibr" rid="R46">46</xref>]). PARP-2 knockout in mice also demonstrated a role in maintaining the genetic integrity of hematopoietic stem&#x2F;progenitor cells (Farr&#xE9;s et al., 2013[<xref ref-type="bibr" rid="R16">16</xref>]). Cross-reactivity of inhibitors with PARP-2 could therefore have significant side-effects.</p><p>The amino acids making up the AD binding site include Glu763, Asp766, Asn767, Leu769, Asp770, His862, Ser864, Asn868, Ile872, Gly876, Ile877, Arg878, and Ala880, as defined by several co-crystal structures (Kinoshita et al., 2004[<xref ref-type="bibr" rid="R35">35</xref>]; Patel et al., 2012[<xref ref-type="bibr" rid="R58">58</xref>], 2014[<xref ref-type="bibr" rid="R57">57</xref>]). Glu763, Asp766, Asn767, and Asp770 are part of the helical domain which uncoils upon DNA-binding activation, thus enabling inhibitors to insert into the catalytic pocket (van Beek et al., 2021[<xref ref-type="bibr" rid="R75">75</xref>]). Ishida and colleagues (2006[<xref ref-type="bibr" rid="R28">28</xref>]) used structure-based drug design to understand the different interactions of inhibitors between PARP-1 and PARP-2 (Ishida et al., 2006[<xref ref-type="bibr" rid="R28">28</xref>]). They discovered that two chemical frameworks, quinazolinone and quinoxaline derivatives, fit the AD binding site differently and inhibit PARP-1 and PARP-2, respectively. Zhao and colleagues (2017[<xref ref-type="bibr" rid="R84">84</xref>]) modified the spacer and the <italic>N</italic>-Boc-pyrrolidin-3-yl subunit of a quinazoline-2,4(1<italic>H</italic>,3<italic>H</italic>)-dione derivative to adjust the interaction within both the spacer and the AD binding site (Zhao et al., 2017[<xref ref-type="bibr" rid="R84">84</xref>]). Moreover, Zhou and colleagues (2021[<xref ref-type="bibr" rid="R86">86</xref>]) exploited the unique AD binding site between PARP-1 and PARP-2 to generate a series of quinazoline-2,4(1<italic>H</italic>,3<italic>H</italic>)-dione derivatives with a variety of substituted cyclic amines (Zhou et al., 2021[<xref ref-type="bibr" rid="R86">86</xref>]). They reported that compound 24, which had an (<italic>R</italic>)-3-ethyl piperazine ring, showed high enzymatic potency and selectivity toward PARP-1. This compound also demonstrated an acceptable pharmacokinetic profile and reduced tumor growth in xenograft and orthotopic models of breast cancer and glioblastoma, respectively. Co-crystallization of PARP-1 with compounds 4 (PDB ligand ID 6WZ) and 6 (PDB ligand ID 6X2) demonstrated a favorable hydrophobic interaction of either the methyl or ethyl substituent on the piperazine ring with the key amino acids His862 and Leu877. Additionally, the substituents on the piperazine nitrogen projected onto a key subpocket consisting of Asp766, Leu769, and Asp770 in PARP-1. Leu769 is replaced by Gly338 in PARP-2, and so this was used as rational for PARP-1 selectivity. Johannes and colleagues (2021[<xref ref-type="bibr" rid="R30">30</xref>]) attached various aryl piperazines to an 8-chloroquinazolinone core and found that the interactions between 1) the piperazine moiety and His862 through water molecules and 2) the imidazole moiety and Asp770 via a hydrogen bond resulted in selectivity toward PARP-1 (Johannes et al., 2021[<xref ref-type="bibr" rid="R30">30</xref>]). Yu and colleagues (2022[<xref ref-type="bibr" rid="R83">83</xref>]) used the key amino acid differences between PARP-1 (Gln759, Glu763, and Asp766) and PARP-2 (Gln324, Ser328, and Gln332) and further modified rucaparib to obtain increased selectivity of PARP-1 inhibitors (Yu et al., 2022[<xref ref-type="bibr" rid="R83">83</xref>]). They discovered that Y49 showed excellent selectivity (IC<sub>50</sub> of PARP-1 and PARP-2, 0.96 nM and 61.90 nM, respectively). Molecular docking demonstrated hydrogen bond formation between the amino group of 4-aminopiperidine-1-yl with Glu763 and Asp766 in PARP-1, whereas 4-aminopiperidine-1-yl caused steric hindrance in PARP-2. Thus, they suggested that nitrogen-containing basic substituents were required to fit into the hydrophilic pocket formed by acidic amino acids around the AD site.</p></sec><sec><title>Model deployment as web server</title><p>To facilitate accessibility for non-chemoinformatic scientists who intend to determine whether their compounds have PARP-1 inhibitory activity, a public web server was created. Thus, the predictive model, PARP1pred, is available at <ext-link ext-link-type="uri" xlink:href="https:&#47;&#47;parp1pred.streamlitapp.com">https:&#47;&#47;parp1pred.streamlitapp.com</ext-link>.</p><p>Briefly, the PARP1pred web server uses SMILES as the input for the query compound. PadelPy is used to convert SMILES to PubChem fingerprints, which are then used as an input to trained classification models whose outputs are reported as active or inactive (Figure 9<xref ref-type="fig" rid="F9">(Fig. 9)</xref>).</p></sec></sec>
    <sec sec-type="conclusions">
      <title>Conclusion</title><p>In the era of precision medicine, targeting of DNA repair is effective in killing cancer cells. PARP-1 plays a role in DNA damage and repair, and is a well-known target for cancers with <italic>BRCA1&#x2F;2</italic> mutations. Several drugs targeting PARP-1 have been FDA approved; however, accessing such targeted drugs is problematic because of their high cost, particularly in middle- and low-income countries. Thus, advancements in drug development would contribute to the alleviation of such access constraints. In this study, computer-aided drug design was used to understand the relationship between the chemical structures of inhibitors and PARP-1 through the QSAR building model. Understanding such relationships will facilitate rational drug design to effectively target PARP-1. Our study retrieved a set of biological activities from the ChEMBL database that contained 2018 non-redundant compounds. A PubChem fingerprint-based random forest classification model from an oversampling approach was built to predict PARP-1 activity. Gini index calculation revealed the important features in the random forest model, which included aromatic&#x2F;cyclic&#x2F;heterocyclic moieties and nitrogen-containing fingerprints, and ether&#x2F;aldehyde&#x2F;alcohol moieties. Additionally, a detailed examination of the structure-activity relationship revealed that hydrophobic interactions and hydrogen bonding networks with nitrogen-containing scaffolds are critical for developing PARP-1 inhibitors. As a result, this insight provides a framework for data-driven PARP-1 inhibitor design.</p></sec>
    <sec>
      <title>Notes</title><p>Tassanee Lerksuthirat, Aijaz Ahmad Malik (Center of Excellence in Computational Molecular Biology, Faculty of Medicine, Chulalongkorn University, Bangkok 10330, Thailand; E-mail: ajaz&#x5F;me&#x40;hotmail.com) and Chanin Nantasenamat (Streamlit Open Source, Snowflake Inc., USA; E-mail: hellodataprofessor&#x40;gmail.com) contributed equally as corresponding author.</p></sec>
    <sec>
      <title>Declaration</title><sec><title>Conflict of interests</title><p>All authors declare that there are no conflicts of interest.</p></sec><sec><title>Acknowledgments</title><p>We thank Dr. Patipark Kueanjinda for useful discussion on machine learning. We thank Catherine Perfect, MA (Cantab), from Edanz (www.edanz.com&#x2F;ac), for editing a draft of this manuscript. This project is funded by the National Research Council of Thailand (NRCT) and Mahidol University (NRCT5-TRG63009-04).</p></sec></sec>
  </body>
  <back>
    <ref-list>
      <ref id="R1">
        <label>1</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Abbasi-Radmoghaddam</surname>
              <given-names>Z</given-names>
            </name>
            <name>
              <surname>Riahi</surname>
              <given-names>S</given-names>
            </name>
            <name>
              <surname>Gharaghani</surname>
              <given-names>S</given-names>
            </name>
            <name>
              <surname>Mohammadi-Khanaposhtanai</surname>
              <given-names>M</given-names>
            </name>
          </person-group>
          <article-title>Design of potential anti-tumor PARP-1 inhibitors by QSAR and molecular modeling studies</article-title>
          <source>Mol Diversity</source>
          <year>2021</year>
          <volume>25</volume>
          <fpage>263</fpage>
          <lpage>277</lpage>
        </citation>
      </ref>
      <ref id="R2">
        <label>2</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Armstrong</surname>
              <given-names>JF</given-names>
            </name>
            <name>
              <surname>Faccenda</surname>
              <given-names>E</given-names>
            </name>
            <name>
              <surname>Harding</surname>
              <given-names>SD</given-names>
            </name>
            <name>
              <surname>Pawson</surname>
              <given-names>AJ</given-names>
            </name>
            <name>
              <surname>Southan</surname>
              <given-names>C</given-names>
            </name>
            <name>
              <surname>Sharman</surname>
              <given-names>JL</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>The IUPHAR&#x2F;BPS Guide to PHARMACOLOGY in 2020: extending immunopharmacology content and introducing the IUPHAR&#x2F;MMV Guide to MALARIA PHARMACOLOGY</article-title>
          <source>Nucleic Acids Res</source>
          <year>2020</year>
          <volume>48</volume>
          <issue>D1</issue>
          <fpage>D1006</fpage>
          <lpage>D1021</lpage>
        </citation>
      </ref>
      <ref id="R3">
        <label>3</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Balasubramaniam</surname>
              <given-names>S</given-names>
            </name>
            <name>
              <surname>Beaver</surname>
              <given-names>JA</given-names>
            </name>
            <name>
              <surname>Horton</surname>
              <given-names>S</given-names>
            </name>
            <name>
              <surname>Fernandes</surname>
              <given-names>LL</given-names>
            </name>
            <name>
              <surname>Tang</surname>
              <given-names>S</given-names>
            </name>
            <name>
              <surname>Horne</surname>
              <given-names>HN</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>fda approval summary: Rucaparib for the treatment of patients with deleterious BRCA mutation-associated advanced ovarian cancer</article-title>
          <source>Clin Cancer Res</source>
          <year>2017</year>
          <volume>23</volume>
          <fpage>7165</fpage>
          <lpage>7170</lpage>
        </citation>
      </ref>
      <ref id="R4">
        <label>4</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Banasik</surname>
              <given-names>M</given-names>
            </name>
            <name>
              <surname>Komura</surname>
              <given-names>H</given-names>
            </name>
            <name>
              <surname>Shimoyama</surname>
              <given-names>M</given-names>
            </name>
            <name>
              <surname>Ueda</surname>
              <given-names>K</given-names>
            </name>
          </person-group>
          <article-title>Specific inhibitors of poly(ADP-ribose) synthetase and mono(ADP-ribosyl)transferase</article-title>
          <source>J Biol Chem</source>
          <year>1992</year>
          <volume>267</volume>
          <fpage>1569</fpage>
          <lpage>1575</lpage>
        </citation>
      </ref>
      <ref id="R5">
        <label>5</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Baudino</surname>
              <given-names>TA</given-names>
            </name>
          </person-group>
          <article-title>Targeted cancer therapy: the next generation of cancer treatment</article-title>
          <source>Curr Drug Discov Technol</source>
          <year>2015</year>
          <volume>12</volume>
          <fpage>3</fpage>
          <lpage>20</lpage>
        </citation>
      </ref>
      <ref id="R6">
        <label>6</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Beck</surname>
              <given-names>C</given-names>
            </name>
            <name>
              <surname>Robert</surname>
              <given-names>I</given-names>
            </name>
            <name>
              <surname>Reina-San-Martin</surname>
              <given-names>B</given-names>
            </name>
            <name>
              <surname>Schreiber</surname>
              <given-names>V</given-names>
            </name>
            <name>
              <surname>Dantzer</surname>
              <given-names>F</given-names>
            </name>
          </person-group>
          <article-title>Poly(ADP-ribose) polymerases in double-strand break repair: focus on PARP1, PARP2 and PARP3</article-title>
          <source>Exp Cell Res</source>
          <year>2014</year>
          <volume>329</volume>
          <fpage>18</fpage>
          <lpage>25</lpage>
        </citation>
      </ref>
      <ref id="R7">
        <label>7</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Breiman</surname>
              <given-names>L</given-names>
            </name>
          </person-group>
          <article-title>Random forests</article-title>
          <source>Machine Learning</source>
          <year>2001</year>
          <volume>45</volume>
          <issue>1</issue>
          <fpage>5</fpage>
          <lpage>32</lpage>
        </citation>
      </ref>
      <ref id="R8">
        <label>8</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Brown</surname>
              <given-names>JS</given-names>
            </name>
            <name>
              <surname>O&#x27;Carrigan</surname>
              <given-names>B</given-names>
            </name>
            <name>
              <surname>Jackson</surname>
              <given-names>SP</given-names>
            </name>
            <name>
              <surname>Yap</surname>
              <given-names>TA</given-names>
            </name>
          </person-group>
          <article-title>Targeting DNA repair in cancer: beyond PARP inhibitors</article-title>
          <source>Cancer Discov</source>
          <year>2017</year>
          <volume>7</volume>
          <fpage>20</fpage>
          <lpage>37</lpage>
        </citation>
      </ref>
      <ref id="R9">
        <label>9</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Calle</surname>
              <given-names>ML</given-names>
            </name>
            <name>
              <surname>Urrea</surname>
              <given-names>V</given-names>
            </name>
          </person-group>
          <article-title>Letter to the Editor: Stability of Random Forest importance measures</article-title>
          <source>Brief Bioinform</source>
          <year>2010</year>
          <volume>12</volume>
          <issue>1</issue>
          <fpage>86</fpage>
          <lpage>89</lpage>
        </citation>
      </ref>
      <ref id="R10">
        <label>10</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Carracedo-Reboredo</surname>
              <given-names>P</given-names>
            </name>
            <name>
              <surname>Li&#xF1;ares-Blanco</surname>
              <given-names>J</given-names>
            </name>
            <name>
              <surname>Rodr&#xED;guez-Fern&#xE1;ndez</surname>
              <given-names>N</given-names>
            </name>
            <name>
              <surname>Cedr&#xF3;n</surname>
              <given-names>F</given-names>
            </name>
            <name>
              <surname>Novoa</surname>
              <given-names>FJ</given-names>
            </name>
            <name>
              <surname>Carballal</surname>
              <given-names>A</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>A review on machine learning approaches and trends in drug discovery</article-title>
          <source>Comput Struct Biotechnol J</source>
          <year>2021</year>
          <volume>19</volume>
          <fpage>4538</fpage>
          <lpage>4558</lpage>
        </citation>
      </ref>
      <ref id="R11">
        <label>11</label>
        <citation citation-type="other">
          <collab>Cheung AK, Chin DN, Fan J, Miller-Moslin KM, Shultz MD, Smith TD,., inventors</collab>
          <article-title>2-piperidin-1-yl-acetamide compounds for use as tankyrase inhibitors. US patent, US9181266B2</article-title>
          <day>10</day>
          <month>11</month>
          <year>2015</year>
        </citation>
      </ref>
      <ref id="R12">
        <label>12</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Cortes-Ciriano</surname>
              <given-names>I</given-names>
            </name>
            <name>
              <surname>Bender</surname>
              <given-names>A</given-names>
            </name>
            <name>
              <surname>Malliavin</surname>
              <given-names>T</given-names>
            </name>
          </person-group>
          <article-title>Prediction of PARP inhibition with proteochemometric modelling and conformal prediction</article-title>
          <source>Mol Inform</source>
          <year>2015</year>
          <volume>34</volume>
          <fpage>357</fpage>
          <lpage>366</lpage>
        </citation>
      </ref>
      <ref id="R13">
        <label>13</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Davies</surname>
              <given-names>M</given-names>
            </name>
            <name>
              <surname>Nowotka</surname>
              <given-names>M</given-names>
            </name>
            <name>
              <surname>Papadatos</surname>
              <given-names>G</given-names>
            </name>
            <name>
              <surname>Dedman</surname>
              <given-names>N</given-names>
            </name>
            <name>
              <surname>Gaulton</surname>
              <given-names>A</given-names>
            </name>
            <name>
              <surname>Atkinson</surname>
              <given-names>F</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>ChEMBL web services: streamlining access to drug discovery data and utilities</article-title>
          <source>Nucleic Acids Res</source>
          <year>2015</year>
          <volume>43</volume>
          <issue>W1</issue>
          <fpage>W612</fpage>
          <lpage>W620</lpage>
        </citation>
      </ref>
      <ref id="R14">
        <label>14</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>de Bono</surname>
              <given-names>J</given-names>
            </name>
            <name>
              <surname>Mateo</surname>
              <given-names>J</given-names>
            </name>
            <name>
              <surname>Fizazi</surname>
              <given-names>K</given-names>
            </name>
            <name>
              <surname>Saad</surname>
              <given-names>F</given-names>
            </name>
            <name>
              <surname>Shore</surname>
              <given-names>N</given-names>
            </name>
            <name>
              <surname>Sandhu</surname>
              <given-names>S</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>Olaparib for metastatic castration-resistant prostate cancer</article-title>
          <source>N Engl J Med</source>
          <year>2020</year>
          <volume>382</volume>
          <fpage>2091</fpage>
          <lpage>2102</lpage>
        </citation>
      </ref>
      <ref id="R15">
        <label>15</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Dunn</surname>
              <given-names>D</given-names>
            </name>
            <name>
              <surname>Husten</surname>
              <given-names>J</given-names>
            </name>
            <name>
              <surname>Ator</surname>
              <given-names>MA</given-names>
            </name>
            <name>
              <surname>Chatterjee</surname>
              <given-names>S</given-names>
            </name>
          </person-group>
          <article-title>Novel poly(ADP-ribose) polymerase-1 inhibitors</article-title>
          <source>Bioorg Med Chem Lett</source>
          <year>2012</year>
          <volume>22</volume>
          <fpage>222</fpage>
          <lpage>224</lpage>
        </citation>
      </ref>
      <ref id="R16">
        <label>16</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Farr&#xE9;s</surname>
              <given-names>J</given-names>
            </name>
            <name>
              <surname>Mart&#xED;n-Caballero</surname>
              <given-names>J</given-names>
            </name>
            <name>
              <surname>Mart&#xED;nez</surname>
              <given-names>C</given-names>
            </name>
            <name>
              <surname>Lozano</surname>
              <given-names>JJ</given-names>
            </name>
            <name>
              <surname>Llacuna</surname>
              <given-names>L</given-names>
            </name>
            <name>
              <surname>Ampurdan&#xE9;s</surname>
              <given-names>C</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>Parp-2 is required to maintain hematopoiesis following sublethal &#x3B3;-irradiation in mice</article-title>
          <source>Blood</source>
          <year>2013</year>
          <volume>122</volume>
          <fpage>44</fpage>
          <lpage>54</lpage>
        </citation>
      </ref>
      <ref id="R17">
        <label>17</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Ferraris</surname>
              <given-names>D</given-names>
            </name>
            <name>
              <surname>Ficco</surname>
              <given-names>RP</given-names>
            </name>
            <name>
              <surname>Pahutski</surname>
              <given-names>T</given-names>
            </name>
            <name>
              <surname>Lautar</surname>
              <given-names>S</given-names>
            </name>
            <name>
              <surname>Huang</surname>
              <given-names>S</given-names>
            </name>
            <name>
              <surname>Zhang</surname>
              <given-names>J</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>Design and synthesis of poly(ADP-ribose)polymerase-1 (PARP-1) inhibitors. Part 3: In vitro evaluation of 1,3,4,5-Tetrahydro-benzo&#x5B;c&#x5D;&#x5B;1,6&#x5D;- and &#x5B;c&#x5D;&#x5B;1,7&#x5D;-naphthyridin-6-ones</article-title>
          <source>Bioorg Med Chem Lett</source>
          <year>2003</year>
          <volume>13</volume>
          <fpage>2513</fpage>
          <lpage>2518</lpage>
        </citation>
      </ref>
      <ref id="R18">
        <label>18</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Ferraris</surname>
              <given-names>D</given-names>
            </name>
            <name>
              <surname>Ko</surname>
              <given-names>Y-S</given-names>
            </name>
            <name>
              <surname>Pahutski</surname>
              <given-names>T</given-names>
            </name>
            <name>
              <surname>Ficco</surname>
              <given-names>RP</given-names>
            </name>
            <name>
              <surname>Serdyuk</surname>
              <given-names>L</given-names>
            </name>
            <name>
              <surname>Alemu</surname>
              <given-names>C</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>Design and synthesis of poly ADP-ribose polymerase-1 inhibitors. 2. Biological evaluation of Aza-5&#x5B;H&#x5D;-phenanthridin-6-ones as potent, aqueous-soluble compounds for the treatment of ischemic injuries</article-title>
          <source>J Med Chem</source>
          <year>2003</year>
          <volume>46</volume>
          <fpage>3138</fpage>
          <lpage>3151</lpage>
        </citation>
      </ref>
      <ref id="R19">
        <label>19</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Ferraris</surname>
              <given-names>DV</given-names>
            </name>
          </person-group>
          <article-title>Evolution of poly(ADP-ribose) polymerase-1 (PARP-1) inhibitors. From concept to clinic</article-title>
          <source>J Med Chem</source>
          <year>2010</year>
          <volume>53</volume>
          <fpage>4561</fpage>
          <lpage>4584</lpage>
        </citation>
      </ref>
      <ref id="R20">
        <label>20</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Fong</surname>
              <given-names>PC</given-names>
            </name>
            <name>
              <surname>Boss</surname>
              <given-names>DS</given-names>
            </name>
            <name>
              <surname>Yap</surname>
              <given-names>TA</given-names>
            </name>
            <name>
              <surname>Tutt</surname>
              <given-names>A</given-names>
            </name>
            <name>
              <surname>Wu</surname>
              <given-names>P</given-names>
            </name>
            <name>
              <surname>Mergui-Roelvink</surname>
              <given-names>M</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>Inhibition of poly(ADP-ribose) polymerase in tumors from BRCA mutation carriers</article-title>
          <source>N Engl J Med</source>
          <year>2009</year>
          <volume>361</volume>
          <fpage>123</fpage>
          <lpage>134</lpage>
        </citation>
      </ref>
      <ref id="R21">
        <label>21</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Fundytus</surname>
              <given-names>A</given-names>
            </name>
            <name>
              <surname>Sengar</surname>
              <given-names>M</given-names>
            </name>
            <name>
              <surname>Lombe</surname>
              <given-names>D</given-names>
            </name>
            <name>
              <surname>Hopman</surname>
              <given-names>W</given-names>
            </name>
            <name>
              <surname>Jalink</surname>
              <given-names>M</given-names>
            </name>
            <name>
              <surname>Gyawali</surname>
              <given-names>B</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>Access to cancer medicines deemed essential by oncologists in 82 countries: an international, cross-sectional survey</article-title>
          <source>Lancet Oncol</source>
          <year>2021</year>
          <volume>22</volume>
          <fpage>1367</fpage>
          <lpage>1377</lpage>
        </citation>
      </ref>
      <ref id="R22">
        <label>22</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Gilson</surname>
              <given-names>MK</given-names>
            </name>
            <name>
              <surname>Liu</surname>
              <given-names>T</given-names>
            </name>
            <name>
              <surname>Baitaluk</surname>
              <given-names>M</given-names>
            </name>
            <name>
              <surname>Nicola</surname>
              <given-names>G</given-names>
            </name>
            <name>
              <surname>Hwang</surname>
              <given-names>L</given-names>
            </name>
            <name>
              <surname>Chong</surname>
              <given-names>J</given-names>
            </name>
          </person-group>
          <article-title>BindingDB in 2015: A public database for medicinal chemistry, computational chemistry and systems pharmacology</article-title>
          <source>Nucleic Acids Res</source>
          <year>2016</year>
          <volume>44</volume>
          <fpage>D1045</fpage>
          <lpage>D1 53</lpage>
        </citation>
      </ref>
      <ref id="R23">
        <label>23</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Golan</surname>
              <given-names>T</given-names>
            </name>
            <name>
              <surname>Hammel</surname>
              <given-names>P</given-names>
            </name>
            <name>
              <surname>Reni</surname>
              <given-names>M</given-names>
            </name>
            <name>
              <surname>Van Cutsem</surname>
              <given-names>E</given-names>
            </name>
            <name>
              <surname>Macarulla</surname>
              <given-names>T</given-names>
            </name>
            <name>
              <surname>Hall</surname>
              <given-names>MJ</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>Maintenance olaparib for germline BRCA-mutated metastatic pancreatic cancer</article-title>
          <source>N Engl J Med</source>
          <year>2019</year>
          <fpage>381317</fpage>
          <lpage>381327</lpage>
        </citation>
      </ref>
      <ref id="R24">
        <label>24</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Gupte</surname>
              <given-names>R</given-names>
            </name>
            <name>
              <surname>Liu</surname>
              <given-names>Z</given-names>
            </name>
            <name>
              <surname>Kraus</surname>
              <given-names>WL</given-names>
            </name>
          </person-group>
          <article-title>PARPs and ADP-ribosylation: recent advances linking molecular functions to biological outcomes</article-title>
          <source>Genes Dev</source>
          <year>2017</year>
          <volume>31</volume>
          <fpage>101</fpage>
          <lpage>126</lpage>
        </citation>
      </ref>
      <ref id="R25">
        <label>25</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Halder</surname>
              <given-names>AK</given-names>
            </name>
            <name>
              <surname>Saha</surname>
              <given-names>A</given-names>
            </name>
            <name>
              <surname>Saha</surname>
              <given-names>KD</given-names>
            </name>
            <name>
              <surname>Jha</surname>
              <given-names>T</given-names>
            </name>
          </person-group>
          <article-title>Stepwise development of structure-activity relationship of diverse PARP-1 inhibitors through comparative and validated in silico modeling techniques and molecular dynamics simulation</article-title>
          <source>J Biomol Struct Dyn</source>
          <year>2015</year>
          <volume>33</volume>
          <fpage>1756</fpage>
          <lpage>1779</lpage>
        </citation>
      </ref>
      <ref id="R26">
        <label>26</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Helleday</surname>
              <given-names>T</given-names>
            </name>
            <name>
              <surname>Petermann</surname>
              <given-names>E</given-names>
            </name>
            <name>
              <surname>Lundin</surname>
              <given-names>C</given-names>
            </name>
            <name>
              <surname>Hodgson</surname>
              <given-names>B</given-names>
            </name>
            <name>
              <surname>Sharma</surname>
              <given-names>RA</given-names>
            </name>
          </person-group>
          <article-title>DNA repair pathways as targets for cancer therapy</article-title>
          <source>Nat Rev Cancer</source>
          <year>2008</year>
          <volume>8</volume>
          <fpage>193</fpage>
          <lpage>204</lpage>
        </citation>
      </ref>
      <ref id="R27">
        <label>27</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Hoy</surname>
              <given-names>SM</given-names>
            </name>
          </person-group>
          <article-title>Talazoparib: first global approval</article-title>
          <source>Drugs</source>
          <year>2018</year>
          <volume>78</volume>
          <fpage>1939</fpage>
          <lpage>1946</lpage>
        </citation>
      </ref>
      <ref id="R28">
        <label>28</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Ishida</surname>
              <given-names>J</given-names>
            </name>
            <name>
              <surname>Yamamoto</surname>
              <given-names>H</given-names>
            </name>
            <name>
              <surname>Kido</surname>
              <given-names>Y</given-names>
            </name>
            <name>
              <surname>Kamijo</surname>
              <given-names>K</given-names>
            </name>
            <name>
              <surname>Murano</surname>
              <given-names>K</given-names>
            </name>
            <name>
              <surname>Miyake</surname>
              <given-names>H</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>Discovery of potent and selective PARP-1 and PARP-2 inhibitors: SBDD analysis via a combination of X-ray structural study and homology modeling</article-title>
          <source>Bioorg Med Chem</source>
          <year>2006</year>
          <volume>14</volume>
          <fpage>1378</fpage>
          <lpage>1390</lpage>
        </citation>
      </ref>
      <ref id="R29">
        <label>29</label>
        <citation citation-type="other">
          <collab>Ji J, Guo N, Xue T, Kang B, Ye X, Chen X,., inventors</collab>
          <article-title>Poly (ADP-ribose) polymerase inhibitor. US patent, US9187430B2</article-title>
          <day>17</day>
          <month>11</month>
          <year>2015</year>
        </citation>
      </ref>
      <ref id="R30">
        <label>30</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Johannes</surname>
              <given-names>JW</given-names>
            </name>
            <name>
              <surname>Balazs</surname>
              <given-names>A</given-names>
            </name>
            <name>
              <surname>Barratt</surname>
              <given-names>D</given-names>
            </name>
            <name>
              <surname>Bista</surname>
              <given-names>M</given-names>
            </name>
            <name>
              <surname>Chuba</surname>
              <given-names>MD</given-names>
            </name>
            <name>
              <surname>Cosulich</surname>
              <given-names>S</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>Discovery of 5-&#x7B;4-&#x5B;(7-Ethyl-6-oxo-5,6-dihydro-1,5-naphthyridin-3-yl)methyl&#x5D;piperazin-1-yl&#x7D;-N-methylpyridine-2-carboxamide (AZD5305): A PARP1&#x2013;DNA trapper with high selectivity for PARP1 over PARP2 and other PARPs</article-title>
          <source>J Med Chem</source>
          <year>2021</year>
          <volume>64</volume>
          <fpage>14498</fpage>
          <lpage>14512</lpage>
        </citation>
      </ref>
      <ref id="R31">
        <label>31</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Kanan</surname>
              <given-names>T</given-names>
            </name>
            <name>
              <surname>Kanan</surname>
              <given-names>D</given-names>
            </name>
            <name>
              <surname>Al Shardoub</surname>
              <given-names>EJ</given-names>
            </name>
            <name>
              <surname>Durdagi</surname>
              <given-names>S</given-names>
            </name>
          </person-group>
          <article-title>Transcription factor NF-&#x3BA;B as target for SARS-CoV-2 drug discovery efforts using inflammation-based QSAR screening model</article-title>
          <source>J Mol Graph Model</source>
          <year>2021</year>
          <volume>108</volume>
          <fpage>107968</fpage>
        </citation>
      </ref>
      <ref id="R32">
        <label>32</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Kennard</surname>
              <given-names>RW</given-names>
            </name>
            <name>
              <surname>Stone</surname>
              <given-names>LA</given-names>
            </name>
          </person-group>
          <article-title>Computer aided design of experiments</article-title>
          <source>Technometrics</source>
          <year>1969</year>
          <volume>11</volume>
          <fpage>137</fpage>
          <lpage>148</lpage>
        </citation>
      </ref>
      <ref id="R33">
        <label>33</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Kim</surname>
              <given-names>G</given-names>
            </name>
            <name>
              <surname>Ison</surname>
              <given-names>G</given-names>
            </name>
            <name>
              <surname>McKee</surname>
              <given-names>AE</given-names>
            </name>
            <name>
              <surname>Zhang</surname>
              <given-names>H</given-names>
            </name>
            <name>
              <surname>Tang</surname>
              <given-names>S</given-names>
            </name>
            <name>
              <surname>Gwise</surname>
              <given-names>T</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>FDA approval summary: Olaparib monotherapy in patients with deleterious germline BRCA-mutated advanced ovarian cancer treated with three or more lines of chemotherapy</article-title>
          <source>Clin Cancer Res</source>
          <year>2015</year>
          <volume>21</volume>
          <fpage>4257</fpage>
          <lpage>4261</lpage>
        </citation>
      </ref>
      <ref id="R34">
        <label>34</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Kim</surname>
              <given-names>S</given-names>
            </name>
            <name>
              <surname>Thiessen</surname>
              <given-names>PA</given-names>
            </name>
            <name>
              <surname>Bolton</surname>
              <given-names>EE</given-names>
            </name>
            <name>
              <surname>Chen</surname>
              <given-names>J</given-names>
            </name>
            <name>
              <surname>Fu</surname>
              <given-names>G</given-names>
            </name>
            <name>
              <surname>Gindulyte</surname>
              <given-names>A</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>PubChem substance and compound databases</article-title>
          <source>Nucleic Acids Res</source>
          <year>2016</year>
          <volume>44</volume>
          <fpage>D1202</fpage>
          <lpage>D1213</lpage>
        </citation>
      </ref>
      <ref id="R35">
        <label>35</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Kinoshita</surname>
              <given-names>T</given-names>
            </name>
            <name>
              <surname>Nakanishi</surname>
              <given-names>I</given-names>
            </name>
            <name>
              <surname>Warizaya</surname>
              <given-names>M</given-names>
            </name>
            <name>
              <surname>Iwashita</surname>
              <given-names>A</given-names>
            </name>
            <name>
              <surname>Kido</surname>
              <given-names>Y</given-names>
            </name>
            <name>
              <surname>Hattori</surname>
              <given-names>K</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>Inhibitor-induced structural change of the active site of human poly(ADP-ribose) polymerase</article-title>
          <source>FEBS Letters</source>
          <year>2004</year>
          <volume>556</volume>
          <fpage>43</fpage>
          <lpage>46</lpage>
        </citation>
      </ref>
      <ref id="R36">
        <label>36</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Ledermann</surname>
              <given-names>J</given-names>
            </name>
            <name>
              <surname>Harter</surname>
              <given-names>P</given-names>
            </name>
            <name>
              <surname>Gourley</surname>
              <given-names>C</given-names>
            </name>
            <name>
              <surname>Friedlander</surname>
              <given-names>M</given-names>
            </name>
            <name>
              <surname>Vergote</surname>
              <given-names>I</given-names>
            </name>
            <name>
              <surname>Rustin</surname>
              <given-names>G</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>Olaparib maintenance therapy in platinum-sensitive relapsed ovarian Cancer</article-title>
          <source>N Engl J Med</source>
          <year>2012</year>
          <volume>366</volume>
          <fpage>1382</fpage>
          <lpage>1392</lpage>
        </citation>
      </ref>
      <ref id="R37">
        <label>37</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Li</surname>
              <given-names>J</given-names>
            </name>
            <name>
              <surname>Zhou</surname>
              <given-names>N</given-names>
            </name>
            <name>
              <surname>Cai</surname>
              <given-names>P</given-names>
            </name>
            <name>
              <surname>Bao</surname>
              <given-names>J</given-names>
            </name>
          </person-group>
          <article-title>In silico screening identifies a novel potential PARP1 inhibitor targeting synthetic lethality in cancer treatment</article-title>
          <source>Int J Mol Sci</source>
          <year>2016</year>
          <volume>17</volume>
          <issue>2</issue>
          <fpage>258</fpage>
        </citation>
      </ref>
      <ref id="R38">
        <label>38</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Li</surname>
              <given-names>N</given-names>
            </name>
            <name>
              <surname>Bu</surname>
              <given-names>H</given-names>
            </name>
            <name>
              <surname>Liu</surname>
              <given-names>J</given-names>
            </name>
            <name>
              <surname>Zhu</surname>
              <given-names>J</given-names>
            </name>
            <name>
              <surname>Zhou</surname>
              <given-names>Q</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>L</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>An open-label, multicenter, single-arm, phase ii study of fluzoparib in patients with germline BRCA1&#x2F;2 mutation and platinum-sensitive recurrent ovarian cancer</article-title>
          <source>Clin Cancer Res</source>
          <year>2021</year>
          <volume>27</volume>
          <fpage>2452</fpage>
          <lpage>2458</lpage>
        </citation>
      </ref>
      <ref id="R39">
        <label>39</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Lipinski</surname>
              <given-names>CA</given-names>
            </name>
            <name>
              <surname>Lombardo</surname>
              <given-names>F</given-names>
            </name>
            <name>
              <surname>Dominy</surname>
              <given-names>BW</given-names>
            </name>
            <name>
              <surname>Feeney</surname>
              <given-names>PJ</given-names>
            </name>
          </person-group>
          <article-title>Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings</article-title>
          <source>Adv Drug Deliv Rev</source>
          <year>2001</year>
          <volume>46</volume>
          <fpage>3</fpage>
          <lpage>26</lpage>
        </citation>
      </ref>
      <ref id="R40">
        <label>40</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Malik</surname>
              <given-names>AA</given-names>
            </name>
            <name>
              <surname>Phanus-Umporn</surname>
              <given-names>C</given-names>
            </name>
            <name>
              <surname>Schaduangrat</surname>
              <given-names>N</given-names>
            </name>
            <name>
              <surname>Shoombuatong</surname>
              <given-names>W</given-names>
            </name>
            <name>
              <surname>Isarankura-Na-Ayudhya</surname>
              <given-names>C</given-names>
            </name>
            <name>
              <surname>Nantasenamat</surname>
              <given-names>C</given-names>
            </name>
          </person-group>
          <article-title>HCVpred: A web server for predicting the bioactivity of hepatitis C virus NS5B inhibitors</article-title>
          <source>J Comput Chem</source>
          <year>2020</year>
          <volume>41</volume>
          <fpage>1820</fpage>
          <lpage>1834</lpage>
        </citation>
      </ref>
      <ref id="R41">
        <label>41</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Malyuchenko</surname>
              <given-names>NV</given-names>
            </name>
            <name>
              <surname>Kotova</surname>
              <given-names>EY</given-names>
            </name>
            <name>
              <surname>Kulaeva</surname>
              <given-names>OI</given-names>
            </name>
            <name>
              <surname>Kirpichnikov</surname>
              <given-names>MP</given-names>
            </name>
            <name>
              <surname>Studitskiy</surname>
              <given-names>VM</given-names>
            </name>
          </person-group>
          <article-title>PARP1 inhibitors: antitumor drug design</article-title>
          <source>Acta Naturae</source>
          <year>2015</year>
          <volume>7</volume>
          <issue>3</issue>
          <fpage>27</fpage>
          <lpage>37</lpage>
        </citation>
      </ref>
      <ref id="R42">
        <label>42</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Mateo</surname>
              <given-names>J</given-names>
            </name>
            <name>
              <surname>Carreira</surname>
              <given-names>S</given-names>
            </name>
            <name>
              <surname>Sandhu</surname>
              <given-names>S</given-names>
            </name>
            <name>
              <surname>Miranda</surname>
              <given-names>S</given-names>
            </name>
            <name>
              <surname>Mossop</surname>
              <given-names>H</given-names>
            </name>
            <name>
              <surname>Perez-Lopez</surname>
              <given-names>R</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>DNA-repair defects and olaparib in metastatic prostate cancer</article-title>
          <source>N Engl J Med</source>
          <year>2015</year>
          <volume>373</volume>
          <fpage>1697</fpage>
          <lpage>1708</lpage>
        </citation>
      </ref>
      <ref id="R43">
        <label>43</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Mateo</surname>
              <given-names>J</given-names>
            </name>
            <name>
              <surname>Lord</surname>
              <given-names>CJ</given-names>
            </name>
            <name>
              <surname>Serra</surname>
              <given-names>V</given-names>
            </name>
            <name>
              <surname>Tutt</surname>
              <given-names>A</given-names>
            </name>
            <name>
              <surname>Balmana</surname>
              <given-names>J</given-names>
            </name>
            <name>
              <surname>Castroviejo-Bermejo</surname>
              <given-names>M</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>A decade of clinical development of PARP inhibitors in perspective</article-title>
          <source>Ann Oncol</source>
          <year>2019</year>
          <volume>30</volume>
          <fpage>1437</fpage>
          <lpage>1447</lpage>
        </citation>
      </ref>
      <ref id="R44">
        <label>44</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Mendez</surname>
              <given-names>D</given-names>
            </name>
            <name>
              <surname>Gaulton</surname>
              <given-names>A</given-names>
            </name>
            <name>
              <surname>Bento</surname>
              <given-names>AP</given-names>
            </name>
            <name>
              <surname>Chambers</surname>
              <given-names>J</given-names>
            </name>
            <name>
              <surname>De Veij</surname>
              <given-names>M</given-names>
            </name>
            <name>
              <surname>Felix</surname>
              <given-names>E</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>ChEMBL: towards direct deposition of bioassay data</article-title>
          <source>Nucleic Acids Res</source>
          <year>2019</year>
          <volume>47</volume>
          <issue>D1</issue>
          <fpage>D930</fpage>
          <lpage>D940</lpage>
        </citation>
      </ref>
      <ref id="R45">
        <label>45</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Menear</surname>
              <given-names>KA</given-names>
            </name>
            <name>
              <surname>Adcock</surname>
              <given-names>C</given-names>
            </name>
            <name>
              <surname>Boulter</surname>
              <given-names>R</given-names>
            </name>
            <name>
              <surname>Cockcroft</surname>
              <given-names>X-l</given-names>
            </name>
            <name>
              <surname>Copsey</surname>
              <given-names>L</given-names>
            </name>
            <name>
              <surname>Cranston</surname>
              <given-names>A</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>4-&#x5B;3-(4-Cyclopropanecarbonylpiperazine-1-carbonyl)-4-fluorobenzyl&#x5D;-2H-phthalazin-1-one: A novel bioavailable inhibitor of poly(adp-ribose) polymerase-1</article-title>
          <source>J Med Chem</source>
          <year>2008</year>
          <volume>51</volume>
          <fpage>6581</fpage>
          <lpage>6591</lpage>
        </citation>
      </ref>
      <ref id="R46">
        <label>46</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>M&#xE9;nissier de Murcia</surname>
              <given-names>J</given-names>
            </name>
            <name>
              <surname>Ricoul</surname>
              <given-names>M</given-names>
            </name>
            <name>
              <surname>Tartier</surname>
              <given-names>L</given-names>
            </name>
            <name>
              <surname>Niedergang</surname>
              <given-names>C</given-names>
            </name>
            <name>
              <surname>Huber</surname>
              <given-names>A</given-names>
            </name>
            <name>
              <surname>Dantzer</surname>
              <given-names>F</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>Functional interaction between PARP-1 and PARP-2 in chromosome stability and embryonic development in mouse</article-title>
          <source>EMBO J</source>
          <year>2003</year>
          <volume>22</volume>
          <fpage>2255</fpage>
          <lpage>2263</lpage>
        </citation>
      </ref>
      <ref id="R47">
        <label>47</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Mirza</surname>
              <given-names>MR</given-names>
            </name>
            <name>
              <surname>Monk</surname>
              <given-names>BJ</given-names>
            </name>
            <name>
              <surname>Herrstedt</surname>
              <given-names>J</given-names>
            </name>
            <name>
              <surname>Oza</surname>
              <given-names>AM</given-names>
            </name>
            <name>
              <surname>Mahner</surname>
              <given-names>S</given-names>
            </name>
            <name>
              <surname>Redondo</surname>
              <given-names>A</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>Niraparib maintenance therapy in platinum-sensitive, recurrent ovarian cancer</article-title>
          <source>N Engl J Med</source>
          <year>2016</year>
          <volume>375</volume>
          <fpage>2154</fpage>
          <lpage>2164</lpage>
        </citation>
      </ref>
      <ref id="R48">
        <label>48</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Moree</surname>
              <given-names>WJ</given-names>
            </name>
            <name>
              <surname>Goldman</surname>
              <given-names>P</given-names>
            </name>
            <name>
              <surname>Demaggio</surname>
              <given-names>AJ</given-names>
            </name>
            <name>
              <surname>Christenson</surname>
              <given-names>E</given-names>
            </name>
            <name>
              <surname>Herendeen</surname>
              <given-names>D</given-names>
            </name>
            <name>
              <surname>Eksterowicz</surname>
              <given-names>J</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>Identification of ring-fused pyrazolo pyridin-2-ones as novel poly(ADP-ribose)polymerase-1 inhibitors</article-title>
          <source>Bioorg Med Chem Lett</source>
          <year>2008</year>
          <volume>18</volume>
          <fpage>5126</fpage>
          <lpage>5129</lpage>
        </citation>
      </ref>
      <ref id="R49">
        <label>49</label>
        <citation citation-type="other">
          <collab>Myung-Hwa K, Seung-Hyun K, Sae-Kwang K, Chun-Ho P, Bo-Young J, Kwang-Woo C., inventors</collab>
          <article-title>Tricyclic derivative or pharmaceutically acceptable salts thereof, preparation method thereof, and pharmaceutical composition containing the same. US patent, US8815891B2</article-title>
          <day>26</day>
          <month>08</month>
          <year>2014</year>
        </citation>
      </ref>
      <ref id="R50">
        <label>50</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Nantasenamat</surname>
              <given-names>C</given-names>
            </name>
            <name>
              <surname>Isarankura-Na-Ayudhya</surname>
              <given-names>C</given-names>
            </name>
            <name>
              <surname>Naenna</surname>
              <given-names>T</given-names>
            </name>
            <name>
              <surname>Prachayasittikul</surname>
              <given-names>V</given-names>
            </name>
          </person-group>
          <article-title>A practical overview of quantitative structure-activity relationship</article-title>
          <source>EXCLI J</source>
          <year>2009</year>
          <volume>8</volume>
          <fpage>74</fpage>
          <lpage>88</lpage>
        </citation>
      </ref>
      <ref id="R51">
        <label>51</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Nantasenamat</surname>
              <given-names>C</given-names>
            </name>
            <name>
              <surname>Prachayasittikul</surname>
              <given-names>V</given-names>
            </name>
          </person-group>
          <article-title>Maximizing computational tools for successful drug discovery</article-title>
          <source>Exp Opin Drug Discov</source>
          <year>2015</year>
          <volume>10</volume>
          <fpage>321</fpage>
          <lpage>329</lpage>
        </citation>
      </ref>
      <ref id="R52">
        <label>52</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Nantasenamat</surname>
              <given-names>C</given-names>
            </name>
            <name>
              <surname>Worachartcheewan</surname>
              <given-names>A</given-names>
            </name>
            <name>
              <surname>Mandi</surname>
              <given-names>P</given-names>
            </name>
            <name>
              <surname>Monnor</surname>
              <given-names>T</given-names>
            </name>
            <name>
              <surname>Isarankura-Na-Ayudhya</surname>
              <given-names>C</given-names>
            </name>
            <name>
              <surname>Prachayasittikul</surname>
              <given-names>V</given-names>
            </name>
          </person-group>
          <article-title>QSAR modeling of aromatase inhibition by flavonoids using machine learning approaches</article-title>
          <source>Chem Papers</source>
          <year>2014</year>
          <volume>68</volume>
          <fpage>697</fpage>
          <lpage>713</lpage>
        </citation>
      </ref>
      <ref id="R53">
        <label>53</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Ocran Mattila</surname>
              <given-names>P</given-names>
            </name>
            <name>
              <surname>Ahmad</surname>
              <given-names>R</given-names>
            </name>
            <name>
              <surname>Hasan</surname>
              <given-names>SS</given-names>
            </name>
            <name>
              <surname>Babar</surname>
              <given-names>Z-U-D</given-names>
            </name>
          </person-group>
          <article-title>Availability, affordability, access, and pricing of anti-cancer medicines in low- and middle-income countries: a systematic review of literature</article-title>
          <source>Front Public Health</source>
          <year>2021</year>
          <volume>9</volume>
          <fpage>628744</fpage>
        </citation>
      </ref>
      <ref id="R54">
        <label>54</label>
        <citation citation-type="book">
          <collab>OECD</collab>
          <source>Guidance document on the validation of (Quantitative) Structure-Activity Relationship &#x5B;(Q)SAR&#x5D; models</source>
          <year>2014</year>
          <publisher-loc>Paris</publisher-loc>
          <publisher-name>OECD Publ</publisher-name>
        </citation>
      </ref>
      <ref id="R55">
        <label>55</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Orvieto</surname>
              <given-names>F</given-names>
            </name>
            <name>
              <surname>Branca</surname>
              <given-names>D</given-names>
            </name>
            <name>
              <surname>Giomini</surname>
              <given-names>C</given-names>
            </name>
            <name>
              <surname>Jones</surname>
              <given-names>P</given-names>
            </name>
            <name>
              <surname>Koch</surname>
              <given-names>U</given-names>
            </name>
            <name>
              <surname>Ontoria</surname>
              <given-names>JM</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>Identification of substituted pyrazolo&#x5B;1,5-a&#x5D;quinazolin-5(4H)-one as potent poly(ADP-ribose)polymerase-1 (PARP-1) inhibitors</article-title>
          <source>Bioorg Med Chem Lett</source>
          <year>2009</year>
          <volume>19</volume>
          <fpage>4196</fpage>
          <lpage>4200</lpage>
        </citation>
      </ref>
      <ref id="R56">
        <label>56</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Park</surname>
              <given-names>C-H</given-names>
            </name>
            <name>
              <surname>Chun</surname>
              <given-names>K</given-names>
            </name>
            <name>
              <surname>Joe</surname>
              <given-names>B-Y</given-names>
            </name>
            <name>
              <surname>Park</surname>
              <given-names>J-S</given-names>
            </name>
            <name>
              <surname>Kim</surname>
              <given-names>Y-C</given-names>
            </name>
            <name>
              <surname>Choi</surname>
              <given-names>J-S</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>Synthesis and evaluation of tricyclic derivatives containing a non-aromatic amide as inhibitors of poly(ADP-ribose)polymerase-1 (PARP-1)</article-title>
          <source>Bioorg Med Chem Lett</source>
          <year>2010</year>
          <volume>20</volume>
          <fpage>2250</fpage>
          <lpage>2253</lpage>
        </citation>
      </ref>
      <ref id="R57">
        <label>57</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Patel</surname>
              <given-names>MR</given-names>
            </name>
            <name>
              <surname>Bhatt</surname>
              <given-names>A</given-names>
            </name>
            <name>
              <surname>Steffen</surname>
              <given-names>JD</given-names>
            </name>
            <name>
              <surname>Chergui</surname>
              <given-names>A</given-names>
            </name>
            <name>
              <surname>Murai</surname>
              <given-names>J</given-names>
            </name>
            <name>
              <surname>Pommier</surname>
              <given-names>Y</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>Discovery and structure&#x2013;activity relationship of novel 2,3-dihydrobenzofuran-7-carboxamide and 2,3-dihydrobenzofuran-3(2h)-one-7-carboxamide derivatives as poly(ADP-ribose)polymerase-1 inhibitors</article-title>
          <source>J Med Chem</source>
          <year>2014</year>
          <volume>57</volume>
          <fpage>5579</fpage>
          <lpage>5601</lpage>
        </citation>
      </ref>
      <ref id="R58">
        <label>58</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Patel</surname>
              <given-names>MR</given-names>
            </name>
            <name>
              <surname>Pandya</surname>
              <given-names>KG</given-names>
            </name>
            <name>
              <surname>Lau-Cam</surname>
              <given-names>CA</given-names>
            </name>
            <name>
              <surname>Singh</surname>
              <given-names>S</given-names>
            </name>
            <name>
              <surname>Pino</surname>
              <given-names>MA</given-names>
            </name>
            <name>
              <surname>Billack</surname>
              <given-names>B</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>Design and synthesis of N-substituted indazole-3-carboxamides as poly(ADP-ribose)polymerase-1 (PARP-1) inhibitors(&#x2020;)</article-title>
          <source>Chem Biol Drug Des</source>
          <year>2012</year>
          <volume>79</volume>
          <fpage>488</fpage>
          <lpage>496</lpage>
        </citation>
      </ref>
      <ref id="R59">
        <label>59</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Pedregosa</surname>
              <given-names>F</given-names>
            </name>
            <name>
              <surname>Varoquaux</surname>
              <given-names>G</given-names>
            </name>
            <name>
              <surname>Gramfort</surname>
              <given-names>A</given-names>
            </name>
            <name>
              <surname>Michel</surname>
              <given-names>V</given-names>
            </name>
            <name>
              <surname>Thirion</surname>
              <given-names>B</given-names>
            </name>
            <name>
              <surname>Grisel</surname>
              <given-names>O</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>Scikit-learn: machine learning in Python</article-title>
          <source>J Mach Learn Res</source>
          <year>2011</year>
          <volume>12</volume>
          <fpage>2825</fpage>
          <lpage>2830</lpage>
        </citation>
      </ref>
      <ref id="R60">
        <label>60</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Pescatore</surname>
              <given-names>G</given-names>
            </name>
            <name>
              <surname>Branca</surname>
              <given-names>D</given-names>
            </name>
            <name>
              <surname>Fiore</surname>
              <given-names>F</given-names>
            </name>
            <name>
              <surname>Kinzel</surname>
              <given-names>O</given-names>
            </name>
            <name>
              <surname>Bufi</surname>
              <given-names>LL</given-names>
            </name>
            <name>
              <surname>Muraglia</surname>
              <given-names>E</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>Identification and SAR of novel pyrrolo&#x5B;1,2-a&#x5D;pyrazin-1(2H)-one derivatives as inhibitors of poly(ADP-ribose) polymerase-1 (PARP-1)</article-title>
          <source>Bioorg Med Chem Lett</source>
          <year>2010</year>
          <volume>20</volume>
          <fpage>1094</fpage>
          <lpage>1099</lpage>
        </citation>
      </ref>
      <ref id="R61">
        <label>61</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Revathi</surname>
              <given-names>P</given-names>
            </name>
            <name>
              <surname>Kanth</surname>
              <given-names>SS</given-names>
            </name>
            <name>
              <surname>Gururaj</surname>
              <given-names>S</given-names>
            </name>
            <name>
              <surname>Chander</surname>
              <given-names>OS</given-names>
            </name>
            <name>
              <surname>Rajender</surname>
              <given-names>PS</given-names>
            </name>
          </person-group>
          <article-title>Understanding structural characteristics of PARP-1 inhibitors through combined 3D-QSAR and molecular docking studies and discovery of new inhibitors by multistage virtual screening</article-title>
          <source>Struct Chem</source>
          <year>2021</year>
          <volume>32</volume>
          <fpage>2035</fpage>
          <lpage>2050</lpage>
        </citation>
      </ref>
      <ref id="R62">
        <label>62</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Rhee</surname>
              <given-names>HK</given-names>
            </name>
            <name>
              <surname>Lim</surname>
              <given-names>SY</given-names>
            </name>
            <name>
              <surname>Jung</surname>
              <given-names>MJ</given-names>
            </name>
            <name>
              <surname>Kwon</surname>
              <given-names>Y</given-names>
            </name>
            <name>
              <surname>Kim</surname>
              <given-names>MH</given-names>
            </name>
            <name>
              <surname>Choo</surname>
              <given-names>HY</given-names>
            </name>
          </person-group>
          <article-title>Synthesis of isoquinolinone-based tetracycles as poly (ADP-ribose) polymerase-1 (PARP-1) inhibitors</article-title>
          <source>Bioorg Med Chem</source>
          <year>2009</year>
          <volume>17</volume>
          <fpage>7537</fpage>
          <lpage>7541</lpage>
        </citation>
      </ref>
      <ref id="R63">
        <label>63</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Ruf</surname>
              <given-names>A</given-names>
            </name>
            <name>
              <surname>Rolli</surname>
              <given-names>V</given-names>
            </name>
            <name>
              <surname>de Murcia</surname>
              <given-names>G</given-names>
            </name>
            <name>
              <surname>Schulz</surname>
              <given-names>GE</given-names>
            </name>
          </person-group>
          <article-title>The mechanism of the elongation and branching reaction of Poly(ADP-ribose) polymerase as derived from crystal structures and mutagenesis11Edited by R. Huber</article-title>
          <source>J Mol Biol</source>
          <year>1998</year>
          <volume>278</volume>
          <fpage>57</fpage>
          <lpage>65</lpage>
        </citation>
      </ref>
      <ref id="R64">
        <label>64</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Sahin</surname>
              <given-names>K</given-names>
            </name>
            <name>
              <surname>Durdagi</surname>
              <given-names>S</given-names>
            </name>
          </person-group>
          <article-title>Identifying new piperazine-based PARP1 inhibitors using text mining and integrated molecular modeling approaches</article-title>
          <source>J Biomol Struct Dyn</source>
          <year>2021</year>
          <volume>39</volume>
          <fpage>681</fpage>
          <lpage>690</lpage>
        </citation>
      </ref>
      <ref id="R65">
        <label>65</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Scarpelli</surname>
              <given-names>R</given-names>
            </name>
            <name>
              <surname>Boueres</surname>
              <given-names>JK</given-names>
            </name>
            <name>
              <surname>Cerretani</surname>
              <given-names>M</given-names>
            </name>
            <name>
              <surname>Ferrigno</surname>
              <given-names>F</given-names>
            </name>
            <name>
              <surname>Ontoria</surname>
              <given-names>JM</given-names>
            </name>
            <name>
              <surname>Rowley</surname>
              <given-names>M</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>Synthesis and biological evaluation of substituted 2-phenyl-2H-indazole-7-carboxamides as potent poly(ADP-ribose) polymerase (PARP) inhibitors</article-title>
          <source>Bioorg Med Chem Lett</source>
          <year>2010</year>
          <volume>20</volume>
          <fpage>488</fpage>
          <lpage>492</lpage>
        </citation>
      </ref>
      <ref id="R66">
        <label>66</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Schaduangrat</surname>
              <given-names>N</given-names>
            </name>
            <name>
              <surname>Malik</surname>
              <given-names>AA</given-names>
            </name>
            <name>
              <surname>Nantasenamat</surname>
              <given-names>C</given-names>
            </name>
          </person-group>
          <article-title>ERpred: a web server for the prediction of subtype-specific estrogen receptor antagonists</article-title>
          <source>PeerJ</source>
          <year>2021</year>
          <volume>9</volume>
          <fpage>e11716</fpage>
        </citation>
      </ref>
      <ref id="R67">
        <label>67</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Scott</surname>
              <given-names>LJ</given-names>
            </name>
          </person-group>
          <article-title>Niraparib: First global approval</article-title>
          <source>Drugs</source>
          <year>2017</year>
          <volume>77</volume>
          <fpage>1029</fpage>
          <lpage>1034</lpage>
        </citation>
      </ref>
      <ref id="R68">
        <label>68</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Shibata</surname>
              <given-names>A</given-names>
            </name>
            <name>
              <surname>Jeggo</surname>
              <given-names>PA</given-names>
            </name>
          </person-group>
          <article-title>DNA double-strand break repair in a cellular context</article-title>
          <source>Clin Oncol (R Coll Radiol)</source>
          <year>2014</year>
          <volume>26</volume>
          <fpage>243</fpage>
          <lpage>249</lpage>
        </citation>
      </ref>
      <ref id="R69">
        <label>69</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Shultz</surname>
              <given-names>MD</given-names>
            </name>
            <name>
              <surname>Cheung</surname>
              <given-names>AK</given-names>
            </name>
            <name>
              <surname>Kirby</surname>
              <given-names>CA</given-names>
            </name>
            <name>
              <surname>Firestone</surname>
              <given-names>B</given-names>
            </name>
            <name>
              <surname>Fan</surname>
              <given-names>J</given-names>
            </name>
            <name>
              <surname>Chen</surname>
              <given-names>CH</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>Identification of NVP-TNKS656: the use of structure-efficiency relationships to generate a highly potent, selective, and orally active tankyrase inhibitor</article-title>
          <source>J Med Chem</source>
          <year>2013</year>
          <volume>56</volume>
          <fpage>6495</fpage>
          <lpage>6511</lpage>
        </citation>
      </ref>
      <ref id="R70">
        <label>70</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Srivastava</surname>
              <given-names>M</given-names>
            </name>
            <name>
              <surname>Raghavan</surname>
              <given-names>SC</given-names>
            </name>
          </person-group>
          <article-title>DNA double-strand break repair inhibitors as cancer therapeutics</article-title>
          <source>Chem Biol</source>
          <year>2015</year>
          <volume>22</volume>
          <fpage>17</fpage>
          <lpage>29</lpage>
        </citation>
      </ref>
      <ref id="R71">
        <label>71</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Steffen</surname>
              <given-names>JD</given-names>
            </name>
            <name>
              <surname>Brody</surname>
              <given-names>JR</given-names>
            </name>
            <name>
              <surname>Armen</surname>
              <given-names>RS</given-names>
            </name>
            <name>
              <surname>Pascal</surname>
              <given-names>JM</given-names>
            </name>
          </person-group>
          <article-title>Structural implications for selective targeting of PARPs</article-title>
          <source>Front Oncol</source>
          <year>2013</year>
          <volume>3</volume>
          <fpage>301</fpage>
        </citation>
      </ref>
      <ref id="R72">
        <label>72</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Steinhagen</surname>
              <given-names>H</given-names>
            </name>
            <name>
              <surname>Gerisch</surname>
              <given-names>M</given-names>
            </name>
            <name>
              <surname>Mittendorf</surname>
              <given-names>J</given-names>
            </name>
            <name>
              <surname>Schlemmer</surname>
              <given-names>K-H</given-names>
            </name>
            <name>
              <surname>Albrecht</surname>
              <given-names>B</given-names>
            </name>
          </person-group>
          <article-title>Substituted uracil derivatives as potent inhibitors of poly(ADP-ribose)polymerase-1 (PARP-1)</article-title>
          <source>Bioorg Med Chem Lett</source>
          <year>2002</year>
          <volume>12</volume>
          <fpage>3187</fpage>
          <lpage>3190</lpage>
        </citation>
      </ref>
      <ref id="R73">
        <label>73</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Torrisi</surname>
              <given-names>C</given-names>
            </name>
            <name>
              <surname>Bisbocci</surname>
              <given-names>M</given-names>
            </name>
            <name>
              <surname>Ingenito</surname>
              <given-names>R</given-names>
            </name>
            <name>
              <surname>Ontoria</surname>
              <given-names>JM</given-names>
            </name>
            <name>
              <surname>Rowley</surname>
              <given-names>M</given-names>
            </name>
            <name>
              <surname>Schultz-Fademrecht</surname>
              <given-names>C</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>Discovery and SAR of novel, potent and selective hexahydrobenzonaphthyridinone inhibitors of poly(ADP-ribose)polymerase-1 (PARP-1)</article-title>
          <source>Bioorg Med Chem Lett</source>
          <year>2010</year>
          <volume>20</volume>
          <fpage>448</fpage>
          <lpage>452</lpage>
        </citation>
      </ref>
      <ref id="R74">
        <label>74</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Tutt</surname>
              <given-names>ANJ</given-names>
            </name>
            <name>
              <surname>Garber</surname>
              <given-names>JE</given-names>
            </name>
            <name>
              <surname>Kaufman</surname>
              <given-names>B</given-names>
            </name>
            <name>
              <surname>Viale</surname>
              <given-names>G</given-names>
            </name>
            <name>
              <surname>Fumagalli</surname>
              <given-names>D</given-names>
            </name>
            <name>
              <surname>Rastogi</surname>
              <given-names>P</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>Adjuvant olaparib for patients with BRCA1- or BRCA2-mutated breast cancer</article-title>
          <source>N Engl J Med</source>
          <year>2021</year>
          <volume>384</volume>
          <fpage>2394</fpage>
          <lpage>2405</lpage>
        </citation>
      </ref>
      <ref id="R75">
        <label>75</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>van Beek</surname>
              <given-names>L</given-names>
            </name>
            <name>
              <surname>McClay</surname>
              <given-names>&#xC9;</given-names>
            </name>
            <name>
              <surname>Patel</surname>
              <given-names>S</given-names>
            </name>
            <name>
              <surname>Schimpl</surname>
              <given-names>M</given-names>
            </name>
            <name>
              <surname>Spagnolo</surname>
              <given-names>L</given-names>
            </name>
            <name>
              <surname>Maia de Oliveira</surname>
              <given-names>T</given-names>
            </name>
          </person-group>
          <article-title>PARP power: A structural perspective on PARP1, PARP2, and PARP3 in DNA damage repair and nucleosome remodelling</article-title>
          <source>Int J Mol Sci</source>
          <year>2021</year>
          <volume>22</volume>
          <issue>10</issue>
          <fpage>5112</fpage>
        </citation>
      </ref>
      <ref id="R76">
        <label>76</label>
        <citation citation-type="book">
          <person-group person-group-type="author">
            <name>
              <surname>van de Waterbeemd</surname>
              <given-names>H</given-names>
            </name>
          </person-group>
          <person-group person-group-type="editor">
            <name>
              <surname>van de Waterbeeemd</surname>
              <given-names>H</given-names>
            </name>
            <name>
              <surname>Testa</surname>
              <given-names>B</given-names>
            </name>
          </person-group>
          <article-title>Physicochemical approaches to drug absorption</article-title>
          <source>Drug bioavailability: estimation of solubility, permeability, absorption and bioavailability, Vol. 40</source>
          <year>2008</year>
          <edition>2nd ed.</edition>
          <publisher-loc>New York</publisher-loc>
          <publisher-name>Wiley</publisher-name>
          <fpage>69</fpage>
          <lpage>99</lpage>
        </citation>
      </ref>
      <ref id="R77">
        <label>77</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Wildman</surname>
              <given-names>SA</given-names>
            </name>
            <name>
              <surname>Crippen</surname>
              <given-names>GM</given-names>
            </name>
          </person-group>
          <article-title>Prediction of physicochemical parameters by atomic contributions</article-title>
          <source>J Chem Inf Comput Sci</source>
          <year>1999</year>
          <volume>39</volume>
          <fpage>868</fpage>
          <lpage>873</lpage>
        </citation>
      </ref>
      <ref id="R78">
        <label>78</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Worachartcheewan</surname>
              <given-names>A</given-names>
            </name>
            <name>
              <surname>Nantasenamat</surname>
              <given-names>C</given-names>
            </name>
            <name>
              <surname>Isarankura-Na-Ayudhya</surname>
              <given-names>C</given-names>
            </name>
            <name>
              <surname>Prachayasittikul</surname>
              <given-names>V</given-names>
            </name>
          </person-group>
          <article-title>QSAR study of H1N1 neuraminidase inhibitors from influenza a virus</article-title>
          <source>Lett Drug Des Discov</source>
          <year>2014</year>
          <volume>11</volume>
          <fpage>420</fpage>
          <lpage>427</lpage>
        </citation>
      </ref>
      <ref id="R79">
        <label>79</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Xu</surname>
              <given-names>B</given-names>
            </name>
            <name>
              <surname>Yin</surname>
              <given-names>Y</given-names>
            </name>
            <name>
              <surname>Dong</surname>
              <given-names>M</given-names>
            </name>
            <name>
              <surname>Song</surname>
              <given-names>Y</given-names>
            </name>
            <name>
              <surname>Li</surname>
              <given-names>W</given-names>
            </name>
            <name>
              <surname>Huang</surname>
              <given-names>X</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>Pamiparib dose escalation in Chinese patients with non-mucinous high-grade ovarian cancer or advanced triple-negative breast cancer</article-title>
          <source>Cancer Med</source>
          <year>2021</year>
          <volume>10</volume>
          <issue>1</issue>
          <fpage>109</fpage>
          <lpage>118</lpage>
        </citation>
      </ref>
      <ref id="R80">
        <label>80</label>
        <citation citation-type="other">
          <collab>Xu W, Delahanty G, Wei L, Zhang J, inventors</collab>
          <article-title>PARP inhibitor compounds, compositions and methods of use. US patent, US8894989B2</article-title>
          <day>25</day>
          <month>11</month>
          <year>2014</year>
        </citation>
      </ref>
      <ref id="R81">
        <label>81</label>
        <citation citation-type="other">
          <collab>Xu W, Delahanty G, Zhang J, inventors</collab>
          <article-title>Diazabenzo&#x5B;de&#x5D; anthracen-3-one compounds and methods for inhibiting PARP. US Patent, US8470825B2</article-title>
          <day>11</day>
          <month>11</month>
          <year>2014</year>
        </citation>
      </ref>
      <ref id="R82">
        <label>82</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Yap</surname>
              <given-names>CW</given-names>
            </name>
          </person-group>
          <article-title>PaDEL-descriptor: an open source software to calculate molecular descriptors and fingerprints</article-title>
          <source>J Comput Chem</source>
          <year>2011</year>
          <volume>32</volume>
          <fpage>1466</fpage>
          <lpage>1474</lpage>
        </citation>
      </ref>
      <ref id="R83">
        <label>83</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Yu</surname>
              <given-names>J</given-names>
            </name>
            <name>
              <surname>Luo</surname>
              <given-names>L</given-names>
            </name>
            <name>
              <surname>Hu</surname>
              <given-names>T</given-names>
            </name>
            <name>
              <surname>Cui</surname>
              <given-names>Y</given-names>
            </name>
            <name>
              <surname>Sun</surname>
              <given-names>X</given-names>
            </name>
            <name>
              <surname>Gou</surname>
              <given-names>W</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>Structure-based design, synthesis, and evaluation of inhibitors with high selectivity for PARP-1 over PARP-2</article-title>
          <source>Eur J Med Chem</source>
          <year>2022</year>
          <volume>227</volume>
          <fpage>113898</fpage>
        </citation>
      </ref>
      <ref id="R84">
        <label>84</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Zhao</surname>
              <given-names>H</given-names>
            </name>
            <name>
              <surname>Ji</surname>
              <given-names>M</given-names>
            </name>
            <name>
              <surname>Cui</surname>
              <given-names>G</given-names>
            </name>
            <name>
              <surname>Zhou</surname>
              <given-names>J</given-names>
            </name>
            <name>
              <surname>Lai</surname>
              <given-names>F</given-names>
            </name>
            <name>
              <surname>Chen</surname>
              <given-names>X</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>Discovery of novel quinazoline-2,4(1H,3H)-dione derivatives as potent PARP-2 selective inhibitors</article-title>
          <source>Bioorg Med Chem</source>
          <year>2017</year>
          <volume>25</volume>
          <fpage>4045</fpage>
          <lpage>4054</lpage>
        </citation>
      </ref>
      <ref id="R85">
        <label>85</label>
        <citation citation-type="other">
          <collab>Zhou C, Ren B, Wang H, inventors</collab>
          <article-title>Fused tetra or penta-cyclic dihydrodiazepinocarbazolones as parp inhibitors. US patent, US9617273B2</article-title>
          <day>11</day>
          <month>04</month>
          <year>2017</year>
        </citation>
      </ref>
      <ref id="R86">
        <label>86</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Zhou</surname>
              <given-names>J</given-names>
            </name>
            <name>
              <surname>Ji</surname>
              <given-names>M</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>X</given-names>
            </name>
            <name>
              <surname>Zhao</surname>
              <given-names>H</given-names>
            </name>
            <name>
              <surname>Cao</surname>
              <given-names>R</given-names>
            </name>
            <name>
              <surname>Jin</surname>
              <given-names>J</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>Discovery of quinazoline-2,4(1H,3H)-dione derivatives containing 3-substituted piperizines as potent PARP-1&#x2F;2 inhibitors&#x2500;design, synthesis, in vivo antitumor activity, and X-ray crystal structure analysis</article-title>
          <source>J Med Chem</source>
          <year>2021</year>
          <volume>64</volume>
          <fpage>16711</fpage>
          <lpage>16730</lpage>
        </citation>
      </ref>
      <ref id="R87">
        <label>87</label>
        <citation citation-type="journal">
          <person-group>
            <name>
              <surname>Zmuda</surname>
              <given-names>F</given-names>
            </name>
            <name>
              <surname>Malviya</surname>
              <given-names>G</given-names>
            </name>
            <name>
              <surname>Blair</surname>
              <given-names>A</given-names>
            </name>
            <name>
              <surname>Boyd</surname>
              <given-names>M</given-names>
            </name>
            <name>
              <surname>Chalmers</surname>
              <given-names>AJ</given-names>
            </name>
            <name>
              <surname>Sutherland</surname>
              <given-names>A</given-names>
            </name>
            <etal />
          </person-group>
          <article-title>Synthesis and evaluation of a radioiodinated tracer with specificity for poly(ADP-ribose) polymerase-1 (PARP-1) in vivo</article-title>
          <source>J Med Chem</source>
          <year>2015</year>
          <volume>58</volume>
          <fpage>8683</fpage>
          <lpage>8693</lpage>
        </citation>
      </ref>
    </ref-list>
  </back>
  <floats-wrap>
    <fig id="T1" position="float">
      <label>Table 1</label>
      <caption><title>Twelve different sets of fingerprint descriptors derived from the PaDEL-Descriptor software</title></caption>
      <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="EXCLI-22-84-t-001" />
    </fig>
    <fig id="T2" position="float">
      <label>Table 2</label>
      <caption><title>Descriptions of SMARTS patterns and substructures from the top 20 Gini indices</title></caption>
      <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="EXCLI-22-84-t-002" />
    </fig>
    <fig id="F1" position="float">
      <label>Figure 1</label>
      <caption><title>Overall workflow of the development of the webserver for PARP-1 inhibitors</title></caption>
      <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="EXCLI-22-84-g-001" />
    </fig>
    <fig id="F2" position="float">
      <label>Figure 2</label>
      <caption><title>Illustration of the relationship between molecular weight (MW) and Ghose-Crippen-Viswanadhan octanol-water partition coefficient (LogP). Blue and orange represent active and inactive compounds. The size of the circle refers to the pIC<sub>50</sub> value, which is the negative logarithmic of the IC<sub>50</sub> concentration (nM).</title></caption>
      <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="EXCLI-22-84-g-002" />
    </fig>
    <fig id="F3" position="float">
      <label>Figure 3</label>
      <caption><title>Box plots of Lipinski&#x27;s rule-of five descriptors comparing between active and inactive groups. The dashed line represents cut-off values indicating drug-like molecules: molecular weight (MW) &#x3C; 500, Ghose-Crippen-Viswanadhan octanol-water partition coefficient (LogP) &#x3C; 5, number of hydrogen bond donors (NumHDonors) &#x3C; 5, number of hydrogen bond acceptors (NumHAcceptors) &#x3C; 10. A circle represents the mean, and an asterisk indicates a significant difference between two groups (<italic>p</italic> &#x3C; 0.05).</title></caption>
      <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="EXCLI-22-84-g-003" />
    </fig>
    <fig id="F4" position="float">
      <label>Figure 4</label>
      <caption><title>Heat maps of the MCC values of the training, CV, and test sets for each data sampling approach. (A) Balanced undersampling, (B) balanced oversampling, and (C) imbalanced non-class weight. Abbreviations: MCC, Matthews correlation coefficient; CV, cross-validation; gaussianNB, Gaussian Naive Bayes; LBMC, light gradient boosted machine; MLP, multi-layer perceptron; SVC, C-support vector; XGB, extreme gradient boosting</title></caption>
      <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="EXCLI-22-84-g-004" />
    </fig>
    <fig id="F5" position="float">
      <label>Figure 5</label>
      <caption><title>Heat maps of MCC<sub>train</sub>&#x2212;MCC<sub>CV</sub> and MCC<sub>train</sub>&#x2212;MCC<sub>test</sub> for each data sampling approach. (A) Balanced undersampling, (B) balanced oversampling, (C) imbalanced non-class weight. Abbreviations: MCC, Matthews correlation coefficient; CV, cross-validation; gaussianNB, Gaussian Naive Bayes; LBMC, light gradient boosted machine; MLP, multi-layer perceptron; SVC, C-support vector; XGB, extreme gradient boosting</title></caption>
      <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="EXCLI-22-84-g-005" />
    </fig>
    <fig id="F6" position="float">
      <label>Figure 6</label>
      <caption><title>Plot of PCA scores for applicability domain analysis. The score plot indicates the distribution of chemical space of the internal (green) and external (red) datasets, which were used to determine the applicability domain of the PARP-1 inhibitors dataset.</title></caption>
      <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="EXCLI-22-84-g-006" />
    </fig>
    <fig id="F7" position="float">
      <label>Figure 7</label>
      <caption><title>Feature importance plot as rationalized by Gini index obtained from random forest model using oversampling</title></caption>
      <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="EXCLI-22-84-g-007" />
    </fig>
    <fig id="F8" position="float">
      <label>Figure 8</label>
      <caption><title>Crystal structure of the catalytic domain of PARP-1 (PDB ID 1UK0) and the interaction network between PARP-1 and olaparib (PDB ID 7KK4). The alpha-helical subdomain (HD) is shown in light orange color while the ADP-ribosyl transferase subdomain (ART) is shown in wheat color. Hydrogen forming network (blue solid line), &#x3C0;-&#x3C0; (green dashed line), and hydrophobic (grey dashed line) interactions between key amino acids within the nicotinamide binding site and olaparib</title></caption>
      <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="EXCLI-22-84-g-008" />
    </fig>
    <fig id="F9" position="float">
      <label>Figure 9</label>
      <caption><title>Screenshot of the PARP1pred webserver before (A) and after (B) entering the SMILES input. Notice that after submission of the SMILES notation the corresponding molecular fingerprints are computed whereby the trained predictive model is applied to classify the query molecule as active or inactive. In this case, the query molecule is classified to be active.</title></caption>
      <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="EXCLI-22-84-g-009" />
    </fig>
  </floats-wrap>
</article>