<?xml version='1.0' encoding='UTF-8'?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.1d1 20130915//EN" "JATS-journalpublishing1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta id="journal-meta-3ba10e5f2fb94953ac6e0f291434ba9c">
      <journal-id journal-id-type="nlm-ta">Sciresol</journal-id>
      <journal-id journal-id-type="publisher-id">Sciresol</journal-id>
      <journal-id journal-id-type="journal_submission_guidelines">https://www.jcbsonline.ac.in/</journal-id>
      <journal-title-group>
        <journal-title>Journal of Clinical and Biomedical Sciences</journal-title>
      </journal-title-group>
      <issn publication-format="electronic">2319-2453</issn>
      <issn publication-format="print"/>
    </journal-meta>
    <article-meta id="article-meta-eef2bf921dfe4869bdc68e7363d81331">
      <article-id pub-id-type="doi">10.58739/jcbs/v15i2.24.187</article-id>
      <article-categories>
        <subj-group>
          <subject>Original Article</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title id="article-title-939a8862ed3f4f5982d86b1b35eba43a">
          <bold id="strong-3c5f4fa8bd0440149101b3cb8be832d2">Advanced Arrhythmia Classification Using Transformer-Based CNN</bold>
        </article-title>
        <alt-title alt-title-type="right-running-head">Advanced arrhythmia classification using transformer based  CNN</alt-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes">
          <name id="name-8bd9257b050c45f9b1ade78133a1c2bd">
            <surname>Febeena</surname>
            <given-names>K R</given-names>
          </name>
          <email>febigenius@gmail.com</email>
          <xref id="xref-02593fd43ed94d33b4d5cdf0b700af96" rid="aff-3bfbefa1ffcf4514b962b09df6f6b5b5" ref-type="aff">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name id="name-97862c7f65df4c39a9ab639996c5ac0d">
            <surname>Kurian</surname>
            <given-names>Cini</given-names>
          </name>
          <xref id="xref-02280293c5ac4e6eb8aa3f52f0948326" rid="aff-80966e44196c4eeda6649b1d38c5f3eb" ref-type="aff">2</xref>
        </contrib>
        <aff id="aff-3bfbefa1ffcf4514b962b09df6f6b5b5">
          <institution>Research Scholar, School of Computer Sciences, Mahatma Gandhi University</institution>
          <addr-line>Kottayam, Kerala</addr-line>
          <country country="IN">India</country>
        </aff>
        <aff id="aff-80966e44196c4eeda6649b1d38c5f3eb">
          <institution>Principal, Al-Ameen College</institution>
          <addr-line>Edathala, Aluva, Kerala</addr-line>
          <country country="IN">India</country>
        </aff>
      </contrib-group>
      <volume>15</volume>
      <issue>2</issue>
      <fpage>118</fpage>
      <permissions>
        <copyright-year>2025</copyright-year>
      </permissions>
      <abstract id="abstract-abstract-title-72970cf860bf45d5a3b23bae1b9b5028">
        <title id="abstract-title-72970cf860bf45d5a3b23bae1b9b5028">Abstract</title>
        <p id="paragraph-5634aa8e61d24d59aec715395cfae965"><bold id="strong-f5dd5ab0b712466894768cb00386f626">Background</bold>: The Electrocardiogram (ECG) is a vital clinical signal for recognizing cardiovascular ailments (CVDs) such as Arrhythmia. However, manual assessment of ECG signals is challenging due to subtle physiological variations in both regular and irregular cases, mainly when dealing with a large volume of cardiac patients. From this perspective, automated sorting of ECG signals can offer substantial relief to healthcare experts, facilitating precise analysis. <bold id="strong-006d2779b5644f2987d350e5da3935ec">Objective</bold>: This study aims to develop an automated system for sorting ECG signals to ease the workload of healthcare experts and enhance the precision of cardiac condition analysis. The ultimate goal is to provide healthcare professionals with a reliable tool that streamlines the interpretation process, enabling timely and accurate diagnoses, thereby improving patient outcomes and reducing healthcare burdens. <bold id="strong-44989367ac1c43f8a86a0f4c0b265673">Method &amp; Material</bold>: Current approaches predominantly rely on convolutional neural networks (CNNs) to extract ECG signal features. However, these may fail to capture nuanced differences in pathological features across different diseases. Transformer networks, known for their prowess in handling sequence data, offer advantages in feature extraction but often rely on extensive datasets, making the complete network intricate. This proposed model utilizes CNN and Transformers for arrhythmia classification. This study was conducted on the MIT-BIH Arrhythmia database (MIT-ArrhyDB), classifying five distinct classes of arrhythmias based on their morphological features. <bold id="strong-ae958ccb7ee34bbba791cb8a862a72d7">Result</bold>: The proposed model exhibits an impressive F1 Score of 98.52% and classification accuracy of 98.95%. Conclusion: Comparative analysis with standard CNN exposes the superior performance of our proposed model. This highlights its outstanding overall performance and potential utility in clinical applications.﻿ </p>
      </abstract>
      <kwd-group id="kwd-group-45155e6e02fd4b84ae9046f3fc544a14">
        <title>Keywords</title>
        <kwd>Electrocardiogram (ECG)</kwd>
        <kwd>Arrhythmia</kwd>
        <kwd>Convolutional Neural Network (CNN)</kwd>
        <kwd>Transformer</kwd>
      </kwd-group>
      <funding-group>
        <funding-statement>None</funding-statement>
      </funding-group>
    </article-meta>
  </front>
  <body>
    <sec>
      <title id="title-925e8b9b4ad940ada2a502ad07410170">1 Introduction</title>
      <p id="paragraph-72c34c571bc540928bcfafe5db93a3c9">An irregular heartbeat, known as arrhythmia, occurs when the heart's rhythm is disrupted, causing it to beat erratically, excessively quickly, or unusually slowly, distracting its ability to pump blood efficiently. It can be analysed with an Electrocardiogram (ECG), which records the heart's electrical activity without invasive procedures. Traditionally, diagnosing arrhythmia relied on human observation, which was time-consuming. However, automatic detection and classification now save time and improve accuracy. Over the past few decades, cardiovascular disease diagnosis has increasingly relied on machine learning (ML) and deep learning (DL) techniques. Outmoded ML requires extensive feature extraction from ECG signals, while DL models simplify this process by automatically identifying relevant features for prediction and classification <xref id="xref-fe8efc970bc145eca8dd915aa3ca8d0d" rid="R273086533403599" ref-type="bibr">1</xref>. Yanfang Dong et al.<xref id="xref-b35fe64f9f8a4be8ab2cde61cd3327aa" rid="R273086533403583" ref-type="bibr">2</xref> proposed CNN-DVIT, the new architecture for deep learning classification of multi-label arrhythmias using 12-lead ECG signals containing different-length recordings. This model combines CNNs with depth wise separable convolutions and incorporates a ViT with a deformable attention mechanism to extract spatial and temporal characteristics from ECG data efficiently. Sattar et al.<xref id="xref-f073f28eb0f349fd9fe89d79bccfbf25" rid="R273086533403598" ref-type="bibr">3</xref> developed a deep learning-based approach for ECG classification by digitising ECG images to time-series data and applying models such as CNN, LSTM, and SSL-based autoencoders. The CNN model achieves the highest accuracy for classifying cardiac arrhythmias. The results have proven that the digitised ECG signals allow such accurate real-time monitoring for cardiologists, offering an essential instrument for the early and efficient diagnosis of cardiac diseases. Answer: Ansari et al.<xref id="xref-d8743a6fd2444f4e810a760c8ecf20c2" rid="R273086533403591" ref-type="bibr">4</xref> review deep learning (DL) architectures for ECG arrhythmia detection, comparing models like CNNs, MLPs, Transformers, and RNNs used from 2017 to 2023. It offers a roadmap for researchers entering the field, providing insights into current trends and practical models for detecting ECG anomalies. The survey also highlights areas for future research, aiming to inspire further advancements in ECG arrhythmia detection and classification.</p>
      <p id="paragraph-6ede59f41948455f95532de01cfcb529">ECG is the most widely employed and pertinent method for evaluating a patient's cardiac activity. Each cardiac phase embraces successive atrial and ventricular depolarisation <xref id="xref-73279e999ae342a1bd7704f64a1fe97a" rid="R273086533403604" ref-type="bibr">5</xref>, emanating from the atrial sinoatrial node and spreading across the heart. This process produces electrical currents on the body's surface, inducing skin surface electrical potential variations. Surface electrodes capture these signals, which are visually depicted in the ECG. A standard ECG cycle's key attributes include amplitudes, morphologies, and durations of waves such as P, QRS, and T waves <xref id="xref-51d25b19aff84af082d9ac3b2cc9dd60" rid="R273086533403598" ref-type="bibr">3</xref>, as illustrated in <xref id="x-2ee254d665f3" rid="figure-998b233fdab1493d8102bf3f7d5dfa71" ref-type="fig">Figure 1</xref>. A simple ECG wave starts with a P wave. The QRS wave, notably, is the most significant waveform in the ECG signal and represents the depolarisation of the ventricles. T waves represent ventricular repolarisation.</p>
      <p id="p-73bf013ee341">Baseline wander, Power Line Interference, contact noise, and motion artefacts represent diverse categories of noise found in ECG signals <xref id="xref-f906a87ad38f4e5cba587b531b4a56eb" rid="R273086533403584" ref-type="bibr">6</xref>. M Wu et al.<xref rid="R273086533403592" ref-type="bibr">7</xref>, <xref rid="R273086533403603" ref-type="bibr">8</xref> proposed a resilient and effective deep 1D CNN with 12 layers to categorise five arrhythmia classes. X Hua et al.<xref id="xref-2aaa51bf9c564a04b7377c9c94fa9bc1" rid="R273086533403593" ref-type="bibr">9</xref> presented a comprehensive ECG signal classification technique that leverages a unique 1D CNN segmentation strategy. MMR Khan et al.<xref id="xref-5e9a8f1a5ff44cb38139a6ccc4fac44c" rid="R273086533403600" ref-type="bibr">10</xref> presented a 1-D CNN based on DL for the automated categorisation of ECG heartbeats, aiming to classify five distinct types of abnormal rhythms. BM Maweu et al.<xref id="xref-63ecae657753418eab9ef810f01947af" rid="R273086533403586" ref-type="bibr">11</xref> presented the CNN Explanations Proposed model for ECG Signals (CEFEs), designed to provide interpretable explanations. Y Zhao et al.<xref id="xref-42aeeadc075a4d8980ffae9a35344bcb" rid="R273086533403585" ref-type="bibr">12</xref> introduced an automated technique for ECG signal classification exploiting a deep CNN, complemented by wavelet transform for data filtering. Savalia S and Emamian V <xref id="xref-8ec52cfc4f2b43709ecdc92a64ba8227" rid="R273086533403596" ref-type="bibr">13</xref> introduced a computerised classification system employing the Multi-Layer Perceptron (MLP) network and the CNN.</p>
      <fig id="figure-998b233fdab1493d8102bf3f7d5dfa71" orientation="portrait" fig-type="graphic" position="anchor">
        <label>Figure 1 </label>
        <caption id="caption-a963a43f81cd4afc95e72151a83d3778">
          <title id="title-51a6ef8f2afe4ed0a3a1e119ad2897a6">
            <bold id="strong-33beff6150214487961d63542018b68a">Typical diagram of ECG Waveform</bold>
          </title>
        </caption>
        <graphic id="graphic-2bd403b4cb38403d9dd08eecb8d4a1ec" xlink:href="https://s3-us-west-2.amazonaws.com/typeset-prod-media-server/192e8cfc-4eae-4188-9772-a2f773280398image1.png"/>
      </fig>
      <p id="paragraph-be72342d6c0d4163a5253568f4c4330e">﻿ Despite their effectiveness, DL-based methods face specific challenges <xref id="xref-987af1fa41f8429fb812ccabdbaac690" rid="R273086533403594" ref-type="bibr">14</xref>. They often require intricate convolution and recursive structures, producing a series of concealed states with limited parallelisation due to dependencies on previous states. While CNNs have become famous for pattern classification, they may not capture pathological variations in ECG signals, such as irregularities in shape, duration, or timing <xref id="xref-392e73d9c9954df286e51c633c8068c9" rid="R273086533403588" ref-type="bibr">15</xref>. Transformer networks present a promising alternative, offering advantages in handling sequential data and extracting relevant features to classify cardiac abnormalities better.</p>
      <p id="paragraph-f0f1c1d950eb4c0289c283359815ce09">Key contributions of the paper include: </p>
      <p id="paragraph-2f21250cf5394894ac98acae4f03b9d1">(a) In pre-processing, denoising, signal segmentation, and data resampling are meticulously executed.</p>
      <p id="paragraph-027eb524714741f8bc98d048ad91b184">(b) The proposed model employs transformer network (TransNet) based CNN for feature mining.</p>
      <p id="paragraph-9be2967a693f4f82b9c7869ba3ab3db7">(c) A comparative study is performed against the standard CNN architecture to evaluate the model's effectiveness.﻿</p>
      <p id="paragraph-f2d043e76d9c46c9901aabd499b6eda4">ECG monitoring generates a continuous flow of data, yielding large amounts of complicated waveforms that cannot be analysed manually, especially in real-time. The volume becomes even larger in large-scale clinical environments where numerous patients are monitored simultaneously. Arrhythmia can take such diverse forms and severities. Some are subtle and not easily detectable and require specialists' expert analyses for pattern recognition. Manual interpretation is prone to error, particularly regarding infrequent or unusual arrhythmias. This analysis takes more time and requires great attention to detail, which would be impractical for a clinical setting that needs prompt diagnoses to intervene effectively. It is hard to find many professionals in clinical settings whose qualifications warrant extensive training and experience for proper ECG interpretation. In high-volume settings, the absence of available experts might reduce the capability to keep pace with demand. Automated techniques, like Transformer-based Convolutional Neural Networks, help address these challenges through efficient, scalable, and accurate tools for classifying arrhythmias. Specifically, Transformer-based CNNs apply deep learning techniques to capture temporal and spatial features in ECG data, allowing for improved accuracy and speed up, which is critical in vast, real-time applications.</p>
      <p id="paragraph-3c3b0fa374594bbf95d00c37fc6d5682">The proposed methodology integrates TransNet with the CNN network to address its limitations, particularly its suboptimal performance in handling temporal features. The paper's structure is outlined as follows: The proposed model is detailed in Section 2; preliminary findings are presented in Section 3; and a concise overview and concluding remarks are provided in Section 4.</p>
    </sec>
    <sec>
      <title id="title-d5a0dd0e7b0043529f6c183e2d3dc008">2 Materials and Methodology</title>
      <p id="paragraph-5d4b71aeddff4551b2453ab1f82fc6aa">The proposed model is represented in <xref id="x-18a84b5b4198" rid="figure-e0169e4b6384484f8330b0c604118110" ref-type="fig">Figure 2</xref>. In <xref id="x-c6f7b29d9889" rid="figure-e0169e4b6384484f8330b0c604118110" ref-type="fig">Figure 2</xref> ECG signals from MIT-ArrhyDB are given as input to the model. Due to undesired random instabilities, the signal is exposed to eradicate noise as part of the pre-processing. After the signal has been de-noised, R-peak information is accessible from the database, which separates the individual heartbeats. Next, resampling has been used to enrich the data, and lastly, the classifier architecture receives the processed and expanded data. The following subsections include a full description of the complete procedure.</p>
      <fig id="figure-e0169e4b6384484f8330b0c604118110" orientation="portrait" fig-type="graphic" position="anchor">
        <label>Figure 2 </label>
        <caption id="caption-9906739e345a428fbb237cfee5f69c86">
          <title id="title-8b161b263ac0466084afc609842c8a53">
            <bold id="strong-8cfd4589c84a44909e742efd9578af0e">Architecture of proposed </bold>
            <bold id="strong-66f83c348826419dbdcbfe7ee135d1bc">TransNet-based CNN Model</bold>
          </title>
        </caption>
        <graphic id="graphic-a171be9b71834ac186324e55db9288f4" xlink:href="https://s3-us-west-2.amazonaws.com/typeset-prod-media-server/192e8cfc-4eae-4188-9772-a2f773280398image2.png"/>
      </fig>
      <p id="paragraph-f0b57e5c9a8a45559a28565596076eb9">In <xref id="x-bd29ede8f0ef" rid="figure-e0169e4b6384484f8330b0c604118110" ref-type="fig">Figure 2</xref>, the raw data is pre-processed to eliminate unwanted noise or artefacts of the ECG signal, improving the signal quality. Then, we extract all the relevant segments that might correspond to individual heartbeats or specific intervals, after which we bring the data into a uniform sampling rate for consistency, which is valuable for further processing. Filter count and kernel dimension with values of 32, 64, and 128 and dimensions of 1.5, 1.7, and 1.9, respectively, are used within each layer for identifying the different patterns appearing in ECG signals. After extracting the features by CNN, these are fed into a Transformer Network, which, using attention mechanisms, helps to understand complex relationships between data points. This approach also accounts for sequential relationships among heartbeats in ECGs, making it suitable for analysing ECG data, which is inherently time-dependent. This output from the Transformer Network is given a softmax function to provide different classes' probabilities. The pipeline implements CNN for feature extraction and Transformer networks for sequential learning of ECG data to classify it into different heartbeat categories. The approach efficiently suits the appropriately complex temporal patterns of ECG signals and will potentially contribute to automated arrhythmia detection systems.</p>
      <sec>
        <title id="title-b8160aa1248f464fa6ed7e35eaaab2ed">
          <bold id="s-091840491a64">2.1 Dataset</bold>
        </title>
        <p id="paragraph-707f29dd483d49eea06277ab2e6bacb2">The data hired in this research were traced from the MIT-ArrhyDB <xref rid="R273086533403595" ref-type="bibr">16</xref>, <xref rid="R273086533403589" ref-type="bibr">17</xref>. By leveraging the R-peak data from MIT-ArrhyDB, we fragmented the signal into individual beats grounded around each R-peak and documented their respective types. Each beat fragment contained 300 data points, with 150 preceding the R-peak and 149 following the peak. Adhering to AAMI recommendations, we elicited 109,446 fragments categorised into five categories: 90,592 Non-ectopic beats (N), 2,781 Supraventricular ectopic beats (S), 7,232 Ventricular ectopic beats (V), 802 Fusion beats (F), and 8,039 Unknown beats (Q). The dataset is divided into 80:20 ratios for training and evaluate the proposed model.</p>
      </sec>
      <sec>
        <title id="title-597f3e3bcbbd4ec4aeb8f3f3e3197048">
          <bold id="s-231af2553978">2.2 Pre-Processing of ECG Input Signal</bold>
        </title>
        <p id="paragraph-ef9c558eec604d079dede3ad974d79f6">Denoising, signal segmentation and resampling are done as part of pre-processing. The prime noise in most electrophysiological signals is power line interference (PLI) <xref id="xref-e3a2b3beec134806828486d68001fb57" rid="R273086533403597" ref-type="bibr">18</xref>. The difficulties of transient interferences associated with digital notch filters are addressed by the proposal of an adaptive notch filter (ANF) <xref id="x-d05dd4de73aa" rid="R273086533403708" ref-type="bibr">19</xref>. In this method, apply fast Fourier transform (FFT) on the input data, <inline-formula id="inline-formula-a67b97593ed04bf8be9b8bf242834d3f"> <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mi>e</mml:mi><mml:mfenced><mml:mi>t</mml:mi></mml:mfenced></mml:math></inline-formula> with the sampling frequency of <inline-formula id="inline-formula-dd39eff2685c4b34b9237caa770f22e2"> <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mi>f</mml:mi><mml:mi>s</mml:mi></mml:msub></mml:math></inline-formula> and length N. Utilizing a pair of spectral bins positioned in the primary lobe of the Fourier spectrum permits the assessment of PLI parameters, including amplitude <inline-formula id="inline-formula-1fd33f863ce042d1abd1b471f7b8802a"> <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>P</mml:mi><mml:mi>L</mml:mi><mml:mi>I</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, frequency <inline-formula id="inline-formula-a554effc514e4dbaaeb865bb8202eb1d"> <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mi>P</mml:mi><mml:mi>L</mml:mi><mml:mi>I</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, and initial phase <inline-formula id="inline-formula-485ccd919882425bb6596b4648cede81"> <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mo>∅</mml:mo><mml:mrow><mml:mi>P</mml:mi><mml:mi>L</mml:mi><mml:mi>I</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> with the help of ratio based spectrum correction methods (RBSC). This estimation facilitates the creation of a compensation signal, <inline-formula id="inline-formula-4504444dac4c413d9fec4a394d4bda0d"> <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>P</mml:mi><mml:mi>L</mml:mi><mml:mi>I</mml:mi></mml:mrow></mml:msub><mml:mfenced><mml:mi>t</mml:mi></mml:mfenced></mml:math></inline-formula>. The subsequent subtraction of this recompense signal from the original signal effectively suppresses the occurrence of PLI. The mathematical representation of the above steps is shown in <xref rid="disp-formula-group-e5987d06eb304e0b9d2f05a2d7162e60" ref-type="disp-formula"/>, <xref rid="disp-formula-group-c814171af2294304bf452b179642529c" ref-type="disp-formula"/>, <xref rid="disp-formula-group-53946e1a9bc84d73bb822a86beff6d57" ref-type="disp-formula"/>, <xref rid="disp-formula-group-a146118bd89b4e1eb451a669c3e69d6a" ref-type="disp-formula"/>, respectively, <italic id="e-3d628770537d">et</italic> represents the ECG signal, <italic id="e-f7d6f576feb1">ef</italic> represents the ECG signal after the application of FFT, and <italic id="e-1ac627ec959b">CPLIt</italic> is the compensation signal.</p>
        <disp-formula-group id="disp-formula-group-e5987d06eb304e0b9d2f05a2d7162e60"> <disp-formula><label>1</label><mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mi>e</mml:mi><mml:mfenced><mml:mi>t</mml:mi></mml:mfenced><mml:mo> </mml:mo><mml:mover><mml:mo>→</mml:mo><mml:mrow><mml:mi>F</mml:mi><mml:mi>F</mml:mi><mml:mi>T</mml:mi></mml:mrow></mml:mover><mml:mo> </mml:mo><mml:mo> </mml:mo><mml:mover><mml:mi>e</mml:mi><mml:mo>^</mml:mo></mml:mover><mml:mfenced><mml:mi>f</mml:mi></mml:mfenced></mml:math></disp-formula></disp-formula-group>
        <disp-formula-group id="disp-formula-group-c814171af2294304bf452b179642529c"> <disp-formula><label>2</label><mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mover><mml:mi>e</mml:mi><mml:mo>^</mml:mo></mml:mover><mml:mfenced><mml:mi>f</mml:mi></mml:mfenced><mml:mo> </mml:mo><mml:mover><mml:mo>→</mml:mo><mml:mrow><mml:mi>R</mml:mi><mml:mi>B</mml:mi><mml:mi>S</mml:mi><mml:mi>C</mml:mi></mml:mrow></mml:mover><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>P</mml:mi><mml:mi>L</mml:mi><mml:mi>I</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mi>P</mml:mi><mml:mi>L</mml:mi><mml:mi>I</mml:mi><mml:mo>,</mml:mo><mml:mo> </mml:mo></mml:mrow></mml:msub><mml:msub><mml:mo>∅</mml:mo><mml:mrow><mml:mi>P</mml:mi><mml:mi>L</mml:mi><mml:mi>I</mml:mi></mml:mrow></mml:msub><mml:mo> </mml:mo></mml:math></disp-formula></disp-formula-group>
        <disp-formula-group id="disp-formula-group-53946e1a9bc84d73bb822a86beff6d57"> <disp-formula><label>3</label><mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>P</mml:mi><mml:mi>L</mml:mi><mml:mi>I</mml:mi></mml:mrow></mml:msub><mml:mfenced><mml:mi>t</mml:mi></mml:mfenced><mml:mo>=</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>P</mml:mi><mml:mi>L</mml:mi><mml:mi>I</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mo>.</mml:mo><mml:mo> </mml:mo><mml:mi>c</mml:mi><mml:mi>o</mml:mi><mml:mi>s</mml:mi><mml:mo>(</mml:mo><mml:mn>2</mml:mn><mml:mi>π</mml:mi><mml:mi>f</mml:mi></mml:mrow><mml:mrow><mml:mi>P</mml:mi><mml:mi>L</mml:mi><mml:mi>I</mml:mi></mml:mrow></mml:msub><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mo> </mml:mo><mml:msub><mml:mo>∅</mml:mo><mml:mrow><mml:mi>P</mml:mi><mml:mi>L</mml:mi><mml:mi>I</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:math></disp-formula></disp-formula-group>
        <disp-formula-group id="disp-formula-group-a146118bd89b4e1eb451a669c3e69d6a"> <disp-formula><label>4</label><mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mover><mml:mrow><mml:mi>e</mml:mi><mml:mo> </mml:mo></mml:mrow><mml:mo>-</mml:mo></mml:mover><mml:mfenced><mml:mi>t</mml:mi></mml:mfenced><mml:mo>=</mml:mo><mml:mi>e</mml:mi><mml:mfenced><mml:mi>t</mml:mi></mml:mfenced><mml:mo>-</mml:mo><mml:mo> </mml:mo><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>P</mml:mi><mml:mi>L</mml:mi><mml:mi>I</mml:mi></mml:mrow></mml:msub><mml:mfenced><mml:mi>t</mml:mi></mml:mfenced></mml:math></disp-formula></disp-formula-group>
        <p id="paragraph-51cfd7ac58064c14bda692855739f842">The signal is further treated by segmenting it into individual heartbeats following the denoising step. A significant imbalance in the dataset prompted the implementation of resampling techniques to achieve data balance.</p>
      </sec>
      <sec>
        <title id="title-7b07f0e5e3a94aeb81a17c6fa4330ac3">
          <bold id="s-9a802687cae4">2.3 Proposed model</bold>
        </title>
        <p id="paragraph-d3b9b05abbd14ee1b4de950dc7392043">The proposed architecture embraces three primary segments: (a) an embedded network based on one-dimensional convolution layers for extracting unprocessed data from divided ECG waves, (b) a transformer network, and (c) a classification segment.</p>
        <sec>
          <title id="t-bcf237d2c03b">
            <bold id="s-31bfa34dfb66">(a) CNN segment</bold>
          </title>
          <p id="paragraph-8a77c673a0c349749965687d4fb988d7">CNN operates feed-forwardly and is extensively employed for extracting features <xref id="x-f3a81cc8f7cb" rid="R273086533403707" ref-type="bibr">20</xref>. The CNN segment consists of four convolutional layers with the following parameters: Layer 1 has 32 filters and features a kernel size (k<sub id="subscript-891bf44336434aa5b773ec2a79c4e7c7">s</sub>) of 15. Layer 2 has the same parameter as Layer 1. Layer 3 is composed of 64 filters and utilises a k<sub id="subscript-ef4750887b55437fa6605fd241c627f6">s</sub> of 17, while Layer 4 comprises 128 filters with a k<sub id="subscript-da7c23ce2bbc43f8a2eca6cb76c7f2d7">s</sub> of 19. The heartbeats undergo processing using four 1D convolutional layers. Following representation learning, the characteristic vector <inline-formula id="inline-formula-eacc234901a945b0bf178c5a05cfc04f"> <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>f</mml:mi><mml:mo>'</mml:mo></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>f</mml:mi><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>f</mml:mi><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>…</mml:mo><mml:mo>…</mml:mo><mml:mo>.</mml:mo><mml:mo>.</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>f</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is generated, and this vector serves as the input to a TransNet. Padding and stride are kept identical in all layers. To give the network non-linearity and to address the vanishing gradient problem, the non-linear function is configured to a rectified linear unit (ReLU).</p>
        </sec>
        <sec>
          <title id="t-feeb1d919cb7">
            <bold id="s-9610b32e95fe">(b) Transformer segment</bold>
          </title>
          <p id="paragraph-7f14fdf0ddc640a09fda0762518f8529">To address the limitation of CNNs in modelling long-range correlations in time series data, a TransNet is integrated into the proposed model to extract extended patterns from the features identified by the CNN. The TransNet <xref id="xref-95aeb23579ed42508cae1ff74814339f" rid="R273086533403590" ref-type="bibr">21</xref> was devised based on the attention mechanism, comprising both an encoder and a decoder. In the proposed model the transformer encoder has been used to record interactions and long-range dependencies across time instances. The sole portion employed in the input signal categorisation problem is the encoder section, whose structure is depicted in <xref id="x-6e238d78478b" rid="figure-2a2c8ee89bdb4876aacd87d166f4de33" ref-type="fig">Figure 3</xref>. Before applying the attention module, the output of the convolution segment, <italic id="e-d6b6df8ab825">xf′</italic>, is subjected to positional encoding. As shown in <xref id="x-8950e6be7cbd" rid="figure-2a2c8ee89bdb4876aacd87d166f4de33" ref-type="fig">Figure 3</xref>, the encoder function is explained as follows.</p>
          <p id="paragraph-4eb90cea83e949bea3709f2b3b08f736"><bold id="strong-8779fc71492146239dd62e5efedd350f">Self-Attention Module:</bold> The Inputs Q, K, and V correspond to the query, key, and value, respectively. The attention score is computed based on the resemblance between the Q and K. Subsequently, the attention context is established according to this attention score. The calculation for the scaled dot-product attention employed by the model can be expressed using <xref id="x-d7cc9874e598" rid="dfg-8e90cad8ea6b" ref-type="disp-formula">Equation 5</xref>  where <inline-formula id="inline-formula-c1150e157986442bb900ae06b4a6168d"> <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mfrac><mml:mn>1</mml:mn><mml:msqrt><mml:msub><mml:mi>d</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:msqrt></mml:mfrac></mml:math></inline-formula> is the scaling factor.</p>
          <disp-formula-group id="dfg-8e90cad8ea6b"> <disp-formula><label>5</label><mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mi>A</mml:mi><mml:mi>t</mml:mi><mml:mi>t</mml:mi><mml:mi>e</mml:mi><mml:mi>n</mml:mi><mml:mfenced><mml:mrow><mml:mi>Q</mml:mi><mml:mo>,</mml:mo><mml:mi>K</mml:mi><mml:mo>,</mml:mo><mml:mi>V</mml:mi></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:mi>s</mml:mi><mml:mi>o</mml:mi><mml:mi>f</mml:mi><mml:mi>t</mml:mi><mml:mi>m</mml:mi><mml:mi>a</mml:mi><mml:mi>x</mml:mi><mml:mo> </mml:mo><mml:mfenced><mml:mfrac><mml:mrow><mml:mi>Q</mml:mi><mml:msup><mml:mi>K</mml:mi><mml:mi>T</mml:mi></mml:msup></mml:mrow><mml:msqrt><mml:msub><mml:mi>d</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:msqrt></mml:mfrac></mml:mfenced></mml:math></disp-formula></disp-formula-group>
          <p id="paragraph-a928b2caae1246ef84cbc54629a99800">Multi-head attention: The multi-head attention mechanism splits different attention products after projecting Q, K, and V through ‘n’ distinct linear transformations. The values of Q, K, and V are identical in the self-attention process. The formulas are represented in <xref rid="disp-formula-group-a649e4b5eac343149772e55654ce62ab" ref-type="disp-formula"/>, <xref rid="disp-formula-group-d8780d1692bb4a659a87144d486b1efc" ref-type="disp-formula"/>. The model's input size is 256, and it is executed through four transformer encoder blocks.</p>
          <disp-formula-group id="disp-formula-group-a649e4b5eac343149772e55654ce62ab"> <disp-formula><label>6</label><mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mi>M</mml:mi><mml:mi>u</mml:mi><mml:mi>l</mml:mi><mml:mi>t</mml:mi><mml:mi>i</mml:mi><mml:mi>H</mml:mi><mml:mi>e</mml:mi><mml:mi>a</mml:mi><mml:mi>d</mml:mi><mml:mo>(</mml:mo><mml:mi>Q</mml:mi><mml:mo>,</mml:mo><mml:mi>K</mml:mi><mml:mo>,</mml:mo><mml:mi>V</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mo> </mml:mo><mml:mi>C</mml:mi><mml:mi>o</mml:mi><mml:mi>n</mml:mi><mml:mi>c</mml:mi><mml:mi>a</mml:mi><mml:mi>t</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mrow><mml:mi>h</mml:mi><mml:mi>e</mml:mi><mml:mi>a</mml:mi><mml:mi>d</mml:mi></mml:mrow><mml:mn>1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>h</mml:mi><mml:mi>e</mml:mi><mml:mi>a</mml:mi><mml:mi>d</mml:mi></mml:mrow><mml:mn>2</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:mo>…</mml:mo><mml:mo>.</mml:mo><mml:mo>.</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>h</mml:mi><mml:mi>e</mml:mi><mml:mi>a</mml:mi><mml:mi>d</mml:mi></mml:mrow><mml:mi>n</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:math></disp-formula></disp-formula-group>
          <disp-formula-group id="disp-formula-group-d8780d1692bb4a659a87144d486b1efc"> <disp-formula><label>7</label><mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mrow><mml:mi>h</mml:mi><mml:mi>e</mml:mi><mml:mi>a</mml:mi><mml:mi>d</mml:mi></mml:mrow><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mi>A</mml:mi><mml:mi>t</mml:mi><mml:mi>t</mml:mi><mml:mi>e</mml:mi><mml:mi>n</mml:mi><mml:mo>(</mml:mo><mml:mi>Q</mml:mi><mml:msubsup><mml:mi>W</mml:mi><mml:mi>i</mml:mi><mml:mi>Q</mml:mi></mml:msubsup><mml:mo>,</mml:mo><mml:mi>K</mml:mi><mml:msubsup><mml:mi>W</mml:mi><mml:mi>i</mml:mi><mml:mi>K</mml:mi></mml:msubsup><mml:mo>,</mml:mo><mml:mi>V</mml:mi><mml:msubsup><mml:mi>W</mml:mi><mml:mi>i</mml:mi><mml:mi>V</mml:mi></mml:msubsup><mml:mo> </mml:mo><mml:mo> </mml:mo></mml:math></disp-formula></disp-formula-group>
          <p id="paragraph-4f7266ad325f4b2bb126f36bf4d96853">Position-wise feed-forward networks: - Every layer of the encoder has a fully linked feed-forward network, a two-level layer linear mapping, and an attention sub-layer. Linear transformations in the network using weight matrices W1, W2, and biases b1, b2, followed by layer normalisation, are represented as <inline-formula id="inline-formula-4280e8d0cd094de4803ee7c943d1dbc3"> <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mi>F</mml:mi><mml:mi>F</mml:mi><mml:mi>N</mml:mi><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo></mml:math></inline-formula> and calculated using <xref id="x-c55191e6a843" rid="disp-formula-group-d4aafce8e13d4e1797ef3b5645c86f32" ref-type="disp-formula">Equation 8</xref>. The resulting output of the transformer network, denoted as <inline-formula id="inline-formula-6b60a41e8328417b85074b9fba061dc9"> <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mi>o</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mi>f</mml:mi></mml:mrow></mml:msub><mml:mo> </mml:mo><mml:mo>:</mml:mo><mml:msub><mml:mrow><mml:mo>{</mml:mo><mml:mi>o</mml:mi></mml:mrow><mml:mn>1</mml:mn></mml:msub><mml:msub><mml:mrow><mml:mo>,</mml:mo><mml:mo> </mml:mo><mml:mi>o</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn><mml:mo>,</mml:mo><mml:mo> </mml:mo><mml:mo>…</mml:mo><mml:mo>…</mml:mo><mml:mo>…</mml:mo><mml:mo>…</mml:mo><mml:mo>.</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:msub><mml:msub><mml:mi>o</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mo>}</mml:mo><mml:mo> </mml:mo></mml:math></inline-formula>represents a learned vector for each feature.</p>
          <disp-formula-group id="disp-formula-group-d4aafce8e13d4e1797ef3b5645c86f32"> <disp-formula><label>8</label><mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mi>F</mml:mi><mml:mi>F</mml:mi><mml:mi>N</mml:mi><mml:mfenced><mml:mi>x</mml:mi></mml:mfenced><mml:mo>=</mml:mo><mml:mrow><mml:mrow><mml:mi>m</mml:mi><mml:mi>a</mml:mi><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mfenced><mml:mrow><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:mo> </mml:mo><mml:mi>x</mml:mi><mml:msub><mml:mi>W</mml:mi><mml:mn>1</mml:mn></mml:msub><mml:mo>+</mml:mo><mml:mo> </mml:mo><mml:msub><mml:mi>b</mml:mi><mml:mn>1</mml:mn></mml:msub></mml:mrow></mml:mfenced><mml:mo> </mml:mo><mml:msub><mml:mi>W</mml:mi><mml:mn>2</mml:mn></mml:msub><mml:msub><mml:mrow><mml:mo>+</mml:mo><mml:mi>b</mml:mi></mml:mrow><mml:mn>2</mml:mn></mml:msub></mml:mrow></mml:mrow><mml:mo> </mml:mo></mml:math></disp-formula></disp-formula-group>
          <p id="paragraph-329d0678361d4f6783d8e13a56e040d3"><bold id="strong-eebeea7c08264587bb34242e37cf298f">Positional encoding:</bold> The sequence's absolute or relative position is appended to the input at the encoder's top to utilise the sequence's order.</p>
          <p id="paragraph-23922a760abb44bc86c4574a554e9960">Certain conditions in medical datasets, such as certain arrhythmias, are much less frequent than others. Transformers struggle with imbalanced datasets since they can become biased towards more frequently occurring patterns and, therefore, do not perform optimally in rare but critical conditions. Transformers have high computational and memory demands, especially because the self-attention mechanism scales quadratically with the input sequence length. Long sequences are typical in medical data: continuous ECG signals and extended MRI scans consume a lot of resources, making training and inference slow and resource-hungry. Big transformers need a great deal of labelled data to generalise well. In medical domains, however, labelled data is usually limited due to privacy concerns, costs of annotations and the requirement of expert interpretation. In such cases, it can be challenging to attain optimal model performance. Sometimes Transformers have poor performance in local feature extraction. This may require supplementary convolutional layers, or hybrid approaches to improve sensitivities to local features. These will add to the complexity of the model.</p>
        </sec>
        <sec>
          <title id="t-4c950f1ff233">
            <bold id="s-ec16c35399f7">(c) Classification segment</bold>
          </title>
          <p id="paragraph-ad0e6b5da4b6414e8d8dc7a6fe0bdcdb">The output of TransNet <inline-formula id="inline-formula-67f7a401e03f4f17b93ad05a70baf2df"> <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mi>o</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mi>f</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is attached to the fully connected layer designed for multiclass categorisation. This tier categorises five different classes of arrhythmias.</p>
          <fig id="figure-2a2c8ee89bdb4876aacd87d166f4de33" orientation="portrait" fig-type="graphic" position="anchor">
            <label>Figure 3 </label>
            <caption id="caption-264e97225e0c4fa0b2d2f849b3086628">
              <title id="title-09c2cb6371494969957a2b7a3cede470">
                <bold id="strong-01e5910ade7a45f4ad3fb8ed157f9690"/>
                <bold id="strong-e25a1cec3be24e60ae9c2cbc2fddbc0c">Encoder architecture of </bold>
                <bold id="strong-ef68f4df905340dc90a7da7ccb4c49ef">TransNet Model</bold>
              </title>
            </caption>
            <graphic id="graphic-6610c4e7591741e480a2eb4e3e767026" xlink:href="https://s3-us-west-2.amazonaws.com/typeset-prod-media-server/192e8cfc-4eae-4188-9772-a2f773280398image3.jpeg"/>
          </fig>
          <p id="paragraph-5920a1032bca4f7dbc931898d539710d">In <xref id="x-e52ac3a7a33c" rid="figure-2a2c8ee89bdb4876aacd87d166f4de33" ref-type="fig">Figure 3</xref>, the architecture of a Transformer Encoder Layer is quite typical in deep learning models to treat sequential data, such as text or time-series data, such as an ECG signal. Raw Input Data: This could be an ECG signal segment, text, or sequential data. Input Embedding: This block converts the input into a dense vector representation with which the model can work. This embedding captures the inductive features of the input data. Since Transformer models are inherently position agnostic, a positional embedding is added to the input embedding. This helps it learn the order of the data point, which is very important for sequences like the signal from an ECG. This layer lets the model focus on different parts of the input sequence simultaneously by computing attention scores for multiple "heads." Each head learns a different aspect of the relationships in the data, allowing the model to capture complex dependencies across time steps in the ECG signal. The Add &amp; Norm layer normalises the output of the attention layer. Then it adds it to the input embedding along with a residual connection, which supports the flow of information through the network and stabilises training. This layer applies to each position separately with a fully connected feed-forward network that further transforms the representation and supports the model in catching more complex patterns. Here, another Add &amp; Norm process is introduced so that the standardisation of feed-forward output is combined with the previous output for better memorisation and a stabilisation model. By this Transformer Encoder Layer, the model becomes helpful in understanding complex dependencies and contextual information that remains in such sequential data. The multi-head attention combined with feed-forward layers makes this architecture particularly well suited to learning both short-term and long-term dependencies; this gives its strength to applications such as electrocardiogram signal classification and natural language processing.</p>
        </sec>
      </sec>
    </sec>
    <sec>
      <title id="title-8feb96b4332747f885a8ca09f5e13089">3 Experimental Results</title>
      <p id="paragraph-58693e470a00435ba8a1971d6886b001">This architecture uses 109,446 ECG signal segments from the MIT-ArrhyDB to evaluate model performance. The dataset is divided 80:20 for learning and evaluation, with 87,554 beats for training and 21,892 for assessment. The model classifies five categories of CVDs—N, S, V, F, and Q achieving an overall accuracy of 98.95% and an F1-score of 98.52%. The training was conducted for 50 epochs. A gradual increase in training accuracy was observed up to the 40th epoch, after which it reached a saturation point, maintaining stability from the 40th to the 50th epoch. Therefore, training was halted at 50 epochs. The Adam optimiser with a learning rate 0.001 was utilised to avoid overfitting. Despite some misclassifications in the normal class, these are minor relative to the total number of beats tested. Although S and F beats are less frequent, the model achieves 83% and 86% accuracy for these classes, respectively. In contrast, a reference model [20] using a transformer for ECG classification reported an accuracy of 90.52%. The commonly employed metrics, namely Accuracy, Precision, Recall, and F1-score (represented in <xref rid="disp-formula-group-d145657b3d5e42c982f077f74a1d4718" ref-type="disp-formula"/>, <xref rid="disp-formula-group-dde10fa14961495099bdb213c2fa23de" ref-type="disp-formula"/>, <xref rid="disp-formula-group-6174787b5d94412c9c53c7dac528f7dc" ref-type="disp-formula"/>, <xref rid="disp-formula-group-92be2c7d792b41a4879724d66c669291" ref-type="disp-formula"/>, respectively), have been utilised to evaluate the proposed classification model quantitatively. True positives and true negatives are denoted as TP and TN, while false positives and false negatives are represented as FP and FN.</p>
      <disp-formula-group id="disp-formula-group-d145657b3d5e42c982f077f74a1d4718"> <disp-formula><label>9</label><mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mi>A</mml:mi><mml:mi>c</mml:mi><mml:mi>c</mml:mi><mml:mi>u</mml:mi><mml:mi>r</mml:mi><mml:mi>a</mml:mi><mml:mi>c</mml:mi><mml:mi>y</mml:mi><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mo>+</mml:mo><mml:mi>T</mml:mi><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mo>+</mml:mo><mml:mi>T</mml:mi><mml:mi>N</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>P</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:mfrac></mml:math></disp-formula></disp-formula-group>
      <disp-formula-group id="disp-formula-group-dde10fa14961495099bdb213c2fa23de"> <disp-formula><label>10</label><mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mi>P</mml:mi><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mi>c</mml:mi><mml:mi>i</mml:mi><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>o</mml:mi><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>P</mml:mi></mml:mrow></mml:mfrac></mml:math></disp-formula></disp-formula-group>
      <disp-formula-group id="disp-formula-group-6174787b5d94412c9c53c7dac528f7dc"> <disp-formula><label>11</label><mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mi>R</mml:mi><mml:mi>e</mml:mi><mml:mi>c</mml:mi><mml:mi>a</mml:mi><mml:mi>l</mml:mi><mml:mi>l</mml:mi><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:mfrac></mml:math></disp-formula></disp-formula-group>
      <disp-formula-group id="disp-formula-group-92be2c7d792b41a4879724d66c669291"> <disp-formula><label>12</label><mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mi>F</mml:mi><mml:mn>1</mml:mn></mml:msub><mml:mo> </mml:mo><mml:mi>s</mml:mi><mml:mi>c</mml:mi><mml:mi>o</mml:mi><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mo> </mml:mo><mml:mo> </mml:mo><mml:mo>=</mml:mo><mml:mn>2</mml:mn><mml:mo>.</mml:mo><mml:mo> </mml:mo><mml:mfrac><mml:mrow><mml:mi>p</mml:mi><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mi>c</mml:mi><mml:mi>i</mml:mi><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>o</mml:mi><mml:mi>n</mml:mi><mml:mo>.</mml:mo><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mi>c</mml:mi><mml:mi>a</mml:mi><mml:mi>l</mml:mi><mml:mi>l</mml:mi></mml:mrow><mml:mrow><mml:mi>p</mml:mi><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mi>c</mml:mi><mml:mi>i</mml:mi><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>o</mml:mi><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mi>c</mml:mi><mml:mi>a</mml:mi><mml:mi>l</mml:mi><mml:mi>l</mml:mi></mml:mrow></mml:mfrac></mml:math></disp-formula></disp-formula-group>
      <p id="paragraph-6c320430a3da4769bcae19f83143ad66">Each category is evaluated based on accuracy, precision, recall, and F1-score. The model achieves high accuracy across all categories, with N and Q at 99.00%, V at 95.00%, S at 83.00%, and F at 86.00%. Precision and recall values closely match accuracy, indicating consistent performance in correctly identifying each category. F1-scores, which balance precision and recall, range from 83.00% to 99.00%, highlighting the model's effectiveness in classifying ECG signals across diverse cardiac conditions.</p>
      <p id="paragraph-daf28071632c4dadb83fc365c718faa5">To analyse the performance of the proposed model, it is also trained using a convolutional neural network, which gives an overall accuracy of 96.82%. When the transformer block is incorporated with CNN, the overall accuracy is enhanced to 98.95%. <xref id="x-250d43bd0c24" rid="table-wrap-e3f308bf9daf47b3bcd5e5f5f99a1502" ref-type="table">Table 1</xref> shows the comparison of the transformer-based CNN model with previous architectures. <xref rid="figure-c7d0bb0059ac487393352df509551c32" ref-type="fig">Figure 4</xref>, <xref rid="figure-dce411931cc3495d9ff6d0c90630dd98" ref-type="fig">Figure 5</xref> represent training and validation accuracy and training and validation loss, respectively. <xref rid="figure-d73419768f6148fb8c1cde9c3c5bf1f8" ref-type="fig">Figure 6</xref>, <xref rid="figure-b2df51ac421e458797d1fa9b8b0f3029" ref-type="fig">Figure 7</xref> illustrate the confusion matrices for the model employing CNN-Transformer and CNN, respectively. The application of CNN alone results in an F1-score of 96.31% in the proposed model. Incorporating the transformer into the CNN leads to a notable improvement, with the F1-score improving by 2.21%.</p>
      <p id="paragraph-1610713a1e0f4e6caf257629c4f79086"><xref id="x-7faae722ab2c" rid="figure-c7d0bb0059ac487393352df509551c32" ref-type="fig">Figure 4</xref> compares the training and validation accuracy of two models over 50 epochs. <xref id="x-e541c2a08afa" rid="figure-c7d0bb0059ac487393352df509551c32" ref-type="fig">Figure 4</xref>  <bold id="strong-77b331ac9cdb4797a3399e17de95cea7">(a)</bold> shows the performance of a CNN, where training accuracy stabilises close to 1.0, but validation accuracy fluctuates and stabilises around 0.975, showing potential overfitting. <xref id="x-f0f85d13b45d" rid="figure-c7d0bb0059ac487393352df509551c32" ref-type="fig">Figure 4</xref><bold id="strong-d4cfb8a7e37d4fc1a9a0cbb861184f2c"> (b)</bold> depicts the performance of a combined CNN and Transformer model, with both training and validation accuracies increasing rapidly and stabilising close to 1.0, representing better generalisation and reliable performance. The enclosure of transformers augments the model's capability to capture complex patterns, leading to superior overall performance compared to CNN alone. <xref id="x-a4b4d7cebdd2" rid="figure-dce411931cc3495d9ff6d0c90630dd98" ref-type="fig">Figure 5</xref><bold id="strong-e5c8693af73344d790bb6c145df1114e"> (a)</bold> shows that the training loss decreases rapidly and stabilises near zero, while the validation loss also decreases but remains higher and fluctuates more, suggesting potential overfitting. <xref id="x-e430aa608fc1" rid="figure-dce411931cc3495d9ff6d0c90630dd98" ref-type="fig">Figure 5</xref><bold id="strong-7a9758107e1f428f99e22103af649800"> (a)</bold> shows that the training and validation losses decrease rapidly and closely follow each other, stabilising near zero, indicating that the TransNet-based CNN model achieves better generalisation and more stable performance on both training and validation data. </p>
      <fig id="figure-c7d0bb0059ac487393352df509551c32" orientation="portrait" fig-type="graphic" position="anchor">
        <label>Figure 4 </label>
        <caption id="caption-a8aa4da1e929497c9518cedb9ed6dbe6">
          <title id="title-845be8bea6664076bc56cb32153dfd94">
            <bold id="strong-0b013a446e444b2c9bd991b646186cc3">(a) Training and validation accuracy of CNN (b) Training and validation accuracy of </bold>
            <bold id="strong-69eeeabb6fd04e3e97bfde532ceb5111">TransNet-based CNN</bold>
          </title>
        </caption>
        <graphic id="graphic-8232df917bd54f84abfee7bbe2d4da73" xlink:href="https://s3-us-west-2.amazonaws.com/typeset-prod-media-server/192e8cfc-4eae-4188-9772-a2f773280398image4.png"/>
      </fig>
      <p id="paragraph-df392286d9d0423d961ac36f579791d0">As shown in <xref rid="figure-d73419768f6148fb8c1cde9c3c5bf1f8" ref-type="fig">Figure 6</xref>, <xref rid="figure-b2df51ac421e458797d1fa9b8b0f3029" ref-type="fig">Figure 7</xref>, the CNN with TransNet performs better than the basic CNN. The proposed model demonstrates an accuracy of 86% for class S and 83% for class F, despite these classes having a smaller quantity of data.</p>
      <fig id="figure-dce411931cc3495d9ff6d0c90630dd98" orientation="portrait" fig-type="graphic" position="anchor">
        <label>Figure 5 </label>
        <caption id="caption-b21452a6c77e4b38b171247c966e1867">
          <title id="title-b7e80ea73417474aae625e4c0fcc5829">
            <bold id="strong-35f8cefdf52c4ee9ac95b7ea5be91afb">(a) Training and validation loss of CNN (b) Training and validation loss of</bold>
            <bold id="strong-22e50fdf56ca463d980cccabbb648dd9">TransNet-based CNN</bold>
          </title>
        </caption>
        <graphic id="graphic-47afc5a97ff04bdc8c1acb5ff9e70eb3" xlink:href="https://s3-us-west-2.amazonaws.com/typeset-prod-media-server/192e8cfc-4eae-4188-9772-a2f773280398image5.png"/>
      </fig>
      <fig id="figure-d73419768f6148fb8c1cde9c3c5bf1f8" orientation="portrait" fig-type="graphic" position="anchor">
        <label>Figure 6 </label>
        <caption id="caption-2b1509cd4104486da18e367f9c7d7899">
          <title id="title-8a2faa0019a94fc6a296008047605d78">
            <bold id="strong-140d645e95584fe6b71197a44abd4a07">Confusion matrix of </bold>
            <bold id="strong-e8b1d47086be4bd4a301d189ded0a1b5">TransNet-based CNN (in probability)</bold>
          </title>
        </caption>
        <graphic id="graphic-df22d4d140c54bf098cb87b0396dbf2a" xlink:href="https://s3-us-west-2.amazonaws.com/typeset-prod-media-server/192e8cfc-4eae-4188-9772-a2f773280398image6.png"/>
      </fig>
      <fig id="figure-b2df51ac421e458797d1fa9b8b0f3029" orientation="portrait" fig-type="graphic" position="anchor">
        <label>Figure 7 </label>
        <caption id="caption-d881463346d84751bd4b6b673544c242">
          <title id="title-3eb69a174fad4837b53b33ad76e8985e">
            <bold id="strong-b7fdd132d4e34cfdb8d0c659d3f0ff9a">Confusion matrix of CNN (in probability)</bold>
          </title>
        </caption>
        <graphic id="graphic-4125563cbf0c40e18a0b9fbd11f08086" xlink:href="https://s3-us-west-2.amazonaws.com/typeset-prod-media-server/192e8cfc-4eae-4188-9772-a2f773280398image7.png"/>
      </fig>
      <table-wrap id="table-wrap-e3f308bf9daf47b3bcd5e5f5f99a1502" orientation="portrait">
        <label>Table 1</label>
        <caption id="caption-050799c39fd04f358ed4f07e1bd06578">
          <title id="title-e2b9dafabeb845ddb802a88b3707f670">
            <bold id="strong-500bbf8192e84a88a4e7e6915d42dc14">Performance comparisons of </bold>
            <bold id="strong-3fa855be37d44fafa2893464c371d531">TransNet-based CNN with other models</bold>
          </title>
        </caption>
        <table id="table-181bd26093044562a403ca94d3101e0c" rules="rows">
          <colgroup>
            <col width="24.44"/>
            <col width="16.61"/>
            <col width="15.18"/>
            <col width="16.620000000000005"/>
            <col width="12.699999999999998"/>
            <col width="14.450000000000001"/>
          </colgroup>
          <tbody id="table-section-c50fdf8801b34c73b272a11309a1bb10">
            <tr id="table-row-02824966f98b4a94855eec8926485639">
              <td id="table-cell-fedc542229cf4526b459f2595b8fdc4b" align="left">
                <p>
                  <bold>
                    <p id="paragraph-2e51b4fb9e88418abadca9400cc6a144">Model</p>
                  </bold>
                </p>
              </td>
              <td id="table-cell-19b8e679fbae4cff913b9de194644e8e" align="left">
                <p>
                  <bold>
                    <p id="paragraph-54da37be94094724824f1d25e8cfb0df">Year of publication</p>
                  </bold>
                </p>
              </td>
              <td id="table-cell-a4c8e1c69120423ea90575a81145cb09" align="left">
                <p>
                  <bold>
                    <p id="paragraph-01827bd7ba2440f392bef54ce6cf5190">Accuracy</p>
                  </bold>
                </p>
              </td>
              <td id="table-cell-6aaa8bff6e7547949aa7a99cc39c8b5d" align="left">
                <p>
                  <bold>
                    <p id="paragraph-fb3cae44b79b4a00acdc1956d5024885">Precision</p>
                  </bold>
                </p>
              </td>
              <td id="table-cell-873626a05c9c4c5fac41143f59726f70" align="left">
                <p>
                  <bold>
                    <p id="paragraph-11f0c334769948eaaae18240f8f410eb">Recall</p>
                  </bold>
                </p>
              </td>
              <td id="table-cell-61a0e6caefca4a3fb30621ff988d8ec4" align="left">
                <p>
                  <bold>
                    <p id="paragraph-e71ce7c4b5df4a4a98abd428fc3878da">F1-score</p>
                  </bold>
                </p>
              </td>
            </tr>
            <tr id="table-row-a363d6858d7a41179ae3281a40bb2723">
              <td id="table-cell-c74d393011804a328358b8849f3e600f" align="left">
                <p id="paragraph-012e8d79d39d439b9c897e6264fbf60a"> Transformer <xref id="xref-262802d9837f48b5b897a30a19f47347" rid="R273086533403601" ref-type="bibr">22</xref></p>
              </td>
              <td id="table-cell-a24009173a76445d856ab95f47a808b5" align="left">
                <p id="paragraph-88ee27d652e246d988d07d3223357ed8"> 2022</p>
              </td>
              <td id="table-cell-d9792468eaa34d3d9cee6f3484cac42e" align="left">
                <p id="paragraph-7c440e3bf61e47f481a4a3cb3b59a374"> 90.52%</p>
              </td>
              <td id="table-cell-e5c94c8a5a8a4bbc8ffa6c6a8728a100" align="left">
                <p id="paragraph-a03dc713384744e89cc1e1d88c9dd24c"> 88.50%</p>
              </td>
              <td id="table-cell-406501075f2c4bedbe041de62db3fc4d" align="left">
                <p id="paragraph-acac2b143b654f20b0db3c45c2adf5ee"> 86.46%</p>
              </td>
              <td id="table-cell-58f3bf949d1f4fc18112e470bc53c2a9" align="left">
                <p id="paragraph-f0a9b6c454244763941a97971f767071"> 87.47%</p>
              </td>
            </tr>
            <tr id="table-row-0a211249d9fc40fa950761ebea3daf2b">
              <td id="table-cell-e7665c9d16954964bf3066f02223f95d" align="left">
                <p id="paragraph-707dd7ca6e814089a901303d22f9d112"> CNN-LSTM <xref id="x-542b52a092f9" rid="R273086533403710" ref-type="bibr">23</xref> </p>
              </td>
              <td id="table-cell-f7678841e6e346618a6d3b56ff61e61a" align="left">
                <p id="paragraph-5d77b66e750a47e3bafb943486ba7644"> 2021</p>
              </td>
              <td id="table-cell-d8936f65bcc348739ae11f230e8ddccd" align="left">
                <p id="paragraph-8fd31cd833954298a647115146b4ceb1"> 95.81%</p>
              </td>
              <td id="table-cell-5a5f4e0d558742c08853c171e39f4b8a" align="left">
                <p id="paragraph-5a70701c53714146b102d73658cc45f7"> 74.94%</p>
              </td>
              <td id="table-cell-f87f1d7d23b240299a69120a7e21b1f6" align="left">
                <p id="paragraph-12db7b9ff92349fab4c94e7fc14dbc8a"> 69.20%</p>
              </td>
              <td id="table-cell-8949a7fa453f4b05865a28312ebf4b62" align="left">
                <p id="paragraph-5611e0403c0947d8837cc63e153cddd6"> 71.06%</p>
              </td>
            </tr>
            <tr id="table-row-f844ffa11c4d493cafdced278b8bdcb9">
              <td id="table-cell-8e0920724fd44894a77840b95717ee86" align="left">
                <p id="paragraph-b10d94c92bb842eea15cd908fbda0738"> CNN <xref id="x-dc59c91a5ba2" rid="R273086533403707" ref-type="bibr">20</xref> </p>
              </td>
              <td id="table-cell-c34ed3df1d3e4d26a7d3c59466c444d9" align="left">
                <p id="paragraph-a343373fbb894cf09fde957d10863306"> 2021</p>
              </td>
              <td id="table-cell-7aa6a5e7d038424dac03081a2eca51bc" align="left">
                <p id="paragraph-cd0b79850bd74e42838265782b0e1901"> 94.70%</p>
              </td>
              <td id="table-cell-b295307fd4e7417daaf2b4075fd9f3fe" align="left">
                <p id="paragraph-253bc5cc255b48d7a4b065f6d7005e6b"> 93.70%</p>
              </td>
              <td id="table-cell-4289be457a2b469a8fe29ba7453e6f1c" align="left">
                <p id="paragraph-2381b9d0b64e42db96b0781fdf21313a"> 89.00%</p>
              </td>
              <td id="table-cell-b316f4cfbdc34cdab95eabf9bbaccb86" align="left">
                <p id="paragraph-9095596e13334e7ba1b15228581b4611"> 88.90%</p>
              </td>
            </tr>
            <tr id="table-row-70b9637bdb36410cb7fdbd438d32f964">
              <td id="table-cell-b0aa2d995e824554b7780f44b708ccd5" align="left">
                <p id="paragraph-bb9b911320424afc8a3f64e04d688586"> Ensemble Multilabel Classification <xref id="x-587937253c1a" rid="R273086533403711" ref-type="bibr">24</xref></p>
              </td>
              <td id="table-cell-7577ef0291cd43659dcfff679b9428d6" align="left">
                <p id="paragraph-90b01575f55b49db8e001235c172c5a8"> 2020</p>
              </td>
              <td id="table-cell-10d211d3b107481e92c9416cd22f325f" align="left">
                <p id="paragraph-7febe6482d2849abbf04ad0da8955eb6"> 75.20%</p>
              </td>
              <td id="table-cell-2eacd62a116649a389ebfc8a96037b3a" align="left">
                <p id="paragraph-ce7a6bbadb6540ccbff84d005c737863"> 80.80%</p>
              </td>
              <td id="table-cell-43c92db071614b5fbbd0895bf0107f63" align="left">
                <p id="paragraph-3ba47d58b86f4788954ff61f8202af53"> 71.60%</p>
              </td>
              <td id="table-cell-9e8c7390a8304a01be1d2ce67ac74581" align="left">
                <p id="paragraph-0fd51369114b4caeadaf3b3e179ef6aa"> 75.20%</p>
              </td>
            </tr>
            <tr id="table-row-8202d9f72e9a4824925186a84327d32d">
              <td id="table-cell-3e0f298a13a3426eae7050c51c7cac31" align="left">
                <p id="paragraph-649b5e0c496643ff93a593c2b7031936"> 1D CNN <xref id="xref-ba3454d76efe400bb22806363cdca69e" rid="R273086533403593" ref-type="bibr">9</xref></p>
              </td>
              <td id="table-cell-8740ed6bc1f440e0bfd4a22ba484c741" align="left">
                <p id="paragraph-471dfdf2f1b546bbbff8e8cc216392d6"> 2020</p>
              </td>
              <td id="table-cell-5c1ac08910e5431eafc06ef158a019ba" align="left">
                <p id="paragraph-a680420ed7c64e359aee503d09833bb2"> 97.45%</p>
              </td>
              <td id="table-cell-ec230c2747db43a981730593375649de" align="left">
                <p id="paragraph-fe863bfda9b246958d6e961eb62a548b"> ---</p>
              </td>
              <td id="table-cell-31d213e66f4d4ddca46e5a4687def864" align="left">
                <p id="paragraph-99ccc1faf5b94a20a01766673edb0e02"> 97.00%</p>
              </td>
              <td id="table-cell-6ab4d183dd9847589317654dd4fcb429" align="left">
                <p id="paragraph-21331bfc9bea4afdaa5c1dc910d80d3c"> 97.00%</p>
              </td>
            </tr>
            <tr id="table-row-f8bf402cb3b449ecbb1e0879258dc11a">
              <td id="table-cell-46c00dadab0c489fba159722d35ea99a" align="left">
                <p id="paragraph-350636345b40445392325bbaf385198b"> CNN-BiLSTM <xref id="x-99f850be21ae" rid="R273086533403745" ref-type="bibr">25</xref></p>
              </td>
              <td id="table-cell-cea678b54d474698a90c616de8acc9cc" align="left">
                <p id="paragraph-f90c0f9f87504c058c01ca0cb6dc9378"> 2020</p>
              </td>
              <td id="table-cell-a434990405fe452ab2b4b115f9bb8435" align="left">
                <p id="paragraph-aabeaff6976d496f986f5191c5d2ad46"> 96.77%</p>
              </td>
              <td id="table-cell-f4fdce17998a4fa4bb9dad48915d2060" align="left">
                <p id="paragraph-9e8d1e50f60f4fc3b0d9f64fffa7f313"> 81.24%</p>
              </td>
              <td id="table-cell-5cca9fb2a1644976bb1744abf306ce64" align="left">
                <p id="paragraph-c434754a34a74068a671b5415c80434c"> 74.89%</p>
              </td>
              <td id="table-cell-24d4cacafcc540cda6b91c705a0f3901" align="left">
                <p id="paragraph-31a5ee4d071d42dbafb2ebd963452e9d"> 77.84%</p>
              </td>
            </tr>
            <tr id="table-row-360632308fc94251bb12c5ef915aab23">
              <td id="table-cell-e907f128970643eab819c1092bd06a85" align="left">
                <p id="paragraph-86c96904ba714768b660e68ba8b417da"> CNN <xref id="x-8937f7e4bd07" rid="R273086533403746" ref-type="bibr">26</xref> </p>
              </td>
              <td id="table-cell-4682d706407644fd8bef0b0a29ec151c" align="left">
                <p id="paragraph-e4c1e3c0ec6242b191dc2213c3b819f9"> 2019</p>
              </td>
              <td id="table-cell-ffb9d9c1b36e4846aa70d4f3f122e41b" align="left">
                <p id="paragraph-c3827b93cd9a423e852fa2a2a092b2d4"> 93.71%</p>
              </td>
              <td id="table-cell-80bfa90b37a846ed8acff6e8a2cad3e7" align="left">
                <p id="paragraph-0c2645851f854c0bb255c9a1a7828f3a"> 88.30%</p>
              </td>
              <td id="table-cell-a4914fe1ef884ee2b5e636d0b5f2bed7" align="left">
                <p id="paragraph-f1450597cce948ae94ad20453b49ae61"> 91.25%</p>
              </td>
              <td id="table-cell-2cff7682bd534ea6ae71763d46e2bbe0" align="left">
                <p id="paragraph-dc2a45869d0d45b5a7d8119cb749aa22"> 89.75%</p>
              </td>
            </tr>
            <tr id="table-row-5fb85c1dde4f419f8fd33969d9c7026e">
              <td id="table-cell-d7b54a2de55f48c9b4b4cc7f70e787d2" align="left">
                <p id="paragraph-1c336fbe5dab4133a392faef3e15af5b"> 1D CNN <xref id="x-e5924f1a9b42" rid="R273086533403747" ref-type="bibr">27</xref> </p>
              </td>
              <td id="table-cell-2c4f0e83db7441d48e5436be3a8d0699" align="left">
                <p id="paragraph-70e691a535624ea890191c322f36a40b"> 2018</p>
              </td>
              <td id="table-cell-f1ac860b6c24408a8e6a92552a5a0358" align="left">
                <p id="paragraph-bca4bf864da04344bced3df710c425f7"> 95.20%</p>
              </td>
              <td id="table-cell-96696d279acf48ab83b32b2b6b40dc40" align="left">
                <p id="paragraph-dacad3e616cf4c9f8f993afc032d8418"> 92.52%</p>
              </td>
              <td id="table-cell-b6d8a5aa27f64484bd2e474c1a9c9a67" align="left">
                <p id="paragraph-7ed69aad1f714724b14038c4d1ec6079"> 93.52%</p>
              </td>
              <td id="table-cell-0571100f33974d60be18ee68abb58674" align="left">
                <p id="paragraph-9768531f78f8402ab905a5160bdcdbbf"> 92.45%</p>
              </td>
            </tr>
            <tr id="table-row-1c06f247cbfe4f0688f4160e1f3967c3">
              <td id="table-cell-11b4efde0dd54b0e9c950590092eae7d" align="left">
                <p id="paragraph-0edf48393b684084a833eaa971dd29ee"> Proposed-CNN</p>
              </td>
              <td id="table-cell-a2d6b346c7934160903898345012d7fe" align="left">
                <p id="paragraph-c0ee23ac957f492baa8619482c8a64f5"> <bold id="strong-1c8eb8ceebc742e98ea936002375b0dc">----</bold></p>
              </td>
              <td id="table-cell-9f13e8cbf4df4b22b7d15f3dd296b149" align="left">
                <p id="paragraph-6d5d68cfc54d4ab48a7439c81e68578d"> 96.82%</p>
              </td>
              <td id="table-cell-258ca3deb9294405ba5820f5015a890f" align="left">
                <p id="paragraph-566d058ac75d46f69c087b9f2d1bb59a"> 96.03%</p>
              </td>
              <td id="table-cell-430e74dde28f44e8aa36132cfc11b2f7" align="left">
                <p id="paragraph-80fdba5daecc49f2a4b845b2561041b5"> 96.26%</p>
              </td>
              <td id="table-cell-cad997b6a91f46d1ace084e8e6dd7d8b" align="left">
                <p id="paragraph-55c4b25db7f4444eae672e92209354e9"> 96.31%</p>
              </td>
            </tr>
            <tr id="table-row-eec53702e28f4f47b7aee8c90f23a201">
              <td id="table-cell-69fe8c8e9d6147769995399b2edcb37a" align="left">
                <p id="paragraph-b57c121a832a45e19af1b4c7fe4145bb"> Proposed-TransNet based CNN</p>
              </td>
              <td id="table-cell-1d987649fb254407af6355c306c50b3e" align="left">
                <p id="paragraph-75366d1217ab44548c60add8ffc59f91"> <bold id="strong-ee88ced9b04240308a7eae1f234d86f8">----</bold></p>
              </td>
              <td id="table-cell-3e08597487364c818572f31bac07459a" align="left">
                <p id="paragraph-316b96a23b2247839b1f0318025c361d"> 98.95%</p>
              </td>
              <td id="table-cell-1c6fe4fd8e6f4c47ba4c74912eb57e02" align="left">
                <p id="paragraph-a74c548f646340cba22ab57e82e54718"> 98.35%</p>
              </td>
              <td id="table-cell-84c18c32d83245df9f5cd43d16893069" align="left">
                <p id="paragraph-baa8ce44c7f24eb9a468b173fb3d79f7"> 98.24%</p>
              </td>
              <td id="table-cell-5244a9a6065d4457bc2bbfe9d3d7856d" align="left">
                <p id="paragraph-77997a19e8ba413091a870580861e38f"> 98.52%</p>
              </td>
            </tr>
            <tr id="tr-11bb82a72a5a">
              <td id="tc-1bb5982ee431" align="left">
                <p id="p-8b3c032998bc"/>
              </td>
              <td id="tc-be3e7309eb7e" align="left">
                <p id="p-a9f5862a8b8e"/>
              </td>
              <td id="tc-2cbfdb609a56" align="left">
                <p id="p-19b4ca33d4ae"/>
              </td>
              <td id="tc-26c30f83be08" align="left">
                <p id="p-ae6ade4e2806"/>
              </td>
              <td id="tc-0db11a4cae1d" align="left">
                <p id="p-f24299c7a3c2"/>
              </td>
              <td id="tc-ae07449176dd" align="left">
                <p id="p-bd674a5274f0"/>
              </td>
            </tr>
          </tbody>
        </table>
      </table-wrap>
      <p id="paragraph-2e7daeca69d6497daa58d2477924ffaf"><xref id="x-dc634e186b56" rid="table-wrap-e3f308bf9daf47b3bcd5e5f5f99a1502" ref-type="table">Table 1</xref>  s<bold id="strong-16c7954975c34758bf4030849f15b32f"/>howcases the superior performance of the proposed TransNet-based CNN model, which achieves an impressive accuracy of 98.95% and an F1 score of 98.52%. The table demonstrates that the proposed architecture significantly outperforms other compared models, particularly excelling in accurately classifying low-quantity signals such as Supraventricular and Fusion beats.</p>
      <p id="paragraph-bdc7ceafd4df49cdb80144fbded74454">Implementing an automated Transformer-based CNN system for classifying arrhythmia in a real-world clinical environment presents several potential challenges. Hospitals and clinics use many legacy systems that cannot support advanced deep-learning models; thus, much modification and integration may be necessary for seamless data transfer and processing. Integrations with electronic health records and other medical databases are technically complicated and resource-demanding processes. Healthcare data is most sensitive, and implementing automated systems requires strict adherence to privacy regulations in the healthcare sector, such as HIPAA in the U.S. or GDPR in Europe. Secure data handling, encryption, and anonymisation protocols will protect privacy but add some layers of complexity to the deployment of such a system. Clinical environments would particularly demand near real-time if not real-time, processing of ECG data for expedited patient diagnosis and treatment. Transformer-based models are powerful but could also be computationally expensive, posing significant challenges in achieving adequate speed and efficiency without high-performance hardware. ECG data differ considerably from patient to patient due to age, physical condition, and sensor quality. Noises and artefacts from patient movement, electrode placement, or other environmental reasons can affect signal quality and make accurate classification difficult. The system built should ensure robustness against such variability to limit misdiagnosis. Advanced deep learning models integrated and maintained within healthcare require special hardware, maintenance, and technical support for proper functionality. These costs can be highly prohibitive for smaller clinics or facilities with limited budgets, slowing adoption across diverse clinical environments.</p>
    </sec>
    <sec>
      <title id="title-76837e90513446c2ab39ab71e4b5d81e">4 Conclusion</title>
      <p id="paragraph-fd653197405e405aa56baeb2d4ea7a72">This research proposed an automated categorisation model that combines a CNN and TransNet to categorise ECG signals. Before being fed into the CNN, the ECG signal undergoes pre-processing procedures, including denoising, segmentation, and resampling. The parameter information extracted by the CNN retains temporal characteristics. Through the synergistic combination of the CNN and an enhanced transformer, the proposed model achieves a categorisation accuracy of 98.95%. The suggested model may be used because of its performance and the fact that transformers were included to address the complexity issues with sequential models. This model can be used with wearable technologies in future research to help save more lives in emergencies by providing ongoing monitoring. Future scopes in this direction are promisingly developing Transformer-based CNNs for arrhythmia classification in clinical settings, which extend to all constituencies of healthcare domains. Further work could incorporate these models with wearable devices for health monitoring, such as smartwatches and mobile ECG monitors. This would enable seamless, in situ, immediate arrhythmia detection outside clinics, allowing for remote monitoring and early interventions, especially among high-risk or elderly patients. Automated systems can permit clinicians to monitor and manage patients with arrhythmias who reside in areas inaccessible by specialists. To combine it with telemedicine, future systems can be designed to integrate with existing platforms for reviewing and diagnosing patients by remote specialists, which is highly valuable in managing chronic cardiac conditions.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <title>References</title>
      <ref id="R273086533403599">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Gao</surname>
              <given-names>Junli</given-names>
            </name>
            <name>
              <surname>Zhang</surname>
              <given-names>Hongpo</given-names>
            </name>
            <name>
              <surname>Lu</surname>
              <given-names>Peng</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>Zongmin</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>An Effective LSTM Recurrent Network to Detect Arrhythmia on Imbalanced ECG Dataset</article-title>
          <source>Journal of Healthcare Engineering</source>
          <year>2019</year>
          <volume>2019</volume>
          <fpage>1</fpage>
          <lpage>10</lpage>
          <issn>2040-2295</issn>
          <publisher-name>Wiley</publisher-name>
          <uri>https://doi.org/10.1155/2019/6320651</uri>
        </element-citation>
      </ref>
      <ref id="R273086533403583">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Dong</surname>
              <given-names>Yanfang</given-names>
            </name>
            <name>
              <surname>Zhang</surname>
              <given-names>Miao</given-names>
            </name>
            <name>
              <surname>Qiu</surname>
              <given-names>Lishen</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>Lirong</given-names>
            </name>
            <name>
              <surname>Yu</surname>
              <given-names>Yong</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>An Arrhythmia Classification Model Based on Vision Transformer with Deformable Attention</article-title>
          <source>Micromachines</source>
          <year>2023</year>
          <volume>14</volume>
          <issue>6</issue>
          <fpage>1</fpage>
          <lpage>12</lpage>
          <issn>2072-666X</issn>
          <publisher-name>MDPI AG</publisher-name>
          <uri>https://dx.doi.org/10.3390/mi14061155</uri>
        </element-citation>
      </ref>
      <ref id="R273086533403598">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Sattar</surname>
              <given-names>Shoaib</given-names>
            </name>
            <name>
              <surname>Mumtaz</surname>
              <given-names>Rafia</given-names>
            </name>
            <name>
              <surname>Qadir</surname>
              <given-names>Mamoon</given-names>
            </name>
            <name>
              <surname>Mumtaz</surname>
              <given-names>Sadaf</given-names>
            </name>
            <name>
              <surname>Khan</surname>
              <given-names>Muhammad Ajmal</given-names>
            </name>
            <name>
              <surname>Waele</surname>
              <given-names>Timo De</given-names>
            </name>
            <name>
              <surname>Poorter</surname>
              <given-names>Eli De</given-names>
            </name>
            <name>
              <surname>Moerman</surname>
              <given-names>Ingrid</given-names>
            </name>
            <name>
              <surname>Shahid</surname>
              <given-names>Adnan</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Cardiac Arrhythmia Classification Using Advanced Deep Learning Techniques on Digitized ECG Datasets</article-title>
          <source>Sensors</source>
          <year>2024</year>
          <volume>24</volume>
          <issue>8</issue>
          <fpage>1</fpage>
          <lpage>23</lpage>
          <issn>1424-8220</issn>
          <publisher-name>MDPI AG</publisher-name>
          <uri>https://dx.doi.org/10.3390/s24082484</uri>
        </element-citation>
      </ref>
      <ref id="R273086533403591">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Ansari</surname>
              <given-names>Y</given-names>
            </name>
            <name>
              <surname>Mourad</surname>
              <given-names>O</given-names>
            </name>
            <name>
              <surname>Qaraqe</surname>
              <given-names>K</given-names>
            </name>
            <name>
              <surname>Serpedin</surname>
              <given-names>E</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Deep learning for ECG Arrhythmia detection and classification: an overview of progress for period 2017–2023</article-title>
          <source>Frontiers in Physiology</source>
          <year>2023</year>
          <volume>14</volume>
          <fpage>1</fpage>
          <lpage>20</lpage>
          <uri>https://doi.org/10.3389/fphys.2023.1246746</uri>
        </element-citation>
      </ref>
      <ref id="R273086533403604">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>McSharry</surname>
              <given-names>P E</given-names>
            </name>
            <name>
              <surname>Clifford</surname>
              <given-names>G D</given-names>
            </name>
            <name>
              <surname>Tarassenko</surname>
              <given-names>L</given-names>
            </name>
            <name>
              <surname>Smith</surname>
              <given-names>L A</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>A dynamical model for generating synthetic electrocardiogram signals</article-title>
          <source>IEEE Transactions on Biomedical Engineering</source>
          <year>2003</year>
          <volume>50</volume>
          <issue>3</issue>
          <fpage>289</fpage>
          <lpage>294</lpage>
          <issn>0018-9294</issn>
          <publisher-name>Institute of Electrical and Electronics Engineers (IEEE)</publisher-name>
          <uri>https://dx.doi.org/10.1109/tbme.2003.808805</uri>
        </element-citation>
      </ref>
      <ref id="R273086533403584">
        <element-citation publication-type="inproceedings">
          <person-group person-group-type="author">
            <name>
              <surname>Rashmi</surname>
              <given-names>N</given-names>
            </name>
            <name>
              <surname>Begum</surname>
              <given-names>Ghousia</given-names>
            </name>
            <name>
              <surname>Singh</surname>
              <given-names>Vipula</given-names>
            </name>
            <collab/>
          </person-group>
          <person-group person-group-type="editor"/>
          <article-title>ECG denoising using wavelet transform and filters</article-title>
          <source>2017 International Conference on Wireless Communications, Signal Processing and Networking (WiSPNET)</source>
          <year>2018</year>
          <publisher-name>IEEE</publisher-name>
          <conf-loc>Chennai, India</conf-loc>
          <conf-date>22-24 March 2017</conf-date>
          <fpage>2395</fpage>
          <lpage>2400</lpage>
          <uri>https://doi.org/10.1109/WiSPNET.2017.8300189</uri>
        </element-citation>
      </ref>
      <ref id="R273086533403592">
        <element-citation publication-type="inproceedings">
          <person-group person-group-type="author">
            <name>
              <surname>Guan</surname>
              <given-names>Jian</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>Wenbo</given-names>
            </name>
            <name>
              <surname>Feng</surname>
              <given-names>Pengming</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>Xinxin</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>Wenwu</given-names>
            </name>
            <collab/>
          </person-group>
          <person-group person-group-type="editor"/>
          <article-title>Low-Dimensional Denoising Embedding Transformer for ECG Classification</article-title>
          <source>ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)</source>
          <year>2021</year>
          <publisher-name>IEEE</publisher-name>
          <conf-loc>Toronto, ON, Canada</conf-loc>
          <conf-date>06-11 June 2021</conf-date>
          <fpage>1285</fpage>
          <lpage>1289</lpage>
          <uri>https://doi.org/10.1109/ICASSP39728.2021.9413766</uri>
        </element-citation>
      </ref>
      <ref id="R273086533403603">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Wu</surname>
              <given-names>Mengze</given-names>
            </name>
            <name>
              <surname>Lu</surname>
              <given-names>Yongdi</given-names>
            </name>
            <name>
              <surname>Yang</surname>
              <given-names>Wenli</given-names>
            </name>
            <name>
              <surname>Wong</surname>
              <given-names>Shen Yuong</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>A Study on Arrhythmia via ECG Signal Classification Using the Convolutional Neural Network</article-title>
          <source>Frontiers in Computational Neuroscience</source>
          <year>2021</year>
          <volume>14</volume>
          <fpage>1</fpage>
          <lpage>10</lpage>
          <issn>1662-5188</issn>
          <publisher-name>Frontiers Media SA</publisher-name>
          <uri>https://dx.doi.org/10.3389/fncom.2020.564015</uri>
        </element-citation>
      </ref>
      <ref id="R273086533403593">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Hua</surname>
              <given-names>Xuan</given-names>
            </name>
            <name>
              <surname>Han</surname>
              <given-names>Jungang</given-names>
            </name>
            <name>
              <surname>Zhao</surname>
              <given-names>Chen</given-names>
            </name>
            <name>
              <surname>Tang</surname>
              <given-names>Haipeng</given-names>
            </name>
            <name>
              <surname>He</surname>
              <given-names>Zhuo</given-names>
            </name>
            <name>
              <surname>Chen</surname>
              <given-names>Qinghui</given-names>
            </name>
            <name>
              <surname>Tang</surname>
              <given-names>Shaojie</given-names>
            </name>
            <name>
              <surname>Tang</surname>
              <given-names>Jinshan</given-names>
            </name>
            <name>
              <surname>Zhou</surname>
              <given-names>Weihua</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>A novel method for ECG signal classification via one-dimensional convolutional neural network</article-title>
          <source>Multimedia Systems</source>
          <year>2022</year>
          <volume>28</volume>
          <issue>4</issue>
          <fpage>1387</fpage>
          <lpage>1399</lpage>
          <issn>0942-4962</issn>
          <publisher-name>Springer Science and Business Media LLC</publisher-name>
          <uri>https://doi.org/10.1007/s00530-020-00713-1</uri>
        </element-citation>
      </ref>
      <ref id="R273086533403600">
        <element-citation publication-type="inproceedings">
          <person-group person-group-type="author">
            <name>
              <surname>Khan</surname>
              <given-names>Mohammad Mahmudur Rahman</given-names>
            </name>
            <name>
              <surname>Siddique</surname>
              <given-names>Md Abu Bakr</given-names>
            </name>
            <name>
              <surname>Sakib</surname>
              <given-names>Shadman</given-names>
            </name>
            <name>
              <surname>Aziz</surname>
              <given-names>Anas</given-names>
            </name>
            <name>
              <surname>Tanzeem</surname>
              <given-names>Abyaz Kader</given-names>
            </name>
            <name>
              <surname>Hossain</surname>
              <given-names>Ziad</given-names>
            </name>
            <collab/>
          </person-group>
          <person-group person-group-type="editor"/>
          <article-title>Electrocardiogram heartbeat classification using convolutional neural networks for the detection of cardiac Arrhythmia</article-title>
          <source>2020 Fourth International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC)</source>
          <year>2020</year>
          <publisher-name>IEEE</publisher-name>
          <conf-loc> Palladam, India</conf-loc>
          <conf-date>07-09 October 2020</conf-date>
          <uri>https://doi.org/10.1109/I-SMAC49090.2020.9243474</uri>
        </element-citation>
      </ref>
      <ref id="R273086533403586">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Maweu</surname>
              <given-names>Barbara Mukami</given-names>
            </name>
            <name>
              <surname>Dakshit</surname>
              <given-names>Sagnik</given-names>
            </name>
            <name>
              <surname>Shamsuddin</surname>
              <given-names>Rittika</given-names>
            </name>
            <name>
              <surname>Prabhakaran</surname>
              <given-names>Balakrishnan</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>CEFEs: A CNN Explainable Framework for ECG Signals</article-title>
          <source>Artificial Intelligence in Medicine</source>
          <year>2021</year>
          <volume>115</volume>
          <fpage>102059</fpage>
          <issn>0933-3657</issn>
          <publisher-name>Elsevier BV</publisher-name>
          <uri>https://dx.doi.org/10.1016/j.artmed.2021.102059</uri>
        </element-citation>
      </ref>
      <ref id="R273086533403585">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Zhao</surname>
              <given-names>Yunxiang</given-names>
            </name>
            <name>
              <surname>Cheng</surname>
              <given-names>Jinyong</given-names>
            </name>
            <name>
              <surname>Zhan</surname>
              <given-names>Ping</given-names>
            </name>
            <name>
              <surname>Peng</surname>
              <given-names>Xueping</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>ECG Classification Using Deep CNN Improved by Wavelet Transform</article-title>
          <source>Computers, Materials &amp;amp; Continua</source>
          <year>2020</year>
          <volume>64</volume>
          <issue>3</issue>
          <fpage>1615</fpage>
          <lpage>1628</lpage>
          <issn>1546-2226</issn>
          <publisher-name>Tech Science Press</publisher-name>
          <uri>https://dx.doi.org/10.32604/cmc.2020.09938</uri>
        </element-citation>
      </ref>
      <ref id="R273086533403596">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Golany</surname>
              <given-names>Tomer</given-names>
            </name>
            <name>
              <surname>Lavee</surname>
              <given-names>Gal</given-names>
            </name>
            <name>
              <surname>Yarden</surname>
              <given-names>Shai Tejman</given-names>
            </name>
            <name>
              <surname>Radinsky</surname>
              <given-names>Kira</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Improving ECG Classification Using Generative Adversarial Networks</article-title>
          <source>Proceedings of the AAAI Conference on Artificial Intelligence</source>
          <year>2020</year>
          <volume>34</volume>
          <issue>08</issue>
          <fpage>13280</fpage>
          <lpage>13285</lpage>
          <issn>2374-3468, 2159-5399</issn>
          <publisher-name>Association for the Advancement of Artificial Intelligence (AAAI)</publisher-name>
          <uri>https://dx.doi.org/10.1609/aaai.v34i08.7037</uri>
        </element-citation>
      </ref>
      <ref id="R273086533403594">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Savalia</surname>
              <given-names>Shalin</given-names>
            </name>
            <name>
              <surname>Emamian</surname>
              <given-names>Vahid</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Cardiac Arrhythmia Classification by Multi-Layer Perceptron and Convolution Neural Networks</article-title>
          <source>Bioengineering</source>
          <year>2018</year>
          <volume>5</volume>
          <issue>2</issue>
          <fpage>1</fpage>
          <lpage>12</lpage>
          <issn>2306-5354</issn>
          <publisher-name>MDPI AG</publisher-name>
          <uri>https://dx.doi.org/10.3390/bioengineering5020035</uri>
        </element-citation>
      </ref>
      <ref id="R273086533403588">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Chatfield</surname>
              <given-names>Ken</given-names>
            </name>
            <name>
              <surname>Simonyan</surname>
              <given-names>Karen</given-names>
            </name>
            <name>
              <surname>Vedaldi</surname>
              <given-names>Andrea</given-names>
            </name>
            <name>
              <surname>Zisserman</surname>
              <given-names>Andrew</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Return of the Devil in the Details: Delving Deep into Convolutional Nets</article-title>
          <source>Proceedings of the British Machine Vision Conference 2014</source>
          <year>2014</year>
          <fpage>1</fpage>
          <lpage>11</lpage>
          <publisher-name>British Machine Vision Association</publisher-name>
          <uri>https://doi.org/10.48550/arXiv.1405.3531</uri>
        </element-citation>
      </ref>
      <ref id="R273086533403595">
        <element-citation publication-type="book">
          <person-group person-group-type="author">
            <name>
              <surname>Mark</surname>
              <given-names>R</given-names>
            </name>
            <name>
              <surname>Moody</surname>
              <given-names>G</given-names>
            </name>
            <collab/>
          </person-group>
          <person-group person-group-type="editor"/>
          <source>MIT-BIH Arrhythmia Database Directory</source>
          <publisher-name>Massachusetts Institute of Technology</publisher-name>
          <publisher-loc>Cambridge, MA, USA</publisher-loc>
          <year>1988</year>
          <uri>https://physionet.org/physiobank/database/html/mitdbdir/foreword.htm</uri>
        </element-citation>
      </ref>
      <ref id="R273086533403589">
        <element-citation publication-type="book">
          <person-group person-group-type="author">
            <name>
              <surname>Acharya</surname>
              <given-names>U R</given-names>
            </name>
            <name>
              <surname>Suri</surname>
              <given-names>Jasjit S</given-names>
            </name>
            <name>
              <surname/>
              <given-names>Jos A E Spaan</given-names>
            </name>
            <name>
              <surname>Krishnan</surname>
              <given-names>Shankar M</given-names>
            </name>
            <collab/>
          </person-group>
          <person-group person-group-type="editor"/>
          <source>Advances in Cardiac Signal Processing</source>
          <edition>1</edition>
          <publisher-name>Springer</publisher-name>
          <publisher-loc>Berlin, Heidelberg</publisher-loc>
          <year>2007</year>
          <fpage>XXII, 468 pages</fpage>
          <uri>https://doi.org/10.1007/978-3-540-36675-1</uri>
        </element-citation>
      </ref>
      <ref id="R273086533403597">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Chandrakar</surname>
              <given-names>B</given-names>
            </name>
            <name>
              <surname>Yadav</surname>
              <given-names>O P</given-names>
            </name>
            <name>
              <surname>Chandra</surname>
              <given-names>V K</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>A survey of noise removal techniques for ECG signals</article-title>
          <source>International Journal of Advanced Research in Computer and Communication Engineering</source>
          <year>2013</year>
          <volume>2</volume>
          <issue>3</issue>
          <fpage>1354</fpage>
          <lpage>1357</lpage>
          <uri>https://www.researchgate.net/publication/303155606_A_survey_of_noise_removal_techniques_for_ecg_signals</uri>
        </element-citation>
      </ref>
      <ref id="R273086533403708">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Chen</surname>
              <given-names>Binqiang</given-names>
            </name>
            <name>
              <surname>Li</surname>
              <given-names>Yang</given-names>
            </name>
            <name>
              <surname>Cao</surname>
              <given-names>Xincheng</given-names>
            </name>
            <name>
              <surname>Sun</surname>
              <given-names>Weifang</given-names>
            </name>
            <name>
              <surname>He</surname>
              <given-names>Wangpeng</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Removal of Power Line Interference From ECG Signals Using Adaptive Notch Filters of Sharp Resolution</article-title>
          <source>IEEE Access</source>
          <year>2019</year>
          <volume>7</volume>
          <fpage>150667</fpage>
          <lpage>150676</lpage>
          <issn>2169-3536</issn>
          <publisher-name>Institute of Electrical and Electronics Engineers (IEEE)</publisher-name>
          <uri>https://dx.doi.org/10.1109/access.2019.2944027</uri>
        </element-citation>
      </ref>
      <ref id="R273086533403707">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Wang</surname>
              <given-names>Tao</given-names>
            </name>
            <name>
              <surname>Lu</surname>
              <given-names>Changhua</given-names>
            </name>
            <name>
              <surname>Sun</surname>
              <given-names>Yining</given-names>
            </name>
            <name>
              <surname>Yang</surname>
              <given-names>Mei</given-names>
            </name>
            <name>
              <surname>Liu</surname>
              <given-names>Chun</given-names>
            </name>
            <name>
              <surname>Ou</surname>
              <given-names>Chunsheng</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Automatic ECG Classification Using Continuous Wavelet Transform and Convolutional Neural Network</article-title>
          <source>Entropy</source>
          <year>2021</year>
          <volume>23</volume>
          <issue>1</issue>
          <fpage>1</fpage>
          <lpage>13</lpage>
          <issn>1099-4300</issn>
          <publisher-name>MDPI AG</publisher-name>
          <uri>https://dx.doi.org/10.3390/e23010119</uri>
        </element-citation>
      </ref>
      <ref id="R273086533403590">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Li</surname>
              <given-names>Jia</given-names>
            </name>
            <name>
              <surname>Si</surname>
              <given-names>Yujuan</given-names>
            </name>
            <name>
              <surname>Xu</surname>
              <given-names>Tao</given-names>
            </name>
            <name>
              <surname>Jiang</surname>
              <given-names>Saibiao</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Deep Convolutional Neural Network Based ECG Classification System Using Information Fusion and One-Hot Encoding Techniques</article-title>
          <source>Mathematical Problems in Engineering</source>
          <year>2018</year>
          <volume>2018</volume>
          <fpage>1</fpage>
          <lpage>10</lpage>
          <issn>1024-123X, 1563-5147</issn>
          <publisher-name>Wiley</publisher-name>
          <uri>https://dx.doi.org/10.1155/2018/7354081</uri>
        </element-citation>
      </ref>
      <ref id="R273086533403601">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Hu</surname>
              <given-names>Shuaicong</given-names>
            </name>
            <name>
              <surname>Cai</surname>
              <given-names>Wenjie</given-names>
            </name>
            <name>
              <surname>Gao</surname>
              <given-names>Tijie</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>Mingjie</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>A Hybrid Transformer Model for Obstructive Sleep Apnea Detection Based on Self-Attention Mechanism Using Single-Lead ECG</article-title>
          <source>IEEE Transactions on Instrumentation and Measurement</source>
          <year>2022</year>
          <volume>71</volume>
          <fpage>1</fpage>
          <lpage>11</lpage>
          <issn>0018-9456, 1557-9662</issn>
          <publisher-name>Institute of Electrical and Electronics Engineers (IEEE)</publisher-name>
          <uri>https://dx.doi.org/10.1109/tim.2022.3193169</uri>
        </element-citation>
      </ref>
      <ref id="R273086533403710">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Essa</surname>
              <given-names>Ehab</given-names>
            </name>
            <name>
              <surname>Xie</surname>
              <given-names>Xianghua</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>An Ensemble of Deep Learning-Based Multi-Model for ECG Heartbeats Arrhythmia Classification</article-title>
          <source>IEEE Access</source>
          <year>2021</year>
          <volume>9</volume>
          <fpage>103452</fpage>
          <lpage>103464</lpage>
          <issn>2169-3536</issn>
          <publisher-name>Institute of Electrical and Electronics Engineers (IEEE)</publisher-name>
          <uri>https://dx.doi.org/10.1109/access.2021.3098986</uri>
        </element-citation>
      </ref>
      <ref id="R273086533403711">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Sun</surname>
              <given-names>Zhanquan</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>Chaoli</given-names>
            </name>
            <name>
              <surname>Zhao</surname>
              <given-names>Yangyang</given-names>
            </name>
            <name>
              <surname>Yan</surname>
              <given-names>Chao</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Multi-Label ECG Signal Classification Based on Ensemble Classifier</article-title>
          <source>IEEE Access</source>
          <year>2020</year>
          <volume>8</volume>
          <fpage>117986</fpage>
          <lpage>117996</lpage>
          <publisher-name>Institute of Electrical and Electronics Engineers (IEEE)</publisher-name>
          <uri>https://doi.org/10.1109/ACCESS.2020.3004908</uri>
        </element-citation>
      </ref>
      <ref id="R273086533403745">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Chen</surname>
              <given-names>Aiyun</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>Fei</given-names>
            </name>
            <name>
              <surname>Liu</surname>
              <given-names>Wenhan</given-names>
            </name>
            <name>
              <surname>Chang</surname>
              <given-names>Sheng</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>Hao</given-names>
            </name>
            <name>
              <surname>He</surname>
              <given-names>Jin</given-names>
            </name>
            <name>
              <surname>Huang</surname>
              <given-names>Qijun</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Multi-information fusion neural networks for arrhythmia automatic detection</article-title>
          <source>Computer Methods and Programs in Biomedicine</source>
          <year>2020</year>
          <volume>193</volume>
          <fpage>105479</fpage>
          <issn>0169-2607</issn>
          <publisher-name>Elsevier BV</publisher-name>
          <uri>https://dx.doi.org/10.1016/j.cmpb.2020.105479</uri>
        </element-citation>
      </ref>
      <ref id="R273086533403746">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Guo</surname>
              <given-names>Li</given-names>
            </name>
            <name>
              <surname>Sim</surname>
              <given-names>Gavin</given-names>
            </name>
            <name>
              <surname>Matuszewski</surname>
              <given-names>Bogdan</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Inter-patient ECG classification with convolutional and recurrent neural networks</article-title>
          <source>Biocybernetics and Biomedical Engineering</source>
          <year>2019</year>
          <volume>39</volume>
          <issue>3</issue>
          <fpage>868</fpage>
          <lpage>879</lpage>
          <issn>0208-5216</issn>
          <publisher-name>Elsevier BV</publisher-name>
          <uri>https://doi.org/10.1016/j.bbe.2019.06.001</uri>
        </element-citation>
      </ref>
      <ref id="R273086533403747">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Yıldırım</surname>
              <given-names>Özal</given-names>
            </name>
            <name>
              <surname>Pławiak</surname>
              <given-names>Paweł</given-names>
            </name>
            <name>
              <surname>Tan</surname>
              <given-names>Ru-San</given-names>
            </name>
            <name>
              <surname>Acharya</surname>
              <given-names>U Rajendra</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Arrhythmia detection using deep convolutional neural network with long duration ECG signals</article-title>
          <source>Computers in Biology and Medicine</source>
          <year>2018</year>
          <volume>102</volume>
          <fpage>411</fpage>
          <lpage>420</lpage>
          <issn>0010-4825</issn>
          <publisher-name>Elsevier BV</publisher-name>
          <uri>https://dx.doi.org/10.1016/j.compbiomed.2018.09.009</uri>
        </element-citation>
      </ref>
    </ref-list>
  </back>
</article>
