Principalcomponentanalysis(PCA)isusedtoreducedimensionalityandforfeatureextractionoftheelectrocardiogram(ECG)datapriortoorafterperformingindependentcomponentanalysis(ICA).PCAaloneisnotaveryappropriatetechniqueforthevisualizationofECGdataandnon-lineardimen

(1)

Detection of indeterminacies in corrected ECG signals using parameterized multidimensional independent component analysis

M.P.S. Chawla*

Department of Electrical Engineering, Indian Institute of Technology, Roorkee 247667, India

(Received 9 January 2008; final version received 9 May 2008)

Independent component analysis (ICA) is a new technique suitable for separating independent components from electrocardiogram (ECG) complex signals. The basic idea of using multidimensional independent component analysis (MICA) is to find stable higher dimensional source signal subspaces and to decompose each rotation into elementary rotations within all two-dimensional planes spanned by the coordinate axes useful for diagnostic information of heart. In this paper, ability of ICA for parameterization of ECG signals was felt to reduce the amount of redundant ECG data. This work aims at finding an independent subspace analysis (ISA) model for ECG analysis that allows applicability to any random vectors available in an ECG data set. For the common standards for electrocardiography (CSE) based ECG data sets, joint approximate diagonalization of eigen matrices (Jade) algorithm is used to find smaller subspaces. The extracted independent components are further cleaned by statistical measures. In this study, it is also observed that the value of kurtosis coefficients for the independent components, which represents the noise component, can be further reduced using parameterized multidimensional ICA (PMICA) technique. The indeterminacies if available in the ECG data are to be analysed also using modified version of Jade algorithm to PMICA and parameterized standard ICA (PsICA) for comparative studies.

The indeterminacies if available in the ECG data are reduced in PMICA better in comparison to the analysis done using PsICA. The simulation results obtained indicate that ICA definitely improves signal – noise ratio (SNR) like the other higher order digital filtering methods like Kalman, Butterworth etc. with minimum reconstruction errors.

Here, it is also confirmed that re-parameterization of the standard ICA model results into a ‘component model’ using MICA technique, which is geometric in spirit and free of indeterminacies existing in sICA model.

Keywords: electrocardiogram; parameterization; multidimensional independent component analysis; indeterminacies; reconstruction; statistical thresholds

1. Introduction

Principal component analysis (PCA) is used to reduce dimensionality and for feature extraction of the electrocardiogram (ECG) data prior to or after performing independent component analysis (ICA). PCA alone is not a very appropriate technique for the visualization of ECG data and non-linear dimensionality reduction algorithms from the ECG morphological point of view [24,28]. This is due to the fact that it can only uncover linear relationships in the ECG data, and can be designed to find directions in the ECG data with the highest variance that may not be always the most informative directions. PCA

ISSN 1748-670X print/ISSN 1748-6718 online q2009 Taylor & Francis

DOI: 10.1080/17486700802193153 http://www.informaworld.com

*Email: mpschawla@rediffmail.com; mpschawla@gmail.com Vol. 10, No. 2, June 2009, 85–115

(2)

algorithm will always provide a projection to an orthogonal basis whether the signal of interest can be best decomposed or interpreted within this basis or not [27,28].

ICA has been widely used in data analysis and decomposition in the last three decades and typically aims to solve the blind source separation problem in which a set of unknown sources are mixed in some way to form the data which is available [39,66]. The increasing demand of ICA is generally applied to the ECG data, following the generative ICA model in order to guarantee a feasible algorithm to attain valid results on morphological basis [60].

In the literature, the terms blind source separation and independent component analysis are often used indifferently: they refer to the same model (1) with the same assumptions, pursue the same objectives and are addressed with the same algorithms [41,51]. It is also unfortunate because the term ‘analysis’ refers to the idea of decomposition into smaller, simpler elements and very often this decomposition is into a sum of terms, calling for an additive model rather than a multiplicative model [1,20].

Uniqueness and separability results play a central role in solving blind source separation (BSS) problems since they allow algorithms to apply ICA in order to uniquely (except for scaling and permutation) find the original mixing matrices [4,6,16,30,33,34,43,44,59,64,83].

In ECG, electroencephalogram (EEG) or magnetoencephalogram (MEG) signal processing, large set of electrodes (from 10 in ECG to more than 100 in EEG and MEG) are used, and the signals recorded using electrodes are related to the electric or magnetic fields due to the electrical activity of heart or neurons [53]. ECG source separation consists in retrieving unknown ECG signals, which are observed through unknown mixture of these [53].

ICA has been successfully applied to the processing of electroencephalography signals.

Various methods available in the literature have been proposed in the identification of components related to artefacts in both EEG and MEG. In brain research, ICA has previously been applied in a multitude of cases, including EEG, MEG and magnetic resonance imaging (MRI) analysis [18,38,49,77]. ICA has already proven to be a valuable tool for detecting and enhancing relevant source subspace brain signals, while suppressing irrelevant noise and artefacts such as those produced by muscle activity and eye blinks [6,50,51].

ICA opens new and useful windows into phenomena contained in multi-channel ECG records by separating ECG data recorded at multiple electrodes into a sum of independent components [21 – 23,28,42,56]. In particular, ICA appears to be a generally applicable and effective method for removing a wide variety of artefacts and noise from ECG records [25,26,28,42,62].BasicallyECG processing consists of three main steps:

(i) A measurement, which produces some raw ECG data

(ii) A data analysis algorithm that extracts parameters of interest from the ECG data and (iii) A decision or interpretation rule based on the extracted parameters.

Modelling ECG data as consisting of convolutive as well as static independent processes allows a richer palette for source modelling, possibly leading to more complete signal independence. ICA yields independent components (ICs) displaying more clearly the investigated properties of the original ECG sources [28,41]. The goal of multidimensional independent component analysis (MICA) for ECG analysis lies in the linear separation of ECG data into statistically independent groups of ECG signals [72,75].

Independent subspace analysis (ISA), nowadays an attractive alternative for dimensionality reduction, however, are currently limited by the assumption of equal group sizes or less general semi-parametric models. By introducing the concept of irreducible independent subspaces or components, one can present a generalization to a parameter free mixture of an ECG model. However, the condition of utmost one Gaussian

(3)

can be relaxed by including previous results on non Gaussian component analysis of an ECG data set. After introduction of this general model, joint block diagonalization with unknown block sizes can be discussed, on the basis of a simple extension to joint approximate diagonalization of eigen matrices (Jade) can be algorithmically performed to better understand the subspace analysis. Simulations of different typical case studies carried out on an ECG data set only can confirm the feasibility of the developed ISA algorithm [75].

1.1 Existing ECG techniques: A review

During ECG recording some of the unwanted signals (artefacts) interfere with ECG signal, as a result of which analysis of the ECG signal and extraction of diagnostic features from ECG becomes difficult [20,42,76]. Filtering the noise components instead of discarding them may result in a processed signal, which is more representative of the original data [12,23,28,81]. It should be noted that the ECG does not have the same appearance in all the leads of the standard 12-lead system used in clinical practice. The ECG polarity and the shape of the ECG constituent waves may change depending on the lead used and its location [24,41,80].

Recently, there are many approaches which have been proposed in literature that involve techniques for computer processing of 12-lead ECG in order to diagnose certain diseases. Numerous techniques [8,9,12,14,41,50,76,78,81] have been developed to recognize and analyse ECG waves, ranging from digital filtering techniques to intelligent techniques such as neural networks, fuzzy logic and spectro-temporal-based techniques. A first group of methods to interpret the ECG significance uses a morphological analysis [23,28,42]. A second group of techniques for computer analysis of ECG uses statistical models [7,20,37,57,63]. A third category of methods corresponding to neural models becomes a powerful concurrent to statistical ones for ECG signal classification [9,24,81].

In the past, a systematic comparison among several neural paradigms and architectures and between neural and statistical approach has been performed on a large validated ECG databases, in order to investigate the behaviour and the features of each approach in the particular problem of the ECG analysis. In the wavelet or discrete Fourier transform (DFT) domain approaches, it is common practice to window and taper the ECG data; hence, the choice of the optimal window size is a problem in practice [8,9,78].

When applying unsupervised learning techniques in ECG data analysis, a key question is whether the estimated parameters of the studied system are reliable. In other words, can one assess the quality of the result produced by the proposed learning technique? In the past years, several ECG analysers based on statistical methods, clustering methods, expert systems and Markov models have been developed and implemented to solve the problem of the time consuming ECG analysis [1,12,20,81]. That their reliability is less, together with their high sensitivity to noise and their failure to deal with new or ambiguous patterns, leads the research towards investigation of new analysis techniques. Artificial neural networks (ANN) have been often proposed as good classifiers when non-linear separation borders are required and incomplete or ambiguous input ECG patterns can be found. In the last few years the connectionist approach has been applied to the ECG analysis with promising results [22,23,27]. Once the reliability of a solution can be estimated, then one can use the information for model/algorithm selection purposes, for testing model validity and for improving the used algorithm [25,61]. Note that re-sampling is completely general and can be applied to assess reliability of any unsupervised learning algorithms, e.g.

projection techniques such as instance independent component analysis, principal component analysis, kernel PCA, clustering and so on [5,11,36,51].

(4)

1.2 PCA preprocessing

Signal processing in general has tremendously changed during the last 20 years and it is expected to change even more in the years to come. What was earlier visualized as digital signal processing now forms only a small part of the new concept of signal processing which might be more adequately explained as the methods of analysing, manipulating and conveying natural information? Feature extraction is basically reduction of the available information maintaining ECG morphology [28,29,61]. Features are representatives of identification to a particular subject or specimen. Analysis and feature extraction from electrocardiograms is difficult until and unless artefacts and noise from the ECG are removed, there are so many techniques available in the literature [25,26,41].

PCA is the optimal linear technique, which retains the maximum amount of variance (amongst all linear projections) within the projected feature space [28,36,40]. PCA is often used in ECG pattern recognition and the transform produce features that are mutually uncorrelated. The results obtained by the Karhunen – Loeve Transform (KLT) solution are optimal, when dimensionality reduction is the goal and it is desired to minimize the approximation mean square error. PCA preprocessing process consists of centring as well as whitening steps and the centring step is done by subtracting the mean of the observed ECG data, which provides zero mean ECG data. The whitening step is used to remove the correlation between the observed ECG data. A common method to achieve whitening is by the eigenvalue decomposition of the covariance matrix of the mixed ECG signal [13,28,37].

Figure 1(a) shows the basic ECG waveform, representing its basic component waves, whereas Figure 1(b) explains a philosophy for PCA preprocessing of ECG data. Some preprocessing [28,57] is useful before attempting to estimatew. The important points to be considered are:

(i) The observed signals should be centred by subtracting their mean value E{x}.

(ii) Then they are whitened, which means they are linearly transformed so that the components are uncorrelated and has unit variance.

(iii) Whitening can be performed via eigen value decomposition of the covariance matrix, VLV^T, whereVis the matrix of orthogonal eigenvectors andLis a diagonal matrix with the corresponding eigenvalues. The whitening is done by multiplication with the transformation matrixP.

P¼VL²¹⁼²V^T ð1Þ

z¼P{x2E½x}: ð2Þ

The covariance of the whitened data E [ZZ^T] equals the identity matrix, I. For simplicity, letxbe the centred mixed vector,{x2E[x]} i.e.x¼{x2E[x]}.

Figure 1. (a) ECG waveform. (b) PCA preprocessing of ECG data.

(5)

1.3 Blind source separation

Source separation methods have been developed exclusively for a long time for linear mixtures, instantaneous as well as convolutive, and more recently by a few researchers even for nonlinear mixtures in biomedical applications [55]. In most of the general cases, the underlying assumptions made on the sources are that they are statistically independent.

The ICA or BSS model can be uniquely identified provided it is properly parameterized in terms of one-dimensional subspaces. From this standpoint, BSS/ICA model performs better if generalized to multidimensional components [18,35,37,40,68]. In BSS methods, the ECG electrodes are assumed to measure a mixture of some latent sources, and both the ECG sources and the mixing process are unknown. Some undesired signals may be superimposed on ECG signals of interest and must be considered artefacts. These artefacts may have a biological origin, like reciprocal contaminations of muscle activity and heart cycles, respectively in ECG and EMG recordings [8,9,11,50]. These motion artefacts produce base line drifts, which can compromise vital signs parameters extrapolation [22,23,27 – 29,42]. In ECG recordings, it is possible that artefacts and noise can cause the loss of the main features, like QRS complex, P and T waves.

Linear as well as non-linear filtering and adaptive filtering can be listed among the techniques available, in order to recover the features of interest from ECG recordings.

Other approaches take advantage from the multi-channel nature of the acquisitions with the aim of searching for common features present in different ECG channels acquired simultaneously. Such approaches may be useful in situations when the frequency content of signal and noise overlap, and filtering operation cannot reduce noise without losing signal diagnostic information.

Second order blind identification (SOBI) techniques consist of separating an ECG mixture of independent sources with different spectral content through second order analysis, considering also temporal information of the ECG sources. For this purpose, SOBI methods aim to find a transformation that diagonalizes the cross correlation matrix at several lags simultaneously. Since there may exist no transformation which accomplishes that condition, a function that measures the diagonalization at different lags must be defined in order to maximize the joint diagonalization criteria. From Darmois’s result [17], one deduces that this problem has no solution for mutually independent Gaussian sources, with temporally independent and identically distributed (iid) samples. Then, since the Gaussian iid model has no solution, one must add priors, which are threefold [53,54] for success of an application:

(i) Non Gaussian iid.

(ii) Gaussian but not temporally independent (the first i of iid is relaxed), i.e. temporally correlated.

(iii) Gaussian, but not identically distributed (id of iid is relaxed), i.e. non stationary sources.

1.4 Standard ICA (sICA)

Independence is not always a strong constraint of ICA and independence assumption is generally true, but the nature of signals suggests using other priors: for instance, ECG is sparse signal, and most of the biological signals are temporally correlated and non- stationary [53]. ICA can be viewed as a maximum projection algorithm so as to find the sources that are maximally independent. This is quite meaningful when studying cardiac processes, as it is unreasonable to assume that the activity in one part of the heart is completely independent of the activity in various other parts of the heart [66]. Strictly

(6)

speaking, it is not necessary to assume for the information maximization in ICA that the sources do not have a Gaussian distribution. In the case of Gaussian distributed signals [22,28,31,39,66,67], ICA merely degenerates to regular old PCA with summarized variance equivalent to information and uncorrelated components also being independent.

ICA is a statistical technique for obtaining independent sources, sfrom their linear mixtures,xwhen neither the original sources nor the actual mixing matrix,Aare known as given by Equation (3). This model is easily applied by exploiting higher order signal statistics and optimization techniques. For the application of ICA, it is assumed that the ECG signal,x(t) is generated from sourcess(t) with a linear mixing procedure

xðtÞ ¼A sðtÞ; ð3Þ

where, ‘t’ is a vector of the sampling time with length T.

Also,x(t)¼[x₁(t). . .x_n(t)] isn £ T matrix of noise reduced ECG signal recorded from ‘n’ sensors.s(t)¼[s₁(t). . .s_m(t)] is am £ T matrix ofmsources activations.

Ais an unknownn£mfull rank mixing matrix [66].

Consider the noiseless complex linear instantaneous blind source separation model with as many sources as sensors:x¼As.Here sis an independent complex-valued ‘n’

dimensional random vector andAis an invertible complex matrix [6]. The task of linear BSS is to findAandsgiven onlyx. For the sake of simplicity, the discussion is restricted throughout to the case of zero-mean real signals. The source separation problem may be stated as: Identify mixing matrix A and/or estimate the source signals based only on observations ofxand assuming only

(i) Statistical independence of the ‘sources’ and (ii) Linear independence of the columns ofA.

The strength of this model is that the two independence assumptions stated above are physically plausible in several instances and are strong enough to provide some kind of identifiability, thus alleviating the need of any further modelling of the source distributions or of the mixing matrix [28,42,46,51,66]. ICA aims to find the demixing matrixw, such that

s¼w x; ð4Þ

wherewis whitening matrix.

The mixing/demixing process can be described as a change of coordinates. This means we consider the sourcess_ias components of a vectorswith respect to an orthonormal basis and the observationsx_jas the components of the very same vector in terms of a different basis. The demixing matrix thus helps to find the sourcess. To simplify the estimation of these independent sources we can start by decorrelating the mixtures, (whitening or sphering). This makes the covariance matrix ofx diagonal and its components of unit variance. ICA then uses higher order statistical information (kurtosis, negentropy, etc) to estimate the independent sources [11,20,42]. The prime requirement for applying ICA is that a number of simultaneously measured ECG signals carry linear combinations of the original source ECG signals.

1.5 Outline of the present analysis

In the present work, Jade algorithm for sICA is to be first applied to 3-channel ECG, common standards for electrocardiography (CSE) database files and the ECG waveforms are separated as independent components. The statistical corrections are then applied to

(7)

the extracted independent components to find noise or artefact components. In this paper, it is also proposed that the ability of ICA for parameterization of ECG signals may be necessary at times [22,29,61]. Properly parameterized ECG signals provide a better view to the extracted ECG signals, while reducing the amount of ECG data [11,70]. ICA yields independent components displaying more clearly the investigated properties of the original ECG sources. ICs of properly parameterized ECG signals may also be more readily interpretable than the measurements themselves, or their ICs. The basic idea of using MICA [15,20] is to find stable higher dimensional source signal subspaces and to decompose each rotation into elementary rotations within all two-dimensional planes spanned by the coordinate axes for useful diagnostic information of the heart. In this work, multidimensional blind source separation (MBSS) [75] i.e. the recovery of underlying sourcessfrom an observed mixturexof ECG signals is utilized. As usual,shas to fulfil additional properties such as independence or diagonality of the auto-covariances (ifs possesses time structure). However in contrast to ordinary blind source separation, MBSS is more general as some source ECG signals are allowed to possess common statistics. One possible solution for MBSS is multidimensional independent component analysis.

The idea of MICA is that one does not require the full independence of the transform y¼wxbut only mutual independence of certain tuples,yi1,. . .,yi2. If the size of all tuples is restricted to one, this reduces to ordinary ICA. In general, of course the tuples could have different sizes, but for the sake of simplicity we assume that they all have the same lengthk[6,15,73,74].

In this study, it is also to be verified that the value of kurtosis and Varvar coefficients for the independent components, which represents the noise and artefacts components, can be further reduced using parameterized multidimensional ICA (PMICA) technique or not.

The indeterminacies if available in the ECG data are to be analysed also using a modified version of Jade algorithm to PMICA and parameterized standard ICA (PsICA) for comparative studies.

The salient objectives of the proposed work are:

(a) This study aims at finding an ISA model for ECG analysis that allows applicability to either one or all random vectors available in an ECG data set.

(b) For the CSE based ECG data sets, it is desired to apply joint approximate diagonalization of eigen matrices (Jade) algorithm and its modified version are suggested as well re-implemented in this study, for estimating the smaller subspaces.

(c) The basic idea of applying MICA in this study, was to find stable higher dimensional source signal subspaces and to decompose each rotation into elementary rotations within all two-dimensional planes spanned by the coordinate axes for getting useful diagnostic information of heart for CSE based ECG data sets with and without noise as well as artefacts.

(d) Parameterization of ECG signals is to be properly done, in order to provide a better view to the extracted ECG signals, while reducing the amount of ECG data, which may be of redundant nature, possessing almost no useful diagnostic information.

2. Summarization of issues and imposed conditions before applying ICA

Because of the very weak assumptions in applying this, the problem is often referred to as BSS. The method based on the property of source independence has been called ICA [52,55]. In fact, one often has priors on the ECG signals, but not necessary always. A natural idea is then to add the priors in the proposed or developed ECG model, for simplifying or improving the separation methods [55]. Nonlinear mixtures are not in

(8)

general separable [69,82] and a practically important case of nonlinear mixtures is post nonlinear (PNL) mixtures [55], in which a linear mixture is followed by nonlinear sensors.

It has been shown that PNL mixtures are separable using statistical independence, too [2,55,69,82], with the same indeterminacies as linear mixtures. However, if some weak prior information about the source signals is available, then the performance of the source separation algorithms may be significantly improved. Thus, these methods are not ‘blind’

but ‘semi-blind’ [55].

The answers to the above raised issues and summarization of the facts that, what are the advantages of applying ICA in ECG analysis is: [23,25,26,28,29]

(i) ICA is a form of BSS.

(ii) It can solve and handle time delay with ambiguity problems if available in the ECG data.

(iii) ICA assumes the ECG sources as linear mixtures.

(iv) ICA allows physicians an alternative higher order statistical technique for better ECG interpretation.

(v) It performs better in terms of yielding a cleaned ECG signal as good as higher order digital filters.

(vi) ICA is better at recovering specific points on the ECG such as the R-peak, RR interval which is necessary for obtaining the heart rate.

2.1 Why are extensions to ICA required?

Independent component analysis is a multidimensional signal processing technique to separate signals from different sources into distinct components. Once separated, components classified as noise may be discarded and the remaining components used to reconstruct the pure signal. However, due to the imperfect nature of the ICA technique, substantial data may be lost, when the ‘noise’ components were removed or discarded [15,22,23,29,41,42].

An important structural aspect in the search for decompositions is to have the knowledge of the number of possible solutions for a problem, as well to find the number of indeterminacies involved in solving a problem [60]. The knowledge of this is essential, without which the result of any ICA or ISA algorithm cannot be compared with other possible and available solutions, as well the BSS would not be possible. Clearly, given an ISA solution, invertible transforms in each component (scaling matrices L) as well as permutations of components of the same dimension (permutation matricesP) give again an ISA ofX. In some of the special cases of ICA, scaling and permutation are already in all indeterminacies given that at most one Gaussian is contained inX[65]. This is one of the important points of ICA, allowing its usage for solving BSS problems and hence stimulating many biomedical applications. It has been proved by some of the researchers that, also fork-ISA, scalings and permutations are nothing but the indeterminacies [71]

with some additional restrictions imposed on the model and are not the weak restrictions to the model [75]. Blind identification of multi-channel ECGs based on second-order statistics (SOS) is unique only to within multiplication by a unitary matrix, whereas with higher-order statistics, the multiplication ambiguity is reduced to the class of signed permutation matrices. Of course, higher-order statistics (HOS) do not offer a panacea for all problems and one may have to use statistics and tools other than SOS and HOS.

ICA methods using time structure are better suited to biosignal analysis than HOS methods as ICA can solve and handle time delay with ambiguity problems if available in

(9)

the ECG data. In addition, ICA with time structure is better at recovering specific points on the ECG such as theR-peak, RR interval which is necessary for obtaining the heart rate.

On the other hand, conventional ICA algorithms when applied to the signals like ECG do not consider the data asymmetry and time structure, as a result of which the extracted ICs may not correspond to the intuitively expected directions.

2.2 Conditions of existence for ICA solutions

From a statistical point of view, since the problem has no solution for Gaussian iid signals, three types of statistical priors are possible: sources are non Gaussian iid, sources are Gaussian temporally correlated, and sources are Gaussian non-stationary [55]. Initially, even if it was not clearly stated [53], the problem has been related to the non Gaussian iid model, and has been referred to as BSS. The non Gaussian property appears clearly in the Comon’s theorem [32] for linear mixtures. Consider, x¼As, is ap-dimension regular mixture of mutually independent random variables, with at most one Gaussian,y¼Bxhas mutually independent components if BA¼PD, where P and D are permutation and diagonal matrices, respectively [53]. This theorem is only based on the independence of random variables and the independence criterion involves either explicitly or implicitly higher order statistics (at least more than two), but does not take into account the order of the ECG samples. It means that the iid assumption is not required, which is just a default assumption. Consequently, ICA methods can be applied for iid as well as for non iid sources, but it does not work successfully for Gaussian sources [52,55]. Moreover, the theorem points out that in reference to ECG analysis, the ECG sources cannot be exactly estimated, but only up to a scale and a permutation. These are the typical indeterminacies of source separation in the analysis when dealing with linear mixtures [17,32,55].

Of course, without other assumptions, the problem of ECG processing cannot be solved directly. Basically, it is necessary in any ECG analysis to have certain conditions such as [55]:

(i) Priors or information about the nature of the ECG mixtures should be known.

(ii) It is very important to choose an appropriate separating transformBsuitable for the mixture transformA.

(iii) The sources as well as the source properties, even if weak, is necessary for determination of the separating transformB.

2.3 Indeterminacies of complex ICA

The main interest of source separation problem is its relevance in many application domains, providing the maximum information is more than the multidimensional observations [53]. In the simplest case, this diversity is spatial and is obtained by using many sensors. Thus, for providing efficient solutions, as is required for any estimation problem, one has to choose carefully the following ingredients:

(1) The model of mixture, i.e. what is the relationship between the observations,xand the sources,s.

(2) The criterion is the source independence relevant or not?

(3) Do the sources have other properties that could be used in temporal coloration, non-stationarity, sparsity, discrete values, etc?

(4) The optimization algorithm [55].

Given a complexn-dimensional random vectorx, a matrixwis called complex ICA of xifwxis independent as a complex random vector. It can be proved thatwandvare complex

(10)

ICA’s ofxif and only ifw²¹,v²¹, that is if they differ by right multiplication by a complex scaling and permutation matrix [6,15]. This is equivalent to calculating the indeterminacies of the complex BSS model. The task of linear BSS is to findAandsgiven onlyx.

An obvious indeterminacy of this problem is thatAcan be found only up to equivalence because of scalingLand permutation matrixP. Thus,x¼A L P P²¹L²¹sandP²¹L²¹s is also independent. Thus, it can be shown that under mild assumptions tosthere are no further indeterminacies of complex BSS. Various algorithms for solving the complex BSS problem have been proposed as discussed in [6]. It can be shown that many cases where complex BSS is applied can in fact be reduced to using real BSS algorithms. This is the case if either the sources or the mixing matrix are real. The latter, for example, occurs after Fourier transformation of signals with time structure. If the sources are real, then the above complex model can be split up into two separate real BSS problems: Re(x)¼Re (A)sand Im(x)¼Im(A)s. Solving both of these real BSS equations yieldsA¼Re(A)þi Im(A).

Of course, Re(A) and Im(A) can only be found except for scaling and permutation.

By comparing the two recovered source random vectors (using for example mutual information of one component of each vector), we can however assume that the permutation and then also the scaling indeterminacy of both recovered matrices is the same, which allows the algorithm to correctly put A back together. Similarly, also separability of this special complex ICA problem can be derived from the well-known separability results in the real case [6,15]. If the mixing matrix is known to be real, then again splitting up Equation (3) into real and complex parts yields, Re(x)¼ARe(s) and Im(x)¼AIm(s). ‘A’ can be found from either equation. If both real and complex samples are to be used in order to increase precision, those can simply be concatenated in order to generate a twice as large sample set mixed by the same mixing matrixA. In terms of random vectors, this means working in two disjoint copies of the original probability space [6].

2.4 Mutual information (MI) and non Gaussianity

Finally, for linear mixtures, one can derive other families of algorithms by considering particular factorizations of the inverse transform, which is a matrixG. Due to the scale indeterminacy, one can look for unit variance source and a usual idea is to factorize G¼Uw, wherewis a whitening matrix andUis an orthogonal matrix. After estimatingw such that, E [(wx)(wx)^T]¼I with second order statistics, one can estimate U by minimizing MI with higher order statistics [53].

3. Multidimensional ICA (MICA)

Multidimensional ICA [6,72] has first been introduced by Cardoso [15] using geometrical motivations. HyvParinen and Hoyer then presented a special case of multidimensional ICA which they called independent subspace analysis [74]; there the dependence within a k-tuple is explicitly modelled enabling one to propose better algorithms without having to resort to the problematic multidimensional density estimation. The crucial idea how to find stable higher dimensional source signal subspaces is to calculate not the overall rotation for each direction, but to decompose each rotation intoN(N21)/2 elementary rotations within all two-dimensional planes spanned by the coordinate axes [15]. If the used BSS algorithm is successful in separating the independent subspaces, the separability matrix should have a block structure that groups together one-dimensional ICA projections that belong to the same independent subspace. Thus, a reliable independent subspace should become clearly separated from every other subspace.

(11)

A random vectorYis called an independent component of the random vectorX, if there exists an invertible matrixA,and a decompositionX 5 A(Y, Z) such thatYandZ are stochastically independent. The goal of a general independent subspace analysis or MICA is the decomposition of an arbitrary random vectorXinto independent components.

IfXis to be decomposed into one-dimensional components, this coincides with ordinary independent component analysis. Similarly, if the independent components are required to be of the same dimensionk, then this is denoted by multidimensional ICA of fixed group sizekor simplyk-ISA. So 1-ISA is equivalent to ICA [75]. A different extension of ICA is given by topographic ICA [48], where dependencies between all components are assumed.

A special case of multidimensional ICA is complex ICA and can be presented where dependence is allowed between real-valued couples of random variables [6]. PCA provides a decomposition of a second-order vectorxas the sum of its projections onto the principal axis of its covariance matrix. The PCA components are geometrically orthogonal by construction and also statistically orthogonal or uncorrelated [24]. The more ambitious ICA approach is to look for components which are not necessarily geometrically orthogonal but are statistically independent (that is ‘more than statistically orthogonal’

since independence is much stronger than mere un-correlation). Standard ICA algorithms [23,28,29,42,56] that are applied to such a data set will produce one-dimensional source estimates that are as independent as possible, which means they will still find the right decomposition, but they are forced to select as well a decomposition of the actually multidimensional components. Thus standard ICA techniques [1,6,15,51,73] are able to find the multidimensional source signal subspaces, but they choose an (arbitrary) basis within these subspaces. A matrix w is called a k-multidimensional ICA of an n- dimensional random vectorxifwxisk-independent. Ifk¼1, this is the same as ordinary ICA. As usual MICA can solve the multidimensional BSS problemx¼As; whereAands is a k-independent n-dimensional random vector. Finding the indeterminacies of multidimensional ICA then shows that A can be found except for k-equivalence (separability), because ifx¼Asandwis a demixing matrix [6]. The problem is then to decide which of the one-dimensional source space estimates given by the algorithm should be grouped together. One may think of other ways of decomposing a Gaussian vector into independent components but it does not seem possible to define a decomposition that would be invariant [37,40]. In consequence, a special treatment is reserved to Gaussian components: they should not be split into independent subcomponents but rather all the Gaussian components (if any) should be kept pieced together as a unique component. The ICA model (1) can be rewritten as an additive, component-based model:

x¼Xⁿ

p¼1

xp: ð5Þ

This (admittedly trivial) rewriting of the original model calls for a change of standpoint. While model (3) is a multiplicative model reading: the observed vector is the product of a mixing matrixAby a source vectors, model (5) is an additive model reading:

the observed vector is a sum of ‘n’ one-dimensional independent vectors x₁,. . .,x_n. The new view of ICA introduced in Equation (5) above is ‘matrix-free’. Here, we discuss how it can be parameterized and why the appropriate parameterization is uniquely determined contrarily to the matrix-based parameterization of model (3). In the component model (5), the smallest subspace containing thepth component is referred to as the ‘component (sub) space’ for thepth component. This is indeed the one-dimensional linear subspace spanned

(12)

by thepth column ofA. By focusing on the spaces containing each component rather than on the columns ofA, we obtain the desired result of getting rid of the indeterminations of scale and sign. In some sense, we move from an algebraic description of a mixture in terms of a ‘mixing matrix’ to a geometric description in terms of ‘component spaces’. Using MICA, the noiseless linear MBSS problem, x¼As can be solved, where the mn- dimensional random vectorxis given, andAandsare unknown. In the case of MICA,sis assumed to bem-independent [72,75].

It is important to determine the conditions, in order to find firstly how ICA algorithms can be designed to extract one-dimensional components and then to find a suitable criterion for such ICA algorithms, so as to investigate how the algorithms behave, when the ECG processing of a mixture of multidimensional independent components is being carried out. To describe the ECG signals more reliably for better extraction of diagnostic features and morphological understanding, it is intuitively suggested that the existing ICA algorithms can be modelled best with multidimensional components, but has to be carried more rigorously as a future work.

The reasons quoted above are to be carried as ‘additional research’, since an implicit assumption of common ICA algorithms is that the input signals are symmetric around their mean values, since the mean of the ECG data is initially removed, and the sign of the ICs are not determinable. This means that when applying conventional ICA algorithms to the signals such as ECG without considering the data asymmetry, the extracted ICs do not correspond to the intuitively expected directions for extracting useful morphological features for disease classification.

3.1 Existing ICA algorithms and limitations

Signals such as ECG have probability densities that are close to Gaussian, while artefacts such as EEG, EMG, motion artefacts have non Gaussian distributions. ICA has some inherent limitations, due to which identifying the independent component (IC) of interest becomes difficult and highly subjective at times [1,22]. Before giving an interpretation for the extracted ICs, the isoelectric point of the heart needs to be further discussed. The ECG signals are not symmetric around their mean values, and the isoelectric point of the heart slightly differs from the mean values of the data. Meanwhile, an implicit assumption of common ICA algorithms is that the input signals are symmetric around their mean values, since the mean of the data is initially removed, and the sign of the ICs are not determinable. This means that when applying conventional ICA algorithms to signals such as the ECG without considering the data asymmetry, the extracted ICs do not correspond to the intuitively expected directions.

The different existing ICA algorithms [28,57] are Molgedey and Schuster ICA (MS- ICA), optimized generalized weighted estimator (OGWE), joint approximation diagonalization of eigen matrices (Jade), shifted block blind separation (SHIBBS), kernel-ICA and robust, accurate, direct independent component analysis algorithm (RADICAL) for removal of artefacts and noise from ECG. For assessing the actual independence of the components obtained from ICA/BSS, mutual information is used. MI leads to basic performance tests for any ICA problem and hence different ICA/BSS algorithms can be ranked by how well they perform i.e. whether they find the most independent components.

The limitation of ICA algorithms is that there is no guarantee that any particular algorithm can capture the individual source signals in its components. As discussed above, a number of ICA/BSS based ECG methods have been published till date, but with

(13)

insufficient background to enable the ECG practitioner to choose the best algorithm [28,57]. For the ECG data sets considered here, Jade algorithm is able to find smaller subspaces and can therefore be regarded as the more suitable algorithm in this analysis. By applying Jade algorithm to the orthogonal subspace allows us to extract the other components in the ECG data set and yields the best solution for better feature extraction.

The Jade algorithm is more appropriate for separating ECG data, because Jade yields lower matrix entries, i.e. higher reliability. The Jade separability matrix in contrast shows one-dimensional components and two-dimensional components [23,28,29,42,57].

3.2 Existing approaches of MICA for dependent component analysis

Generalizations of the ICA model that are required to include the dependencies of multiple one-dimensional components have been studied for a long time [75]. ISA in the terminology of multidimensional ICA has first been introduced by Cardoso [15] utilizing geometrical motivations and his model as well as the related but independently proposed factorization of multivariate function classes [45] is quite general, however no identifiability results applied to an actual data set were shown and discussed, and applicability to an arbitrary random vector was unclear [75]. Later on in Cardoso, in the special case of equal group sizes (k-ISA), uniqueness results were extended from the basic ICA theory [71]. Algorithmic enhancements [75] in this setting have been recently studied by [65,79]. Moreover, if the observation contains additional structures such as spatial or temporal structures, these may be used for the multidimensional separation [45,79].

Hyvarinen and Hoyer presented a special case ofk-ISA by combining it with invariant feature subspace analysis [45]. They model the dependence within ak-tuple explicitly and are therefore able to propose more efficient algorithms without having to resort to the problematic multidimensional density estimation. A related relaxation of the ICA assumption is given by topographic ICA [47], where dependencies between all components are assumed and modelled along a topographic structure (e.g. a two-dimensional grid). Bach and Jordan [10] formulate ISA as a component clustering problem, which necessitates a model for inter-cluster independence and intra-cluster dependence [75].

4. Methodology of corrections applied to extracted ICs

The idea of ICA is to recover the original signals by assuming that they are statistically independent.yis independent and it is desired to find howwmaximizes the independence ofy. To estimate one of the independent components (ICs), a linear combination of thex_i is considered. Let us denote this by,

y¼w^Tx¼Xⁿ

i¼1

wixi; ð6Þ

where the column vectorwis to be determined.

The independent components are determined by applying a linear transformation to the whitened data.

A given component can be obtained using the linear transformation

i_c¼b^T_i x; ð7Þ

wherei_cgives independent components which is an estimate of the original signal.bis an appropriate vector to reconstruct the independent components [1,50,76]. The ICA

(14)

algorithmic principle is

y¼EstðsÞ ¼wICAx ð8Þ

y¼wx: ð9Þ

After estimatingA,w 5 A²¹is computed and hence

s¼wx¼A²¹x: ð10Þ

Further,

s⁰ðtÞ ¼w xðtÞ; ð11Þ

where,s⁰(t)¼[s10(t). . .sm0(t)] ism-dimensional matrix of the independent components, which represents the estimation of the sourcess(t) andwis a demixing matrix [66]. Also,

w¼A⁰þ; ð12Þ

whereA⁰^þdenotes an estimate of the pseudo-inverse of the mixing matrixA. After the identification of artefacts and noise, the ECG signals are reconstructed with the components which have not been labelled as artefacts and noise. Thus, in the remixing matrix, the weight vectors of the identified artefactual and noise ICs have been set to zero, such that

x⁰ðtÞ ¼w^þ_cleans⁰ðtÞ; ð13Þ wherex⁰(t) is the reconstructed artefacts and noise free ECG signal fornsensors,w¹is the pseudo-inverse of the demixing matrixw, andw¹_cleanis then£mremixing matrix with its columns corresponding to the artefactual and noise ICs set to zero,s⁰(t) is the activation matrix of ICs as mentioned above [22,23,28,29,42,66]. Using ICA, it is required to derive a

‘clean ECG signal’ from the source ECG signal to find the content of noise and compare the proposed model with the existing methods and algorithms [3,5,6,11,15,36,58].

Independent component analysis method is proposed where additional knowledge about the time and statistical structure of the ECG sources is incorporated [23,29,42]. ICA yields equations describing the behaviour of the various ECG segments as a function of cardiac cycle time. ICA can be used to synthesize an ECG signal which is a realistic reproduction of the original signal, as well can control parameters such as QRS complex amplitude, rise-time, fall-time and the relative amplitudes of the P & T-waves [22,28]. The duration of each component will automatically track the selected heart rate in a non-linear fashion, reflecting its true behaviour. It has been reported in the literature that the ICA source estimates promote some inverse process, acting on the ECG observations. In a standard ICA model, this inverse function is a generalized linear function, i.e. a function of the pseudo-inverse of the mixing matrix, the so called un-mixing matrix and the inferred noise model [8,19,23,28,29,42]. The intrinsic indeterminacy of scales and signs is a well- known limitation of the ICA problem as defined in (1). By using ICA, we are estimating the heart sources and the mixing matrix corresponding to the body volume conductor at the same time. However, due to the non-homogeneity of the body volume conductor the original heart signals are not attenuated with the same amount in all directions.

5. ICA parameterization

Based on trivial properties of convolution, some common signal parameters also fulfil the ICA mixing model, given that the original ECG measured signals comply with it. By ECG

(15)

signal parameterization [1,22,70], it is meant that the construction of a new ECG signal is possible from any local or global properties, i.e. parameters of the original ECG signal.

These properties may be related toa prioriknown features of the ECG sources. Even with proper ECG parameterizations, it may sometimes be hard or impossible to make an ICA algorithm converge, because of missing or otherwise bad ECG data due to, for example, bad electrode contacts. It is also otherwise possible that the ICA does not converge, or that several runs of ICA are needed.R-wave reflects the intensity and direction of progressing ventricular heart muscle depolarization, i.e. contraction. At times, it is desired that the independent components of the parameterization of ECG signals [70] is to be done that would display diagnostic information carried by the parameters more clearly than what is observable from the original ECG measurements or their ICs. It is to be noted that in order for the ICA mixing model (1) to be valid, the parameters have to be derived from the ECG signal amplitudes. ECG parameters describing time durations, e.g. the time periods between consecutiveRwaves, do not comply with the ICA mixing model. In some cases, the appropriate ECG parameterization [22] may be such that it greatly decreases the amount of ECG data, thus lowering the computational burden on the subsequent ECG analysis. As readily stated by the mixing model (1), each and every measured sample is a linear combination of the samples of the source ECG signals at the same time. Therefore, a set of ECG signals, in which each signal consists of samples from the corresponding original ECG measurement at the time points ‘s’, i.e.

y⁰¼yðsÞ: ð14Þ

Equation (14) also satisfies the ICA mixing model of (1). Also, it can be shown that ECG signals constructed from time averages of the original measurement ECG samples, or signals resulting from FIR filtering, fulfil the ICA signal model, and may thus be subjected to ICA. However, it is to be noted that even if the ICA algorithm converged and produced ICs, averaging will make the components more Gaussian. Thus, one must pay special attention to the proper interpretability of the components, and avoid lengthy averaging windows [49,77].

In this paper, it is proposed that the ability of ICA for parameterization of ECG signals is required at times. As appropriate ECG signal parameters reveal desired signal properties, ICs of thus parameterized ECG signals may as well display the desired aspects of the question at hand much more clearly than the ICs of the original ECG signals, while possibly also reducing the amount of ECG data to a fraction of the original ECG signal.

The proposed ICA algorithm estimates the independence of the original signal and the optimization based on the estimation searches an optimum-restoring matrix. In the

Figure 2. ICA parameterization scheme for ECG segment identification.

(16)

proposed model, estimated results are in good agreement with physiological view. The new scheme also makes no assumptions about the non-stationarity of the components, a key assumption in several successful ICA methods, whose relevance to ECG is still unclear [21,25,26,36]. Figure 2 depicts ICA parameterization scheme for ECG segment identification and classification. The flow chart of parameterized ICA steps for morphological analysis of ECG waves is depicted in Figure 3.

In this analysis, an efficient approach for artefacts/noise identification with parameterization technique is discussed. The automatic removal of artefacts and noise in ECG signals based on correction procedures adapted to extracted ICs as discussed above, based on statistical thresholds including estimations of the noise statistical properties is a key requirement in ECG processing [39,42,66].

In this study, the re-parameterization of the standard ICA model is converted into a

‘component model’ which is geometric in nature and free of indeterminacies existing in standard ICA (sICA). More importantly, this new perspective of ECG processing discussed suggests an extension to a more general model of multidimensional independent components [9,15,52].

6. Results

In this study, case studies are carried out for ECG files taken from CSE database with and without parameterization. In these case studies, first the types of artefacts and noise are identified followed by their removal and obtaining the cleaned ECG signals using standard ICA. In this analysis, it is observed that the ECG signals and the uniformly distributed noise each define a one-dimensional source signal space, whereas the sin/cos and the Gaussian noise-subspaces are two-dimensional. For the ECG data set, Jade algorithm is applied first to find smaller subspaces for MICA analysis and can therefore be regarded as a basis algorithm for parameterized MICA analysis.

Sources using Jade assumed non Gaussian distribution in this analysis, which in a way is knowna priorionly, but to fulfil some special requirements in ECG morphological studies, certain extensions of ICA have been studied and implemented on CSE based ECG data sets.

However, if some weak prior information about the source ECG signals is available, even then the performance of the source separation algorithms may be significantly improved.

Figure 3. Flow chart of parameterized ICA for morphological analysis of ECG.

(17)

In fact, one often has priors on the ECG signals, but may not be always necessary. A natural idea of adding the priors in the proposed ECG model was for simplifying or improving the standard ICA separation methods. The Jade algorithm is more appropriate for separating ECG data, because Jade yields lower matrix entries, i.e. higher reliability.

In this analysis, it is observed that the ECG signals and the uniformly distributed noise each define a one-dimensional source signal space, whereas the sin/cos and the Gaussian noise-subspaces are two-dimensional. For the ECG data set, Jade algorithm is applied first to find smaller subspaces for MICA analysis and can therefore be regarded as a basis algorithm for parameterized MICA analysis. Using parameterization and MICA techniques the thresholds in this analysis is brought down to Kurtosis as 4.3 and for Varvar as 0.4, which indicates that the removal of noise and artefacts is faster in this study as compared to [42].

The simulation and graphical results are shown in the respective figures. The results have been obtained for number of points as 5000 and sampling frequency as 500 Hz.

Threshold criterions for kurtosis and variance of variance (Varvar) is obtained after testing 20 – 25, CSE database files as 4.3 and 0.4 after parameterization of ECG data which shows improvement in what is discussed in [42]. In the tables below, ICA1, ICA2 and ICA3 are the three extracted independent components. In this paper, 3-channel ECG data taken from CSE-database and each channel is 10 s duration, and sampling frequency of 500 Hz is used as reimplementation study and was first discussed in [42]. After applying ICA algorithm to the 3-channel ECG data, three independent components are obtained and then kurtosis of each independent component is calculated. A component whose modulus of kurtosis is below the threshold is marked as a noise component. Then each independent component is divided into 20 non-overlapping blocks, each of 0.5 s duration, as compared to 10 non- overlapping blocks, each of 1 s duration as discussed in [42]. The variance of these 10 segments in a group of 2 for each component is calculated, and then the variance of these 10 variance values is obtained as the parameter Varvar. The component whose Varvar value is above the pre-determined threshold is marked as an artefact component [42]. In the graphs shown, ‘mV’ refers to amplitude of an ECG waveform andn¼3.

6.1 Case studies

Case 1. Leads L1, L2 and L3 of File no-03 of CSE database for ECG are used in the simulation. This case deals with 3-channel ECG with baseline drift in the first two channels L1 and L2 and noise in the third channel L3.

From Tables 1 and 1(A), it is also observed that the value of kurtosis coefficients for ICA1 and ICA2 is increased, whereas ICA3, which represents the noise component in this case, is further reduced using PMICA technique. The Varvar coefficients of all three components ICA1, ICA2 and ICA3 are reduced using PMICA technique which shows its dominance over PsICA technique.

Table 1. The jKurtj and Varvar values for each of the three ICA components for Case 1 for parameterized standard ICA (PsICA).

Index ICA1 ICA2 ICA3 Thresholds

jKurtj 6.4660 29.6417 2.8092(Noise) 4.3

Varvar 0.0186 0.1190 0.0098 0.4

Thresholds for Kurtosis¼4.3 and Varvar¼0.4.

(18)

Analysis of this case indicates that channel L3 is a noise component, since it has jKurtj,4.3. It is apparent that channel-1 and channel-2 in Figure 4 has baseline wander and noise in the third channel L3 (Figure 5). It is obvious from Figure 6 and the Table 1, ICA3 is noise component since it hasjKurtj,4.3, hence component ICA2 is made zero.

Case 2. Leads AF, AL and AR of File no-04 of CSE database for ECG are used in the simulation. This case deals with 3-channel ECG with baseline drift in all three channels AF, AL and AR. Channel AF has BLW as well as noise (Figures 7 – 10).

Analysis of this case indicates that channel, AF and AL are noise components, since they havejKurtj,4.3 (Table 2). It is obvious from Figure 11 and the Table 3, ICA1 and ICA2 are noise components since it hasjKurtj,4.3, hence component ICA1 and ICA2 are made zero. ICA3 in this analysis appears to be undefined or in-deterministic (Figures 12 – 15).

The value of kurtosis coefficient for ICA3 obtained is a complex number, using PsICA algorithm and isjKurtj[Kurtosis(ICA3)]¼2.4599 þ 0.0023i(Indeterminacy situation).

The value of kurtosis coefficient for ICA3 obtained is also a complex number using PMICA algorithm and is jKurtj [Kurtosis (ICA3)]¼0.4647 þ 0.0017i (Reduced Indeterminacy situation).From Tables 3 and 3(A), it is observed that the value of kurtosis coefficient for ICA3 obtained using PMICA algorithm is less than compared to the value of kurtosis coefficient for ICA3 obtained using PsICA algorithm and the indeterminacy is reduced in PMICA in comparison to PsICA case. ICA3 in this analysis thus still appears to be undefined or in-deterministic, but the indeterminacy content is reduced. Here still indeterminacy exists but the complex value of kurtosis coefficient for ICA3 is less in Table 1A. ThejKurtjand Varvar values for each of the three ICA components for Case 1 for parameterized multidimensional ICA (PMICA).

jKurtj 8.7860 32.6597 1.9968(Noise) 4.3

Varvar 0.0113 0.1145 0.00088 0.4

Figure 4. 3-Channel ECG with channels L1 and L2 having base line wander and noise in channel L3.

(19)

PMICA as compared to PsICA. From Tables 3 and 3(A), it is also observed that the value of kurtosis coefficients for ICA1 and ICA2, which represents the noise component in this case, is further reduced using PMICA technique.

Case 3. Leads V1, V2, V3 of a CSE base data file of ECG are used in the simulation. This case deals with 3-channel ECG having base-line wander in channel V1.

Analysis of this case indicates that channel V2 is a noise component, since it has jKurtj,4.3. It is apparent that channel-1 in Figure 16 has baseline wander (Figure 17). It is obvious from Figure 18 and that Table 1 ICA2 is noise component, since it has jKurtj,4.3, hence component ICA2 is made zero (Figures 19 – 21).

Figure 5. Extracted independent components.

Figure 6. Reconstructed clean ECG signals.

(20)

Figure 7. Plot of variance and number of segments in ICA1 (Lead L1).

(21)

Table 2. Variance values of components ICA-1, ICA-2 and ICA-3 for PsICA.

S. no Segment number

Comp (ICA-1) L1 (variances)

1 Segment-1 0.4620 1.1304 0.9947

2 Segment-2 0.3653 1.0738 0.9003

3 Segment-3 0.2693 1.1475 0.9143

4 Segment-4 0.0184 0.3968 0.8585

5 Segment-5 0.2563 0.9340 0.9666

6 Segment-6 0.2439 1.1855 1.1314

7 Segment-7 0.2370 0.9896 1.1150

8 Segment-8 0.2585 0.9540 1.1173

9 Segment-9 0.0111 0.1995 0.9233

10 Segment-10 0.2343 1.2057 0.9684

Figure 10. Three-channel ECG having base line wanders.

Figure 11. Extracted independent components for Case 2.

(22)

Table 3. The jKurtj and Varvar values for each of the three ICA components for Case 2 for parameterized standard ICA (PsICA).

jKurtj 1.8093(Noise) 4.1024 (Noise) Complex Value 4.3

Varvar 0.0073 0.1002 3.0284e-030 0.4

Figure 12. Reconstructed clean ECG signals for leads AF, AL and AR with loss of morphology.

Figure 13. Plot of variance and number of segments in ICA1 (Lead AF).

Figure 14. Plot of variance and number of segments in ICA2 (Lead AL).

(23)

From Tables 4 and 4(A), it is also observed that the value of kurtosis coefficients for ICA1 and ICA3 is increased, whereas ICA2, which represents the noise component in this case, is further reduced using PMICA technique. The Varvar coefficients of all the three components ICA1, ICA2 and ICA3 are reduced using PMICA technique which in this case also shows its dominance over PsICA technique as discussed in Case 1 (Tables 5 and 6).

Table 3A. ThejKurtjand Varvar values for each of the three ICA components for Case 2 for parameterized multidimensional ICA (PMICA).

jKurtj 1.7645(Noise) 3.9457 (Noise) 0.4647þ 0.0017i 4.3

Varvar 0.0034 0.0987 0.0000 0.4

Figure 15. Plot of variance and number of segments in ICA3 (Lead AR).

Figure 16. 3-Channel ECG, with channel V1 having base-line wander.

(24)

7. Discussions

ICA is a technique employed to solve a BSS problem that starts the research of the ECG sources from the hypothesis of statistical independence among them. In the basic or instantaneous ICA model, no time delay is involved in the mixing process and the signals picked up by the electrodes are a linear mixture of the sources. Hence, it would be at least reasonable to propose a modification or version of standard ICA technique to extract the important features and segments of ECG signals affected by artefacts and noise, thereby removing indeterminacies if available in the ECG data. By introducing the concept of irreducible independent subspaces or components, generalization to a parameter free mixture Figure 17. Extracted independent components.

Figure 18. Reconstructed ECG signals for Case 3.

(25)

of an ECG model can be presented. However, the condition of utmost one Gaussian can be relaxed by including previous results on non Gaussian component analysis of an ECG data set. After introduction of this general model, joint block diagonalization with unknown block sizes can be discussed and on the basis of a simple extension to Jade can be algorithmically performed to better understand the subspace analysis.

The indeterminacies if available in the ECG data was analysed using a modified version of Jade algorithm to PMICA and PsICA for comparative studies, as suggested in this study.

Table 4. Values ofjKurtjand Varvar for each of the ICA components for Case 3 for parameterized standard ICA (PsICA).

Index ICA1 ICA2 ICA3

jKurtj 14.7901 2.2020 (noise) 5.0424

Varvar 0.0556 0.0037 0.0722

Thresholds for Kurtosis¼4.3 and Varvar ¼0.4.

Figure 19. Plot of variance and number of segmentsin ICA1 (Lead V1).

Figure 20. Plot of variance and number of segments in ICA2 (Lead V2).

(26)

In this paper, the author has discussed a method for monitoring ECG signals by ICA approach, a special case of blind source separation. The ECG, noise and artefacts components are separated from multi-channel acquisitions by exploiting the well-known Jade algorithm available for instantaneous (standard) ICA model. By reconsidering the notion of ICA, a more general perspective can be envisioned: multidimensional independent component analysis. It is based on a geometric parameterization, which is free of the indeterminacies of matrix-based modelling. MICA relies on the idea of vector- valued component rather than on scalar source signals. A canonical MICA decomposition is discussed as an invariant decomposition, which can be empirically computed by post- processing the results of ICA decomposition. This calls for further research on developing Table 4A. ThejKurtjand Varvar values for each of the three ICA components for Case 3 for parameterized multidimensional ICA (PMICA).

Index ICA1 ICA2 ICA3

jKurtj 15.6411 2.1985 (noise) 6.1426

Varvar 0.04856 0.00292 0.06988

Thresholds for Kurtosis ¼4.3 and Varvar ¼0.4.

Table 5. Variance values of components ICA-1, ICA-2 and ICA-3 for Case 3.

Comp (ICA-1) V1

(variances)

Comp (ICA-2) V2

(variances)

Comp (ICA-3) V3

(variances)

1 Segment-1 0.8808 0.2542 0.7303

2 Segment-2 0.8435 0.0923 0.5485

3 Segment-3 0.7535 0.0405 0.4939

4 Segment-4 0.8516 0.0766 0.5624

5 Segment-5 1.4928 0.0617 0.9606

6 Segment-6 0.9358 0.1178 0.6491

7 Segment-7 0.8656 0.1043 0.6314

8 Segment-8 0.8260 0.1084 1.3627

9 Segment-9 1.2138 0.1569 0.5509

10 Segment-10 1.1994 0.0682 0.5318

Figure 21. Plot of variance and number of segments in ICA3 (Lead V3).

(27)

tools for detecting the existence of independent components in the ICA/MICA context.

The important assumptions in applying ICA are: components are independent and have non Gaussian distributions. The ambiguities of ICA on the other hand are: energies of the extracted ICs cannot be determined and their order is undetermined. ICs of parameterized ECG signals displayed the desired aspects of the question at hand much more clearly than the ICs of the original ECG signals, while possibly also reduced the amount of ECG data to a fraction of the original ECG signal preserving morphology. In general, the usefulness of any blind decomposition method applied to ECG data is most likely relative to the fit between the assumptions of the algorithm. Therefore, it is important to consider the physiological basis of the delayed interactions between statistically defined independent components and the possible physiological significance of the derived components.

Therefore, the author concludes that employing the time structure information in ICA calculations can potentially improve artefacts and noise removal and enhance the overall classification.

The salient advantages and contributions of the work proposed in this paper would benefit towards ECG analysis in various ways if used in practice, such as if the extracted independent components were further cleaned by statistical measures like kurtosis and Varvar (Variance of variance). In this study, it is also demonstrated that the value of kurtosis and Varvar coefficients for the independent components, which represents the noise and artefacts components can be further reduced using parameterized multidimensional ICA (PMICA) technique.

In addition to this, the indeterminacies if available in the ECG data are also analysed using a modified version of Jade algorithm developed for PMICA and parameterized standard ICA (PsICA) for comparative studies, highlighting merits and demerits. The indeterminacies if available in the ECG data are reduced in PMICA in a better way as compared to PsICA. The simulation results obtained indicate that ICA can definitely improve signal– noise ratio (SNR), like other higher order digital filtering methods such as Kalman, Butterworth etc. with minimum reconstruction errors. ICA of properly parameterized provided independent components (ICs) displayed more clearly the investigated properties of the original ECG sources. Lastly, in this analysis, it is also confirmed that, re-parameterization of the standard ICA model results into a component model using MICA technique, which is geometric in spirit and free of indeterminacies which generally exist in sICA model.

Table 6. Variance values of components ICA-1, ICA-2 and ICA-3 for Case 2.

Comp (ICA-1) AF

(variances)

Comp (ICA-2) AL

(variances)

Comp (ICA-3) AR

(variances)

1 Segment-1 0.1427 0.7511 0.4479

2 Segment-2 0.1365 0.7051 0.4041

3 Segment-3 0.0961 0.6865 0.3286

4 Segment-4 0.1116 0.6245 0.3221

5 Segment-5 0.2819 0.4921 0.4880

6 Segment-6 0.2066 0.6134 0.3552

7 Segment-7 0.0952 0.8470 0.4024

8 Segment-8 0.0822 0.8873 0.4103

9 Segment-9 0.0000 0.0000 0.0000

10 Segment-10 0.0000 0.0000 0.0000