27.03.2013 Views

BUKU ABSTRAK - Universiti Putra Malaysia

BUKU ABSTRAK - Universiti Putra Malaysia

BUKU ABSTRAK - Universiti Putra Malaysia

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Diagnostic-robust Generalised Potentials for the Identification of Multiple High<br />

Leverage Points<br />

Assoc. Prof. Dr. Habshah Midi<br />

Norazan Mohamed Ramli and A.H.M.R. Imon<br />

Institute of Mathematical Research, University <strong>Putra</strong> <strong>Malaysia</strong>,<br />

43400 UPM Serdang, Selangor, <strong>Malaysia</strong>.<br />

+603-8946 6876; habshahmidi@gmail.com<br />

Mutual Information for Mixture of Bivariate Normal Distributions based on<br />

Robust Kernel Estimation<br />

Assoc. Prof. Dr. Habshah Midi<br />

Kourosh Dadkhah<br />

Institute of Mathematical Research, University <strong>Putra</strong> <strong>Malaysia</strong>,<br />

43400 UPM Serdang, Selangor, <strong>Malaysia</strong>.<br />

+603-8946 6876; habshahmidi@gmail.com<br />

247<br />

Science, Technology & Engineering<br />

Leverage values are being used in regression diagnostics as measures of influential observations in the<br />

X-space. Detection of high leverage values is crucial due to their responsibility for misleading conclusion about<br />

the fitting of a regression model, causing multicollinearity problems, masking and/or swamping of outliers etc.<br />

Much work has been done on the identification of single high leverage points and it is generally believed that the<br />

problem of detection of a single high leverage point has been largely resolved. But there is no general agreement<br />

among the statisticians about the detection of multiple high leverage points. When a group of high leverage<br />

points is present in a data set, mainly due to the masking and/or swamping effects the commonly used diagnostic<br />

methods fail to identify them correctly. On the other hand, the robust alternative methods can identify the high<br />

leverage points correctly but they have a tendency to identify too many low leverage points to be points of high<br />

leverages which are not desired. In this paper an attempt has been made to make a compromise between these<br />

two approaches. We propose an adaptive method where the suspect high leverage points are identified by robust<br />

methods and then the low leverage points (if any) are put back into the estimation data set after diagnostic<br />

checking. The usefulness of our newly proposed method for the detection of multiple high leverage points is<br />

studied by some well-known data sets and Monte Carlo simulations.<br />

Keywords: High leverage points, masking, group deletion, Robust Mahalanobis distance, minimum volume ellipsoid,<br />

diagnostic-robust generalised potentials, Monte Carlo simulation<br />

Mutual Information (MI) measures the degree of association between variables in nonlinear model as well<br />

as linear models. It can also be used to measure the dependency between variables in mixture distribution. The<br />

MI is estimated based on the estimated values of the joint density function and the marginal density functions of<br />

X and Y. A variety of methods for the estimation of the density function have been recommended. In this study,<br />

we only considered the kernel method to estimate the density function. However, the classical kernel density<br />

estimator is not reliable when dealing with mixture density functions which prone to create two distant groups in<br />

the data. In this situation, a robust kernel density estimator is proposed to acquire a more efficient MI estimate in<br />

mixture distribution. The performance of the robust MI is investigated extensively by Monte Carlo simulations.<br />

The results of the study offer substantial improvement over the existing techniques.<br />

Keywords: Mutual information, kernel density, minimum volume ellipsoid, minimum covariance determinant, outliers,<br />

mixture distribution, robust statistics

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!