Some problems of outliers in circular data / Ali H.M. Abuzaid

Abuzaid, Ali H. M. (2010) Some problems of outliers in circular data / Ali H.M. Abuzaid. PhD thesis, University of Malaya.


    Abstract

    This study considers three problems of outliers in circular statistics. The first problem is an attempt to apply the standard outlier detection procedures for linear data sets by approximating circular variables with linear variables. This is possible for large values of the concentration parameter. A series of simulation studies is carried out to determine the values of the concentration parameter for which the von Mises distribution can be approximated by the normal distribution. The second is the problem of outliers in circular samples. Two numerical tests of discordancy are proposed to identify outliers. The test statistics are based on the sums of the circular distances and of the chord lengths, respectively, from the point of interest to all other observations on the circumference of a unit circle. The approximate distributions of the test statistics are derived. Simulation studies show that both statistics perform better than other known discordancy tests. In addition, a version of the boxplot for circular data sets is proposed. Via simulation studies, we show that the resistant criterion depends strongly on the concentration of the circular sample. The third problem is the existence of outliers in the circular regression model. Firstly, we propose a new definition of circular residuals which can be used to identify outliers using various graphical and numerical tests. Secondly, three numerical tests are developed to detect influential observations based on a row deletion approach. The first two are defined using the circular distance between the observed and fitted values, and their approximate distributions are derived. The third test extends the COVRATIO statistic from linear regression to the circular case. In general, the three numerical tests perform well in detecting influential observations. For illustration, we consider two real circular data sets, namely, the frogs’ data set and the wind direction data set.
In conclusion, the statistics proposed by this study are able to solve some problems of outliers in circular data.
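
The idea behind the two discordancy statistics for circular samples can be illustrated with a minimal sketch. The abstract does not give the exact definitions, normalisation, or critical values used in the thesis, so everything below is an assumption made for illustration: the circular distance is taken as the shorter arc between two angles, the chord length as the straight-line distance between the corresponding points on the unit circle, and the helper names (`circular_distance`, `chord_length`, `distance_sums`, `most_outlying`) are hypothetical. The sample angles are made up.

```python
import math


def circular_distance(a, b):
    """Shorter arc between two angles in radians; the result lies in [0, pi].

    Assumed definition: pi - |pi - |a - b||, with |a - b| reduced mod 2*pi.
    """
    diff = abs(a - b) % (2.0 * math.pi)
    return math.pi - abs(math.pi - diff)


def chord_length(a, b):
    """Straight-line distance between two points on the unit circle.

    Assumed definition: 2 * sin(d / 2), where d is the shorter arc.
    """
    return 2.0 * math.sin(circular_distance(a, b) / 2.0)


def distance_sums(angles, dist):
    """For each observation, sum its distance to every other observation."""
    return [
        sum(dist(a, b) for j, b in enumerate(angles) if j != i)
        for i, a in enumerate(angles)
    ]


def most_outlying(angles, dist=circular_distance):
    """Return (index, sum) of the observation with the largest distance sum.

    This only ranks candidates; deciding whether the candidate is discordant
    would require critical values, which are not reproduced here.
    """
    sums = distance_sums(angles, dist)
    i = max(range(len(angles)), key=sums.__getitem__)
    return i, sums[i]


# Illustrative sample (radians): four clustered angles and one far away.
sample = [0.10, 0.20, 0.30, 0.25, 3.00]
candidate_arc, _ = most_outlying(sample, circular_distance)
candidate_chord, _ = most_outlying(sample, chord_length)
```

Both distance choices pick out the observation at 3.00 radians as the candidate here. The arc-based sum grows linearly with angular separation, while the chord-based sum is bounded by 2 per pair, so the two can rank candidates differently in less clear-cut samples.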

    Item Type: Thesis (PhD)
    Additional Information: Thesis (Ph.D) -- Institut Sains Matematik, Fakulti Sains, Universiti Malaya, 2010
    Uncontrolled Keywords: Outliers (Statistics); Circular data
    Subjects: Q Science > Q Science (General)
    Q Science > QA Mathematics
    Divisions: Faculty of Science
    Depositing User: Miss Dashini Harikrishnan
    Date Deposited: 25 Sep 2014 09:41
    Last Modified: 25 Sep 2014 09:41
    URI: http://studentsrepo.um.edu.my/id/eprint/4277
