EFFECT OF FEATURE SELECTION AND DATASET SIZE ON THE ACCURACY OF NAÏVE BAYESIAN CLASSIFIER AND LOGISTICS REGRESSION

EFFECT OF FEATURE SELECTION AND DATASET SIZE ON THE ACCURACY OF NAÏVE BAYESIAN CLASSIFIER AND LOGISTICS REGRESSION

  • The Complete Research Material is averagely 82 pages long and it is in Ms Word Format, it has 1-5 Chapters.
  • Major Attributes are Abstract, All Chapters, Figures, Appendix, References.
  • Study Level: BTech, BSc, BEng, BA, HND, ND or NCE.
  • Full Access Fee: ₦7,000

Get the complete project » Instant Download Active

ABSTRACT

Binary Logistics Regression and Naïve Bayesian classifier are two of the common classification modelling techniques that allow one to predict the category that a new observation belongs, on the basis of a training set of data containing observations (or instances) whose category membership is known. We studied the classification performances of the two linear classification under different feature (variable) selection criteria and dataset size conditions on a medical domain area were studied based on the datasets (breast cancer and heart diseases) obtained from the University of California, Irvine, online respiratory. The result indicated that logistics Regression for classification on relatively large datasets without the application of PCA (for variable selection) has the great accuracy (91.4%), while Naïve Bayesian classifier with PCA (for variable/ feature selection) tops the smaller dataset classification with an accuracy of 90.2%. These two accuracies are close enough and high enough, which is an indication of high relevance of their selections in solving classification problems on datasets from this kind of domain.


You either get what you want or your money back. T&C Apply







You can find more project topics easily, just search

Quick Project Topic Search