Journal of Applied Mathematics
Volume 2012 (2012), Article ID 258054, 13 pages
http://dx.doi.org/10.1155/2012/258054
Research Article

Applying Randomness Effectively Based on Random Forests for Classification Task of Datasets of Insufficient Information

Division of Computer and Information Engineering, Dongseo University, Busan 617-716, Republic of Korea

Received 20 July 2012; Revised 8 October 2012; Accepted 8 October 2012

Academic Editor: Hak-Keung Lam

Copyright © 2012 Hyontai Sug. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

Random forests are known to be good for data mining of classification tasks, because random forests are robust for datasets having insufficient information possibly with some errors. But applying random forests blindly may not produce good results, and a dataset in the domain of rotogravure printing is one of such datasets. Hence, in this paper, some best classification accuracy based on clever application of random forests to predict the occurrence of cylinder bands in rotogravure printing is investigated. Since random forests could generate good results with an appropriate combination of parameters like the number of randomly selected attributes for each split and the number of trees in the forests, an effective data mining procedure considering the property of the target dataset by way of trial random forests is investigated. The effectiveness of the suggested procedure is shown by experiments with very good results.