Abstract

Data mining has become the fastest growing topic of interest in business programs in the past decade. This book is intended to first describe the benefits of data mining in business, describe the process and typical business applications, describe the workings of basic data mining models, and demonstrate each with widely available free software. This second edition updates Chapter 1, and adds more details on Rattle data mining tools.

The book focuses on demonstrating common business data mining applications. It provides exposure to the data mining process, to include problem identification, data management, and available modeling tools. The book takes the approach of demonstrating typical business data sets with open source software. KNIME is a very easy-to-use tool, and is used as the primary means of demonstration. R is much more powerful and is a commercially viable data mining tool. We will demonstrate use of R through Rattle. We also demonstrate WEKA, which is a highly useful academic software, although it is difficult to manipulate test sets and new cases, making it problematic for commercial use. We will demonstrate methods with a small but typical business dataset. We use a larger (but still small) realistic business dataset for Chapter 9.

Keywords

big data, business analytics, clustering, data mining, decision trees, neural network models, regression models

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset