This section briefly introduces the authors, the coverage of this book, the technical skills you'll need to get started, and the hardware and software requirements required to complete all of the included activities and exercises.
Practical Machine Learning with R gives you the tools to solve a wide range of business problems - starting by forming a good problem statement, selecting the most appropriate model to solve your problem, and then ensuring that you do not overtrain the model.
Brindha Priyadarshini Jeyaraman is a senior data scientist at AIDA Technologies. She has completed her M.Tech in knowledge engineering with a gold medal from the National University of Singapore. She has more than 10 years of work experience and she is an expert in understanding business problems, and designing and implementing solutions using machine learning. She has worked on several real data science projects in the insurance and finance domain. This book provides a great platform for her to share the knowledge she has gained over the past few years of working in data science and machine learning.
Ludvig Renbo Olsen, BSc in Cognitive Science from Aarhus University, is the author of multiple R packages, such as groupdata2 and cvms. With 4 years of R and python experience, including working as a machine learning researcher at the Danish startup UNSILO, he is passionate about creating tools and tutorials for students and scientists. Guided by Effective Altruism, he intends to positively impact the world through his career.
Monicah Wambugu is the lead data scientist at a financial technology company that offers micro-loans by leveraging on data, machine learning, and analytics to perform alternative credit scoring. She is a graduate student at the School of Information at UC Berkeley Masters in Information Management and Systems. Monicah is particularly interested in how data science and machine learning can be used to design products and applications that respond to the behavioral and socio-economic needs of target audiences.
With huge amounts of data being generated every moment, businesses need applications that apply complex mathematical calculations to data repeatedly and at speed. With machine learning techniques and R, you can easily develop these kinds of applications in an efficient way.
Practical Machine Learning with R begins by helping you grasp the basics of machine learning methods, while also highlighting how and why they work. You will understand how to get these algorithms to work in practice, rather than focusing on mathematical derivations. As you progress from one chapter to another, you will gain hands-on experience of building a machine learning solution in R. Next, using R packages such as rpart, random forest, Multiple Imputation by Chained Equations (MICE) and neuralnet, you will learn to implement algorithms including neural networks, decision trees, and linear and logistic regression. As you progress through the book, you'll delve into various machine learning techniques for both supervised and unsupervised learning approaches. In addition to this, you'll gain insights into partitioning the datasets and mechanisms to evaluate the results from each model and be able to compare them.
By the end of this book, you will have gained expertise in solving your business problems, starting by forming a good problem statement, selecting the most appropriate model to solve your problem, and then ensuring that you do not overtrain it.
If you are a data analyst, data scientist, or a business analyst who wants to understand the process of machine learning and apply it to a real dataset using R, this book is just what you need. Data scientists who use Python and want to implement their machine learning solutions using R will also find this book very useful. The book will also enable novice programmers to start their journey in data science. Basic knowledge of any programming language is all you need to get started.
Practical Machine Learning with R uses a practical and hands-on approach to teach all concepts. You will explore different machine learning algorithms with a project-based approach. By solving problems using concepts taught in the previous chapters, the book demystifies the complexity of machine learning and gives you the confidence to tackle even more challenging problems.
For the optimal student experience, we recommend the following hardware configuration:
You'll also need the following software installed in advance:
Code, database table names, file and folder names, file extensions, pathnames, URLs, user input, and Twitter handles are shown as follows: "The pre-loaded datasets of R can be viewed using the data() command"
A block of code is set as follows:
# Installing necessary packages
install.packages("mlbench")
install.packages("caret")
# Loading the datasets
data(package = .packages(all.available = TRUE))
New terms and important words are shown in bold. Words that you see on the screen, for example, in menus or dialog boxes, appear in the text like this: "The Help tab will display all the information about the dataset. "
We will be installing R, Rstudio and R packages.
The current installation process is for Windows and will be similar for other operating systems.
The status of the installation of the package can be viewed in the console.
Alternatively, the install.packages("packagename") can also be used.
Copy the code bundle for the class to the C:/Code folder (for Windows).
The code bundle for this book is hosted on GitHub at: https://github.com/TrainingByPackt/Practical-Machine-Learning-with-R.
We also have other code bundles from our rich catalog of books and videos available at https://github.com/PacktPublishing/. Check them out!