0%

Book Description

Put your Haskell skills to work and generate publication-ready visualizations in no time at all

Key Features

  • Take your data analysis skills to the next level using the power of Haskell
  • Understand regression analysis, perform multivariate regression, and untangle different cluster varieties
  • Create publication-ready visualizations of data

Book Description

Every business and organization that collects data is capable of tapping into its own data to gain insights how to improve. Haskell is a purely functional and lazy programming language, well-suited to handling large data analysis problems. This book will take you through the more difficult problems of data analysis in a hands-on manner.

This book will help you get up-to-speed with the basics of data analysis and approaches in the Haskell language. You'll learn about statistical computing, file formats (CSV and SQLite3), descriptive statistics, charts, and progress to more advanced concepts such as understanding the importance of normal distribution. While mathematics is a big part of data analysis, we've tried to keep this course simple and approachable so that you can apply what you learn to the real world.

By the end of this book, you will have a thorough understanding of data analysis, and the different ways of analyzing data. You will have a mastery of all the tools and techniques in Haskell for effective data analysis.

What you will learn

  • Learn to parse a CSV file and read data into the Haskell environment
  • Create Haskell functions for common descriptive statistics functions
  • Create an SQLite3 database using an existing CSV file
  • Learn the versatility of SELECT queries for slicing data into smaller chunks
  • Apply regular expressions in large-scale datasets using both CSV and SQLite3 files
  • Create a Kernel Density Estimator visualization using normal distribution

Who this book is for

This book is intended for people who wish to expand their knowledge of statistics and data analysis via real-world examples. A basic understanding of the Haskell language is expected. If you are feeling brave, you can jump right into the functional programming style.

Table of Contents

  1. Title Page
  2. Copyright and Credits
    1. Getting Started with Haskell Data Analysis
  3. Packt Upsell
    1. Why subscribe?
    2. Packt.com
  4. Contributors
    1. About the author
    2. Packt is searching for authors like you
  5. Preface
    1. Who this book is for
    2. What this book covers
    3. To get the most out of this book
      1. Download the example code files
      2. Download the color images
      3. Conventions used
    4. Get in touch
      1. Reviews
  6. Descriptive Statistics
    1. The CSV library – working with CSV files
    2. Data range
    3. Data mean and standard deviation
    4. Data median
    5. Data mode
    6. Summary
  7. SQLite3
    1. SQLite3 command line
    2. Working with SQLite3 and Haskell
    3. Slices of data
    4. Working with SQLite3 and descriptive statistics
    5. Summary
  8. Regular Expressions
    1. Dots and pipes
    2. Atom and Atom modifiers
    3. Character classes
    4. Regular expressions in CSV files
    5. SQLite3 and regular expressions
    6. Summary
  9. Visualizations
    1. Line plots of a single variable
    2. Plotting a moving average
    3. Creating publication-ready plots
    4. Feature scaling
    5. Scatter plots
    6. Summary
  10. Kernel Density Estimation
    1. The central limit theorem
    2. Normal distribution
    3. Introducing kernel density estimation
    4. Application of the KDE
    5. Summary
  11. Course Review
    1. Converting CSV variation files into SQLite3
    2. Using SQLite3 SELECT and the DescriptiveStats module for descriptive statistics
    3. Creating compelling visualizations using EasyPlot
    4. Reintroducing kernel density estimation
    5. Summary
  12. Other Books You May Enjoy
    1. Leave a review - let other readers know what you think