So you have a data set that you would like to prepare for analysis, and you want to learn Tableau Prep Builder. This chapter will take you through the basics of Tableau Prep Builder, including downloading the software, familiarizing yourself with Prep terminology, and using the software to produce a data set for the first time.
Tableau’s products are available for download through the company’s website. Figure 2-1 shows Tableau’s download page for Prep Builder, where you can select the version of the program you want to use.
Click the version number you want to use, select Download Tableau Prep <version number> at the top of the screen, and choose whether you want to download the Windows or Mac version of the software. After you make your selection, the file will download to your machine. Once the download is complete, you can run the file and follow the instructions given.
After downloading Prep Builder, you’ll see a new folder in your Documents folder, called My Tableau Prep Repository, which contains a lot of useful files and subfolders for storing the flows and data sets you’ll create within Prep. Figure 2-2 shows the view of my Documents in Finder on Mac.
For most users, Tableau Prep Builder is not free. The main way to get Prep is to purchase a Tableau Creator license, which is a monthly paid subscription that packages together Tableau Prep Builder, Tableau Desktop, and a single access to Tableau Server or Tableau Online (where Tableau hosts the Server instance). Tableau Prep Conductor is part of the Data Management add-on for Tableau Server and Tableau Online that can be purchased separately. All of these licensing options apply on a per-user basis.
There are 14-day trials of all of the full Tableau tools available from the main Tableau website. Educators and currently enrolled students can also get Tableau for free after a simple verification process.
After downloading and installing the application, you will be prompted to enter your licensing information or sign up for a trial (Figure 2-3).
When you load Prep Builder for the first time, you will be presented with the screen shown in Figure 2-4.
Let’s walk through this screen:
Once you are connected to your data set, you will be taken to the screen shown in Figure 2-5.
This screen is split into two parts:
In this section, we’ll dig into a few key steps of data preparation.
After connecting to a data source, select the input by dragging the data source from the Connections pane onto the canvas. By default, Prep Builder samples the input data set to speed up the process of building your workflow. When you run the workflow when an output is set up, all of the data will automatically be processed. The Input step is shown in Figure 2-6.
The three key parts of this screen (excluding the Connections pane) are:
The Clean step is where the majority of the work takes place, so getting familiar with it is key to your overall understanding of Prep Builder. To begin the Clean step, click the plus sign to the right of the selected input data, as shown in Figure 2-7.
There are four key parts to this screen, including the Connections and Flow panes. The other two are:
Once you have cleaned the data, you can output the data set to make it available for analysis in other tools (Figure 2-9).
The Output step gives you a number of choices about how to output your data. Part IV of this book is dedicated to outputs, with a focus on outputting to files in Chapter 19 and to databases in Chapter 20. The default option for the output is to save to a file in your My Tableau Prep Repository folder.
Whether you have completed your flow or just need to pause your work to continue later, saving the flow is key. To do so, select Save or Save As from the menu at the top of the screen (Figure 2-10).
The Save option is the same as Save As if the file hasn’t been saved before. If the file has been saved previously, use Save As to change the filename.
When saving the file, you’ll see a screen where you have two choices for the file type (Figure 2-11):
Tableau flow file (.tfl) saves the logic of the flow as well as the input and output file locations. Therefore, you’ll need access to the input and output locations to make use of this file format.
Packaged Tableau flow file (.tflx) saves not just the logic but also the input and output files.
Once you’ve selected a save option, you can process the flow by clicking the run icon (▷) either at the top of the Flow pane (to run every output) or on the output step icon (to run a single output), as shown in Figure 2-12.
This chapter has given you the basics to start working with a simple flow in Tableau Prep Builder. Subsequent chapters will build upon this knowledge. The steps and other screen layouts you’ll need to be familiar with are covered in the chapters on specific techniques:
Aggregate: Chapter 15
Join: Chapter 16
Union: Chapter 17
Now that you’ve gained some familiarity with the tool, you’re better equipped to navigate the terminology in the other chapters and throughout your use of Prep Builder.