Index
Symbols
- ! (not), Boolean Algebra
- " (double quotes), Creating a String
- # (pound sign), Comments
- $ (dollar sign)
- %% (remainder computation), Modular Arithmetic
- %/% (integer division), Modular Arithmetic
- %>% pipe, The Pipe
- %in%, filter(), %in%, Arithmetic and Recycling Rules
- & (ampersand) operators, Layout, Boolean Algebra
- && (ampersand) short-circuiting operator, Boolean Algebra
- ' (single quotes), Creating a String
- () (parentheses), in regex, Operator Precedence and Parentheses
- + operator
- / (slashes), Relative and Absolute Paths
- := operator, Combining with Other Tidyverse Packages
- <- (assignment operator), Coding Basics, filter(), Spaces
- = (equals sign), Common Mistakes, select()
- == (equals sign), Floating-Point Comparison, Arithmetic and Recycling Rules
- [ (left bracket)
- [[ (left brackets)
- [] (brackets), Pattern Basics
- \ (backslashes)
- \n (new line), Other Special Characters
- \t (tab), Other Special Characters
- \u (escape), Other Special Characters
- \U (escape), Other Special Characters
- ` (backticks), Practical Advice
- {} (brackets), str_glue()
- | (alternation), Pattern Basics
- | (or), Boolean Algebra
- |> (pipe), Pipes-Pipes
- || (short-circuiting operator), Boolean Algebra
A
- absolute paths, relative paths versus, Relative and Absolute Paths
- across() function
- aes() function, Plot Functions-Labeling
- aesthetic mappings, Aesthetic Mappings-Aesthetic Mappings
- aesthetics, Adding Aesthetics and Layers-Adding Aesthetics and Layers
- aggregation
- aliasing, SELECT
- all() function, Logical Summaries
- alternation (|), Pattern Basics
- analysis notebooks, Workflow-Workflow
- anchors, regex and, Anchors
- annotate() function, Annotations
- annotations, Annotations-Annotations
- anti-joins, Filtering Joins
- anti_join() function, Joins
- any() function, Logical Summaries
- Apache Arrow (see Arrow)
- apply() function, Apply Family
- arrange() function, arrange(), dplyr Equivalents
- array, JSON, Data Types
- Arrow, Introduction-Summary
- ASCII, Encoding
- assignment statements, Coding Basics
- as_date() function, From Other Types
- as_datetime() function, From Other Types
- axis ticks, Axis Ticks and Legend Keys-Axis Ticks and Legend Keys
B
- back reference, Grouping and Capturing
- Banker's rounding, Rounding
- Base R, Introduction-Summary
  - apply family of functions, Apply Family-Apply Family
  - dplyr verbs that are special cases of [, dplyr Equivalents-dplyr Equivalents
  - extracting columns from data frames with $ and [[, Data Frames
  - for loops, for Loops-for Loops
  - lists, Lists-Lists
  - plotting functions, Plots
  - regex applications, Base R
  - selecting a single element with $ and [[, Selecting a Single Element with $ and [[-Lists
  - selecting multiple elements with [, Selecting Multiple Elements with [-Exercises
  - subsetting data frames, Subsetting Data Frames-Subsetting Data Frames
  - subsetting vectors, Subsetting Vectors
  - tibbles, Tibbles
- books, generating with Quarto, Websites and Books-Websites and Books
- Boolean algebra, Boolean Algebra-%in%
- Boolean operations
- boxplot, defined, A Numerical and a Categorical Variable
- breaks argument, Axis Ticks and Legend Keys, Axis Ticks and Legend Keys
- .by argument, .by
C
- c() function, Coding Basics, Hierarchy
- caching, in Quarto, Caching-Exercises
- capturing groups, regex, Grouping and Capturing-Grouping and Capturing
- case_when() function, case_when()-case_when()
- categorical variables, A Categorical Variable
- ceiling() function, Rounding
- center, numeric summary functions for, Center-Center
- character classes (character sets), Pattern Basics, Character Classes-Summary
- chunks (see code chunks)
- closest() function, Rolling Joins
- code chunks
- code style, Workflow: Code Style-Summary
- coding basics, Coding Basics
- coll() function, Fixed Matches
- color, Replacing a Scale-Replacing a Scale
- ColorBrewer scales, Replacing a Scale-Replacing a Scale
- .cols, Selecting Columns with .cols
- column headers, data and variable names in, Data and Variable Names in the Column Headers
- column types
- column-oriented databases, Database Basics
- columns, Columns-relocate()
  - across() in functions, across() in Functions-Versus pivot_longer()
  - calling a single function, Calling a Single Function
  - calling multiple functions, Calling Multiple Functions-Calling Multiple Functions
  - data and variable names in column headers, Data and Variable Names in the Column Headers
  - data transformation, Columns-relocate()
  - filtering, Filtering
  - lengthening data in column names, Data in Column Names-Data in Column Names
  - modifying multiple columns, Modifying Multiple Columns-Versus pivot_longer()
  - mutate(), mutate()-mutate()
  - names, Column Names
  - relocate(), relocate()
  - rename(), rename()
  - select(), select()
  - selecting with .cols, Selecting Columns with .cols
  - separating strings into, Separating into Columns
  - working with many variables in column names, Many Variables in Column Names-Many Variables in Column Names
- command palette, Workflow: Code Style
- commas, spaces and, Spaces
- comments, Comments, Sectioning Comments
- communication, Introduction-Summary
- community, R, Investing in Yourself
- comparisons
- compatible types, Compatible Types
- complete() function, Complete
- compound keys, Primary and Foreign Keys
- conditional transformations, Conditional Transformations-Compatible Types
- consecutive identifiers, Consecutive Identifiers
- coordinate systems, Coordinate Systems
- copyright law, Copyright
- count() function, Counts-Counts
- counting, Counts-Counts
- covariation, Covariation-Two Numerical Variables
- cross joins, Cross Joins
- CSL files, Bibliographies and Citations
- CSS, Find Elements
- CSV files
- cumulative aggregates, Cumulative and Rolling Aggregates
D
- data entry, Data Entry
- data frame functions, Data Frame Functions-Data Masking Versus Tidy Selection
- data frames
- data import, Introduction-Summary
- data masking, tidy selection versus, Data Masking Versus Tidy Selection-Data Masking Versus Tidy Selection
- data point, The penguins Data Frame
- data tidying, Introduction-Summary
- data transformation, Introduction-Summary
- data types, Data Types
- data visualization, Introduction
- data.frame() function, List Columns
- databases, Introduction-Summary
- date-time
- date/times
- dates and times, Introduction-Summary
- dbListTables() function, DBI Basics
- dbplyr, dbplyr Basics
- dbReadTable() function, DBI Basics
- dbWriteTable() function, Load Some Data
- density plot, A Numerical Variable
- distinct() function, distinct()
- distributions
- double quotes ("), Creating a String
- dplyr, dplyr Basics
- duckdb, In This Book, Using dbplyr with Arrow
- durations, Durations
E
- EDA (see exploratory data analysis)
- elements, HTML
- embracing a variable
- empty groups, Factors and Empty Groups-Factors and Empty Groups
- encoding non-English text, Encoding
- equi joins, How Do Joins Work?
- escapes (\), Escapes, Escaping
- ethical issues, in web scraping, Scraping Ethics and Legalities-Copyright
- Excel, Excel-Formatted Output
- explicit missing values
- exploratory data analysis (EDA), Introduction-Summary
F
- faceting, Three or More Variables, Facets-Facets
- facet_grid() function, Facets
- facet_wrap() function, Three or More Variables
- factors, Introduction-Summary
- fct_collapse() function, Modifying Factor Levels
- fct_infreq() function, Modifying Factor Order
- fct_lump_*() functions, Modifying Factor Levels
- fct_recode() function, Modifying Factor Levels
- fct_reorder() function, Modifying Factor Order-Modifying Factor Order
- figures
- files, reading multiple
- filter() function, filter()-filter(), Filtering, dplyr Equivalents
- filtering joins
- fixed() function, Fixed Matches
- flags, regex, Regex Flags-Regex Flags
- floating-point comparisons, Floating-Point Comparison
- floor() function, Rounding
- for loops, for Loops-for Loops
- foreign keys, Primary and Foreign Keys
- FROM clause, SQL, SQL Basics, FROM
- full join, How Do Joins Work?
- function translations, Function Translations-Function Translations
- functional programming tools, Introduction
- functions, writing, Introduction-Summary
G
- geom, defined, Creating a ggplot
- geometric objects, in plots, Geometric Objects-Geometric Objects
- geom_bin2d() function, Two Numerical Variables
- geom_point() function, Creating a ggplot
- geom_smooth() function, Geometric Objects
- geom_text() function, Annotations
- geom_text_repel() function, Annotations
- ggplot, creating, Creating a ggplot-Creating a ggplot
- ggplot2, Creating a ggplot
- ggsave() function, Saving Your Plots
- Google, as source of information on R, Google Is Your Friend
- Google Sheets, Google Sheets-Authentication
- grammar of graphics
- GROUP BY clause, SQL, SQL Basics, GROUP BY
- groups
- group_by() function, group_by()
- guides (axes and legends), Axis Ticks and Legend Keys-Axis Ticks and Legend Keys
- guides() function, Legend Layout
H
- header row, Reading Data from a File
- headers, data and variable names in, Data and Variable Names in the Column Headers
- help, getting
- hierarchical data, Introduction-List Columns
- HTML, HTML Basics
- htmlwidgets, htmlwidgets
- html_attr() function, Text and Attributes
- html_element() function, Find Elements-Nesting Selections
- html_text2() function, Text and Attributes
I
- IANA time zones, Time Zones
- identifiers, consecutive, Consecutive Identifiers
- if_else() function, Practical Advice, if_else()
- images, saving plots as, Saving Your Plots
- implicit missing values
- IN operator, SQL, WHERE
- indirection, Indirection and Tidy Evaluation-Indirection and Tidy Evaluation
- inequality joins, Inequality Joins
- interquartile range (IQR), A Numerical and a Categorical Variable, Spread
- intervals of time, Intervals
- IQR() function, Spread
- is.na() function, is.na()-is.na()
- ISO8601 date standard, During Import
- iteration, Introduction-Summary
J
- janitor::clean_names() function, Practical Advice
- jitter, Position Adjustments
- joins, Introduction-Summary
  - basic, Basic Joins-Filtering Joins
  - cross joins, Cross Joins
  - filtering, Filtering Joins, Filtering Joins
  - how they work, How Do Joins Work?-Filtering Joins
  - inequality joins, Inequality Joins
  - keys, Keys-Surrogate Keys
  - mutating joins, Mutating Joins-Mutating Joins
  - non-equi joins, Non-Equi Joins-Overlap Joins
  - overlap joins, Overlap Joins
  - rolling joins, Rolling Joins-Rolling Joins
  - row matching, Row Matching-Row Matching
  - specifying join keys, Specifying Join Keys-Specifying Join Keys
  - SQL, Joins
- join_by() function, Specifying Join Keys-Specifying Join Keys
- JSON, JSON-Starting the Rectangling Process
- jsonlite package, jsonlite
L
- labels argument, Axis Ticks and Legend Keys-Axis Ticks and Legend Keys
- labels/labeling
- labs() function, Adding Aesthetics and Layers, Labels
- lapply() function, Apply Family
- layers, Introduction-Summary
- layout, Layout-Layout
- left join, How Do Joins Work?
- left_join() function, Mutating Joins-Mutating Joins
- legal issues, web scraping and, Scraping Ethics and Legalities-Copyright
- legend keys, Axis Ticks and Legend Keys-Axis Ticks and Legend Keys
- legend layout, Legend Layout-Legend Layout
- lengthening data, Lengthening Data-Data and Variable Names in the Column Headers
- letters in strings, Letters
- list() function, Lists
- list-columns, List Columns-Other Functions
- list.files() function, Listing Files in a Directory
- lists, Lists-List Columns
- literal characters, Pattern Basics
- locale-dependent functions, Locale-Dependent Functions
- log() function, Logarithms
- logarithms, Logarithms
- logical subsetting, Logical Subsetting
- logical summaries, Logical Summaries
- logical vectors, Introduction-Summary
- lubridate package, Prerequisites
  - (see also dates and times)
M
- magrittr package, The Pipe
- make_date() function, From Individual Components
- make_datetime() function, From Individual Components
- many-to-many joins, Row Matching
- mappings, aesthetic, Aesthetic Mappings-Aesthetic Mappings
- mathematical operators, spaces and, Spaces
- mean() function, Numeric Summaries of Logical Vectors, Center
- median() function, Center, Calling Multiple Functions-Calling Multiple Functions
- metacharacters, Pattern Basics
- min_rank() function, Ranks
- missing values, Introduction-Summary
  - Boolean algebra rules for, Missing Values
  - comparisons and, Missing Values
  - explicit missing values, Explicit Missing Values-NaN
  - factors and empty groups, Factors and Empty Groups-Factors and Empty Groups
  - fixed values, Fixed Values
  - ggplot2 and, Creating a ggplot
  - implicit missing values, Implicit Missing Values-Joins
  - is.na() and, is.na()-is.na()
  - last observation carried forward, Last Observation Carried Forward
  - NaN, NaN
- mode, Center
- models, Patterns and Models-Patterns and Models
- modular arithmetic, Modular Arithmetic
- mutate() function, Mutate Functions-Mutate Functions
- mutating joins, Mutating Joins-Mutating Joins
N
- n() function, Counts
- names
- NaN (not a number), NaN
- natural joins, Specifying Join Keys
- nesting selections, Nesting Selections
- non-English text, Non-English Text-Locale-Dependent Functions
- non-equi joins, Non-Equi Joins-Overlap Joins
- nonsyntactic names, Practical Advice
- NULL operator, SQL, WHERE
- null, JSON handling of, Data Types
- numbers, working with, Introduction-Summary
- numeric comparisons, Comparisons-is.na()
- numeric summaries, Numeric Summaries of Logical Vectors
- numeric summary functions, Numeric Summaries-With mutate()
- numeric transformations, Numeric Transformations-Cumulative and Rolling Aggregates
- numeric vectors (see numbers, working with)
- numerical variables, A Numerical Variable-A Numerical Variable
O
- objects
- observation, defined, The penguins Data Frame
- offsets, Offsets
- operator precedence, regex, Operator Precedence and Parentheses
- options, for customizing chunk output, Chunk Options, Global Options
- ORDER BY clause, SQL, SQL Basics, ORDER BY
- order of operations, Boolean algebra, Order of Operations
- ordered() factors, Ordered Factors
- outer joins, How Do Joins Work?
- outliers, variation and, Unusual Values-Unusual Values
- overlap joins, Overlap Joins
- overplotting, Position Adjustments
P
- parentheses (), in regex, Operator Precedence and Parentheses
- parquet files, Writing to a File, The Parquet Format-Rewriting the Seattle Library Data
- parse_double() function, Making Numbers
- parse_number() function, Making Numbers
- partial matching, Tibbles
- paths, relative versus absolute, Relative and Absolute Paths
- pattern control, regex, Pattern Control-Fixed Matches
- patterns, Patterns and Models-Patterns and Models
- PDF output, from Quarto, Other Important Options
- per-operation grouping, .by
- periods, Periods-Periods
- personally identifiable information, Personally Identifiable Information
- pipe (|>), dplyr Basics, The Pipe-The Pipe
- pivoting
- pivot_longer() function
- pivot_wider() function, How Does pivot_wider() Work?-How Does pivot_wider() Work?
- plot functions, Plot Functions-Labeling
- plots, saving as image, Saving Your Plots
- pmax() function, Minimum and Maximum
- pmin() function, Minimum and Maximum
- position adjustments, Position Adjustments-Position Adjustments
- positions, summary functions for, Positions
- presentations, Quarto and, Presentations
- primary keys
- projects, Projects-Relative and Absolute Paths
- purrr::list_rbind() function, purrr::map() and list_rbind()
- purrr::map() function, purrr::map() and list_rbind()
Q
- quantifiers, regex, Pattern Basics, Quantifiers
- quantile() function, Minimum, Maximum, and Quantiles
- Quarto, Introduction-Summary
  - basics, Quarto Basics-Quarto Basics
  - bibliographies and citations, Bibliographies and Citations-Bibliographies and Citations
  - caching, Caching-Exercises
  - chunk label, Chunk Label
  - chunk options, Chunk Options
  - code chunks, Code Chunks-Inline Code
  - document formats, Documents
  - document parameters, Parameters
  - figures, Figures-Other Important Options
  - formats other than HTML, Introduction-Summary
  - generating websites and books, Websites and Books-Websites and Books
  - htmlwidgets, htmlwidgets
  - interactivity options, Interactivity-Shiny
  - output options, Output Options, Other Formats
  - presentations, Presentations
  - self-contained documents, Self-Contained
  - shiny package, Shiny-Shiny
  - source editor, Source Editor
  - tables, Tables
  - troubleshooting, Troubleshooting
  - visual editor, Visual Editor-Visual Editor
  - workflow, Workflow-Workflow
  - YAML header, YAML Header-Bibliographies and Citations
- queries, SQL, SQL Basics
R
- R community, Investing in Yourself
- R scripts, as source of truth, What Is the Source of Truth?-What Is the Source of Truth?
- ranks/ranking, Ranks
- raw strings, Raw Strings
- readr package (see data import)
- readxl package, Prerequisites
- read_csv() function, Reading Data from a File, Other Arguments
- read_excel() function, Reading Excel Spreadsheets-Reading Excel Spreadsheets
- read_html() function, Extracting Data
- read_rds() function, Writing to a File
- read_sheet() function, Getting Started-Reading Google Sheets
- rectangling, Starting the Rectangling Process
  - (see also hierarchical data)
- recycling rules, Arithmetic and Recycling Rules
- regular expressions (regex), Introduction
  - anchors, Anchors
  - Base R applications, Base R
  - Boolean operations, Boolean Operations-Boolean Operations
  - character classes, Character Classes-Grouping and Capturing
  - checking work with, Check Your Work-Check Your Work
  - counting matches, Count Matches-Count Matches
  - creating a pattern with code, Creating a Pattern with Code-Exercises
  - detecting matches, Detect Matches-Detect Matches
  - escaping for matching literals, Escaping
  - extracting variables, Extract Variables-Extract Variables
  - fixed matches, Fixed Matches
  - flags, Regex Flags-Regex Flags
  - grouping and capturing, Grouping and Capturing-Grouping and Capturing
  - key functions, Key Functions-Extract Variables
  - operator precedence and parentheses, Operator Precedence and Parentheses
  - pattern basics, Pattern Basics-Pattern Basics
  - pattern control, Pattern Control-Fixed Matches
  - pattern details, Pattern Details
  - practical applications, Practice-Exercises
  - quantifiers, Quantifiers
  - replacing values, Replace Values
  - tidyverse applications, Tidyverse
- relationships, visualizing, Visualizing Relationships-Three or More Variables
- relative paths, absolute paths versus, Relative and Absolute Paths
- relocate() function, relocate(), dplyr Equivalents
- rename() function, rename()
- reprex, creating, Making a reprex-Making a reprex
- reserved words, SELECT
- right join, How Do Joins Work?
- rolling aggregates, Cumulative and Rolling Aggregates
- rolling joins, Rolling Joins-Rolling Joins
- round() function, Rounding
- rounding, Rounding
- row-oriented databases, Database Basics
- rows, Rows-distinct()
- row_number() function, Ranks
- RStudio
- RStudio Server, What Is the Source of Truth?
- rvest (see web scraping)
S
- sample size, aggregates and, Case Study: Aggregates and Sample Size-Case Study: Aggregates and Sample Size
- sapply() function, Apply Family
- scales, Scales-Zooming
- scale_color_manual() function, Replacing a Scale
- scaling, Adding Aesthetics and Layers
- scripts, Scripts-Saving and Naming
- sectioning comments, Sectioning Comments
- SELECT clause, SQL, SQL Basics, SELECT-SELECT
- select() function, select(), dplyr Equivalents
- SelectorGadget, Finding the Right Selectors
- selectors, Finding the Right Selectors
- self-joins, Cross Joins
- semi-joins, Filtering Joins
- separate_longer_delim() function, Separating into Rows
- separate_longer_position() function, Separating into Rows
- separate_wider_delim() function, Separating into Columns, Diagnosing Widening Problems-Diagnosing Widening Problems
- separate_wider_position() function, Separating into Columns
- separate_wider_regex() function, Extract Variables-Extract Variables
- set_names() function, Data in the Path
- shiny, Shiny-Shiny
- short-circuiting operators, Boolean Algebra
- single quotes ('), Creating a String
- slice_ functions, The slice_ Functions
- snake_case, What’s in a Name?
- spaces, code style for, Spaces
- spread of data, Spread
- spreadsheets, Introduction-Summary
- SQL, SQL-Other Verbs
- statements, SQL, SQL Basics
- statistical transformations, Statistical Transformations-Statistical Transformations
- str() function, Lists
- stringr package, Introduction-Summary
- strings, Introduction-Summary
  - creating, Creating a String-Exercises
  - creating date/times from, From Strings
  - creating many strings from data, Creating Many Strings from Data-str_flatten()
  - diagnosing widening problems, Diagnosing Widening Problems-Diagnosing Widening Problems
  - escapes, Escapes
  - extracting data from, Extracting Data from Strings-Diagnosing Widening Problems
  - JSON, Data Types
  - making numbers from, Making Numbers
  - non-English text, Non-English Text-Locale-Dependent Functions
  - raw, Raw Strings
  - separating into columns, Separating into Columns
  - separating into rows, Separating into Rows
  - str_c(), str_c()
  - str_flatten(), str_flatten()
  - str_glue(), str_glue()
  - subsetting, Subsetting
  - working with individual letters, Letters
- str_c() function, str_c()
- str_count() function, Count Matches-Count Matches
- str_detect() function, Detect Matches
- str_flatten() function, str_flatten()
- str_glue() function, str_glue()
- str_length() function, Length
- str_remove() function, Replace Values
- str_replace() function, Replace Values
- str_sub() function, Subsetting
- str_subset() function, Detect Matches
- str_view() function, Pattern Basics
- str_which() function, Detect Matches
- style (see code style)
- styler package, Workflow: Code Style
- subqueries, SQL, Subqueries
- subset() function, dplyr Equivalents
- subsetting
- sum() function, Numeric Summaries of Logical Vectors
- summaries, of logical vectors, Summaries-Logical Subsetting
- summarize() function, summarize()-summarize(), str_flatten()
- summary functions, Summary Functions
- surrogate keys, Surrogate Keys
T
- tables
- tabular data, The penguins Data Frame
- tapply() function, Apply Family
- tbl() function, dbplyr Basics
- terms of service, Terms of Service
- text, non-English (see non-English text)
- theme() function, Legend Layout-Legend Layout
- themes, Themes-Themes
- tibble() function, Data Entry
- tibbles, Tibbles
- tidy data (see data tidying)
- tidy evaluation, Indirection and Tidy Evaluation-Indirection and Tidy Evaluation
- tidy selection, data masking versus, Data Masking Versus Tidy Selection-Data Masking Versus Tidy Selection
- tidy tabular data, The penguins Data Frame
- tidyr, Prerequisites
- tidyr::complete() function, Complete
- tidyverse, regex applications for, Tidyverse
- time (see dates and times)
- time spans, Time Spans-Intervals
- time zones, Time Zones-Time Zones
- to_duckdb() function, Using dbplyr with Arrow
- tribble() function, Data Entry
- truth, source of, What Is the Source of Truth?-What Is the Source of Truth?
W
- web scraping, Introduction-Summary
  - dynamic websites, Dynamic Sites
  - examples, Putting It All Together-IMDb Top Films
  - extracting data, Extracting Data-Tables
  - finding HTML elements, Find Elements
  - finding the right selectors, Finding the Right Selectors
  - HTML basics, HTML Basics
  - IMDb top films example, IMDb Top Films-IMDb Top Films
  - legal/ethical issues, Scraping Ethics and Legalities-Copyright
  - nesting selections, Nesting Selections
  - Star Wars example, Star Wars-Star Wars
- websites, generating with Quarto, Websites and Books-Websites and Books
- WHERE clause, SQL, SQL Basics, WHERE-WHERE
- where() function, Selecting Columns with .cols
- widening data, Widening Data-How Does pivot_wider() Work?
- workflow
  - basics, Workflow: Basics-Summary
  - calling functions, Calling Functions
  - code style, Workflow: Code Style-Summary
  - coding basics, Coding Basics
  - comments, Comments
  - getting help, Workflow: Getting Help-Summary
  - object names, What’s in a Name?
  - projects, Projects-Relative and Absolute Paths
  - Quarto, Workflow-Workflow
  - scripts, Scripts-Saving and Naming
- working directory, Where Does Your Analysis Live?
- worksheets, Excel, Reading Worksheets-Reading Worksheets
- write_csv() function, Writing to a File
- write_rds() function, Writing to a File
- write_tsv() function, Writing to a File