Data Analysis with R Programming (Terms) Flashcards
A body of reusable code for performing specific tasks in R
Function (R)
A collection of columns containing data, similar to a spreadsheet or SQL table
Data frame
A collection of materials that can be shared with potential employers
Portfolio
A common way for employers to assess job skills and gain insight into how a candidate approaches common data-related challenges
Case study
A computer-generated file that records events from operating systems and other software programs
Log file
A declaration that if a certain condition holds, then a certain event must take place
Conditional statement
A directory containing all of a data analyst’s installed packages
Library
A document for running code and displaying the graphs and charts that visualize the code
R Notebook
A file format for making dynamic documents with R
R Markdown
A format for organizing and storing data
Data structure
A function that is completely contained within another function
Nested function
A general-purpose programming language
Python
A group of data elements of the same type stored in a one-dimensional sequence in R
Vector (R)
A group of R functions used for customizing a plot
Labels and annotations (R)
A language that translates data to improve readability
YAML
A line on a data visualization that uses smoothing to represent a trend
Smoothing line (R)
A piece of code added in an R Markdown file that is used to process, visualize or analyze data
Code chunk
A process for smoothing plots with a large number of points
GAM (generalized additive model) smoothing (R)
A process used for smoothing plots with fewer than 1,000 points
Loess smoothing (R)
A process used to make data visualizations in R clearer and more readable
Smoothing (R)
A programming language for macOS, iOS, watchOS, and tvOS
Swift
A programming language for web application development
PHP (Hypertext Preprocessor)
A programming language that provides structure for web pages and connects to hosting platforms
HTML5
A programming language used for web page design that controls graphic elements and page presentation
CSS (Cascading Style Sheets)
A programming language widely used to create enterprise web applications that can run on multiple clients
Java
A representation of a value in R that can be stored for later use
Variable (R)
A series of functions that splits data into subsets in a matrix of panels
Facets (R)
A short statement describing an idea or concept
Elevator pitch
A software application that brings together all the tools a data analyst may want to use in a single place
IDE (Integrated Development Environment)
A SQL function that returns records with matching values in both tables
INNER JOIN
A streamlined variation of data frames
Tibble (R)
A syntax for formatting plain text files
Markdown (R)
A system of packages in R with a common design philosophy for data manipulation, exploration, and visualization
Tidyverse (R)
A system of words and symbols used to write instructions that computers follow
Programming language
A text file that stores a data table by separating columns of data with tabs
TSV (Tab-separated values file)
A text file with a specific format, which enables the saving of textual data in an organized fashion
FWF (fixed-width file)
A tool in R for expressing a sequence of multiple operations, represented with “%>%”
Pipe (R)
A two-dimensional collection of data elements with rows and columns
Matrix
A unit of reproducible R code
Package (R)
A vector whose elements can be of any type
List
A visual property of an object in a plot
Aesthetic (R)
A way of standardizing the organization of data within R
Tidy data (R)
An extension of the C programming language that is used to create console games, such as those for Xbox
C++
An object that stores categorical data where the data values are limited and usually based on a finite group, such as country or year
Factor (R)
An object-oriented programming language for web application development
Ruby
An object-oriented programming language used to create games and mobile apps in the .NET open source developer platform
C#
An online archive with R packages, source code, manuals, and documentation
CRAN (Comprehensive R Archive Network) (R)
An open-source web application used to create and share documents that contain live code, equations, visualizations and narrative text
Jupyter Notebook
An operator that returns a logical data type
Logical operator
An operator used to assign values to variables and vectors
Assignment operator
An operator used to compare values, also known as a comparator
Relational operator
An operator used to perform basic math operations such as addition, subtraction, multiplication, and division
Arithmetic operator
An R function that makes changes to a dataframe separating and merging columns or creating new variables
mutate() (R)
An R function that returns a preview of the column names and the first few rows of a dataset
head() (R)
An R package in Tidyverse that creates a variety of data visualizations by applying different visual properties to the data variables in R
ggplot2 (R)
An R package in Tidyverse that offers a consistent set of functions to complete common data-manipulation tasks
dplyr (R)
An R package in Tidyverse used for data cleaning to make tidy data
tidyr (R)
An R package in Tidyverse used for importing data
readr (R)
An R package used to build interactive web apps with R code
Shiny (R)
Code that can be inserted directly into the text of an R Markdown file
Inline code
Code that is freely available and may be modified and shared by the people who use it
Open-source
Code that performs a particular function and is contained within code that performs a broader function
Nested
Documentation for an R package that describes the problem the package is designed to solve, explains how its functions can be used, and lists any dependencies on other packages
Vignette (R)
Four datasets that have nearly identical summary statistics but contain different plotted values
Anscombe’s quartet
Information needed by a function in R in order to run
Argument (R)
The geometric object used to represent data
Geom (R)
The process of giving instructions to a computer in order to perform an action or set of actions
Computer programming
The process of matching up a specific variable in a dataset with a specific aesthetic
Mapping (R)
The process of writing instructions to a computer in the syntax of a specific programming language
Coding
The set of markup symbols or codes used to create a webpage
HTML (Hypertext Markup Language)
A tool for showing only the data that meets a specific criteria while hiding the rest
Dashboard filter
The tendency for different people to observe things differently (Refer to Observer bias)
Experimenter bias
Building relationships by meeting people both in person and online (Refer to Networking)
Professional relationship building