Importing data into R Flashcards by Jared Lowe

function to load in CSV files

read.csv(“data.csv”, stringsAsFactors = FALSE)
data must be in your working directory, or the path must be specified. Strings as factors default is TRUE, sets strings as factors

How well did you know this?

Not at all

Perfectly

List the files in your working directory

dir()

How well did you know this?

Not at all

Perfectly

import tab delimited data

read.delim(x, sep = “/t” (space), header = TRUE)

How well did you know this?

Not at all

Perfectly

import any tabular data

read.table(x, sep = “”, header = FALSE, stringsAsFactors = TRUE, col.names = “”)

How well did you know this?

Not at all

Perfectly

which.min

returns the index of the smallest value in a vector

ex: cars[which.min(cars$MPG),] will return the value which the minimum MPG in the cars vector

How well did you know this?

Not at all

Perfectly

colClasses

an argument in the read.delim & read.table functions. Use this argument to specify the data class of the variables you are importing

ex: read.delim(x, sep = “”, colClasses = c(“character”, “logical”, “numeric”))

How well did you know this?

Not at all

Perfectly

Hadley’s data import package

readr

How well did you know this?

Not at all

Perfectly

read_csv()

readr version of read.csv
read_csv(“mydata”)
loads data as a “tibble”

How well did you know this?

Not at all

Perfectly

read.delim for readr

read_tsv (tab seperated value)(“potatoes.txt”, col_names = c(“type”, “weight”))

How well did you know this?

Not at all

Perfectly

col_types

argument to specify the variable classes in readr package

How well did you know this?

Not at all

Perfectly

read_delim

the main import function in the readr package. Similiar to read.table
Must specify the file and delim arguments
ex/ read_delim(“cars.txt”, delim = “/t”, col_names = c(“automaker”,”mpg”))

How well did you know this?

Not at all

Perfectly

skip

skip rows in your import functions.

ex: skip = 5 will skip the first 5 rows and then begin reading in data

How well did you know this?

Not at all

Perfectly

n_max

specifies the number of rows you want to read in, often used with skip

ex: read_delim(“cars.txt”, delim = “/”, skip = 2, n_max = 3)
skips the first two rows and reads in rows 3,4, and 5

How well did you know this?

Not at all

Perfectly

readxl

Haddley’s excel data import package

How well did you know this?

Not at all

Perfectly

function to list different sheets in excel: readxl package

excel_sheets()

How well did you know this?

Not at all

Perfectly

read_excel()

Study These Flashcards

import excel data into R

import data from the second sheet in an excel doc

Study These Flashcards

read_excel(“cars.xls”, sheet = 2)

pop_list

Study These Flashcards

Utilize lapply with readxl functions to read in all sheets in an excel file at once. Must specify the path as a separate argument because the excel_sheets argument only lists the sheets in the file, it does not list the file path

col_types

Study These Flashcards

argument in read_excel. specify the data type of columns “text”, “numeric”, “date”, “blank”

col_types = (“blank”)

Study These Flashcards

read_excel will skip the import of a column with “blank” as col_type.
ex: read_excel(“my.data.xlsx”, col_types = c(“numeric”, “blank”)) will only import column 1 as a numeric column from the excel document

read_excel(“data.xlsxl”, skip = 2)

Study These Flashcards

skip the first two rows of an excel document and then begin importing data

XLConnect

Study These Flashcards

a package that creates a bridge between r session and excel

XLConnct function that builds a bridge betwen R and excel

Study These Flashcards

loadWorkbook()

XLConnect function lists the available sheets in an excel workbook. Requires an XLConnect workbook object as first argument (created through loadWorkbook() function)

Study These Flashcards

getSheets()

XLConnect function loads worksheets in as data. Requires an XLConnect workbook object as first argument

readWorksheet()

arguments in readWorksheet( my_book, startCol = 1, endCol = 3, starRow = 1, endRow = 3)

import in data from the "my_book" workbook object starting in column 1 and ending in column 3. ie only import data from column 1:3 and row 1:3

XLConnect function to add new sheets to an excel workbook object (the bridget created through loadWorkbook())

createSheet( workbook object, "new_SheetName")

add data to an XLConnect workbook

writeWorksheet(workbook object, new data, "sheet_to_write_to)

save an XLConnect workbook to a new file

saveWorkbook(my_workbook, "filename.xlsx")

rename a sheet using XLConnect

renameSheet(my_workbook, sheet = 1, newName = "cars")

Importing data into R Flashcards

(30 cards)