P1: Intro to R and 16S Data Flashcards

1
Q

what is R

A

a programming language primarily used for statistics, data analysis, and graphical representation of data

2
Q

why learn R

A
  • widely used in biology
  • ideal for working with large datasets
  • handles various data structures
  • R code is great for reproducibility
  • specialized packages/repositories
  • active community and support, free and open resource
  • highly valued skill
3
Q

why learn R - how is it ideal for working with large data sets

A

Excel does not work well with datasets containing thousands of data points

4
Q

why learn R: how is it ideal for working with large data sets - examples of large data sets

A
  • genomic
  • ecological
  • microbial
5
Q

why learn R - how is it great for reproducibility

A
  • script-based analysis
  • the script is a written protocol: run it on any computer with the exact same packages and you should get the exact same result
  • easy to keep track of
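A minimal sketch of script-based reproducibility, assuming the analysis includes a random step. The variable names are illustrative only. Fixing the seed is one common way to make a script return the same numbers on every run, and `sessionInfo()` records which package versions were used.

```r
# Fix the random seed so the "random" draw is repeatable.
set.seed(42)
first_run <- rnorm(5)

# Re-running the same lines with the same seed gives the same values.
set.seed(42)
second_run <- rnorm(5)

same_result <- identical(first_run, second_run)
print(same_result)

# Record the R version and loaded packages alongside the results.
session_details <- sessionInfo()
```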
6
Q

why learn R: specialized packages/repositories - define package

A
  • a bundle of software containing different functions
  • acts like a recipe for how to treat data
7
Q

why learn R: specialized packages/repositories - define repository

A

online sites that host packages for download (e.g., CRAN, Bioconductor)
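A short sketch of how packages and repositories fit together in practice. The package named in the commented-out install line is only an illustrative choice, not one this deck prescribes. The loaded package, `stats`, ships with R, so the example runs without downloading anything.

```r
# Downloading a package from a repository is a one-time step.
# It is commented out here because it needs an internet connection:
# install.packages("vegan")   # "vegan" is only an illustrative choice

# Loading an installed package makes its functions available in this session.
library(stats)                 # 'stats' ships with every R installation

values <- c(2, 4, 4, 9)
middle <- median(values)       # median() is a function provided by 'stats'
print(middle)                  # 4
```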

8
Q

why learn R: active community and support, free and open-source - what do we mean by open-source

A
  • anyone can write a function and upload it onto a repository as a package
  • this can only be done if it is written in R
9
Q

why learn R - how is it a highly valued skill

A
  • can be a significant advantage in academic, government, and industrial roles
  • it is a highly sought-after skill for data analysis
10
Q

R data structures

A
  1. vector
  2. list
  3. data frame
  4. matrix
  5. array
11
Q

R data structure - vector

A
  • simplest structure for holding data
  • an ordered sequence of values, all of the same type
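A small illustration of vectors, with made-up values and hypothetical variable names. It shows creating a vector with `c()`, indexing it, and what happens when types are mixed.

```r
# A numeric vector and a character vector, built with c().
read_counts <- c(120, 340, 15, 0)
sample_ids  <- c("S1", "S2", "S3", "S4")

n_samples    <- length(read_counts)   # 4
second_count <- read_counts[2]        # 340 (R indexes from 1)

# A vector holds a single type, so mixing types forces a conversion:
mixed <- c(1, "a")                    # both elements become character
print(class(mixed))                   # "character"
```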
12
Q

R data structure - list

A

a collection of different vectors, which can differ in type and length
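A sketch of a list, using invented sample data. Unlike a vector, each element can have its own type and length.

```r
# One list holding a character value, a numeric vector, and a logical value.
sample_info <- list(
  id         = "S1",
  counts     = c(120, 340, 15),
  is_control = FALSE
)

# Elements are accessed by name with $ or [[ ]].
sample_name  <- sample_info$id
total_counts <- sum(sample_info[["counts"]])   # 475
print(total_counts)
```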

13
Q

R data structure - data frame

A
  • tables with variables in columns and observations in rows; different columns can hold different types of information
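A minimal data frame example with invented sample metadata. The column names are assumptions for illustration. Each column is a vector, and columns may be of different types.

```r
# Three samples (rows) described by three variables (columns).
metadata <- data.frame(
  sample_id  = c("S1", "S2", "S3"),
  site       = c("gut", "soil", "gut"),
  read_count = c(1200, 850, 430),
  stringsAsFactors = FALSE
)

n_rows <- nrow(metadata)   # 3
n_cols <- ncol(metadata)   # 3

# Keep only the rows where site is "gut".
gut_samples <- metadata[metadata$site == "gut", ]
gut_reads   <- sum(gut_samples$read_count)   # 1630
print(gut_samples)
```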
14
Q

R data structure - matrix

A
  • similar to a data frame
  • but all columns need to hold the same type of data (homogeneous)
  • typically numerical
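A sketch of a numeric matrix, using a made-up count table of the kind that might come from 16S data. The row and column labels are hypothetical.

```r
# Two samples (rows) by three taxa (columns), filled row by row.
otu_counts <- matrix(
  c(10, 0, 5,
     3, 8, 1),
  nrow = 2, byrow = TRUE,
  dimnames = list(c("S1", "S2"), c("OTU1", "OTU2", "OTU3"))
)

one_cell   <- otu_counts["S2", "OTU2"]   # 8
per_sample <- rowSums(otu_counts)        # S1 = 15, S2 = 12
print(per_sample)
```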
15
Q

R data structure - array

A

groups of matrices (data with more than two dimensions)
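A small array example with placeholder numbers. Here the third dimension can be read as two stacked 2 x 3 matrices, which is an assumption made for illustration.

```r
# Twelve values arranged as 2 rows x 3 columns x 2 layers.
stacked <- array(1:12, dim = c(2, 3, 2))

array_dims  <- dim(stacked)      # 2 3 2
first_layer <- stacked[, , 1]    # the first 2 x 3 matrix
one_value   <- stacked[1, 2, 2]  # row 1, column 2, layer 2
print(first_layer)
```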

16
Q

data types in R

A
  • character: text
  • numeric: numbers that can be used in mathematical operations and can be converted into character/text
  • factor: categorical data, with values split into categories (levels)
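A brief sketch of the three data types, using invented values. `class()` reports the type, and `as.character()` converts a number to text.

```r
taxon     <- "Bacteroides"                    # character: text
abundance <- 12.5                             # numeric: supports arithmetic
site      <- factor(c("gut", "soil", "gut"))  # factor: categories

doubled    <- abundance * 2                   # 25
as_text    <- as.character(abundance)         # "12.5"
categories <- levels(site)                    # "gut" "soil"

print(class(taxon))       # "character"
print(class(abundance))   # "numeric"
print(class(site))        # "factor"
```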