Intro to DataScience Flashcards
What is Data Science?
Application of computational and statistical techniques on data to gain insight
What is the difference between data and information
Data is unusable until organized, however information in the result of processed data when put into context
Explain what computer programming is
Create a sequence of instruction capable of automating a system to performing specific task
What are the main features of using IDE?
Availability of tools to test and debug
What does Jupyter Notebook stand for?
Julia, Python, and R
Whats the difference between structured and ustructured data?
Structured data is organized with predetermined set of rules.
Unstructured data is sets of data where it is difficult to determined predetermined sets of rules to organize
XML vs JSON, which takes less storage? and what do they stand for?
JSON, as they don’t use end tags
Java Script Object Notation
Extensible Markup Language
What are the properties of a list, tuple, sets, and dictionaries
Lists: Ordered, changeable, allow duplicate
Tuple: Ordered, unchangeable, allow duplicate
Set: Unordered, Unchangeable, Unindexed, no duplicate (Unchangeable but you can add
or remove items)
Dictionary: Unordered, changeable, no duplicate (Ordered as of python 3.7, patch 3.6 and earlier stil consider it to be unordered)
Types of data structures that can contain different datatypes
Lists and Dataframes
How to access components of a list?
Using the $ sign or [[ ]]
How to find the length of a string in R?
nchar()
What is a database
Collection of data stored in a computer system
What does a DBMS allow us to do?
Store, Query, Update, Manage, Control access to
Advantage of using a DBMS
Store massive amounts of data
Access to multiple users
Concurrency
Efficient Manipulation
What are iterations?
Command to order the computer to run the same commands repeatedly
What are the 3 main types of iterations?
for, while, repeat
Can you rewrite a for loop with a while loop?
Yes, it also works vice versa
Can you rewrite a while loop with repeat?
Yes, but the converse is not true
What is debugging?
Task of fixing problems in our code
State 3 Condition handling tools
withCallingHandlers()
tryCatch()
try()
State 3 debugging tools
traceback()
options()
browser()
What is defensive programming
Strategy of making a code fail in a well defined manner
State the Fail-fast principle
Avoid Functions with non-standard evaluation
Avoid Functions that return different output based on the input
What objects are mutable?
List and Dictionaries
Does aliasing work the same way in both Python and R?
R uses a copy-on-modify strategy, while Python rewrites the original copy
What is the difference between nums.sort() and sorted(nums)
nums.sort() alters the original list into a sorted list
sorted(nums) only display the sorted version of the list without storing it to any variable
In OOP, what is a class?
Type of object that allows us to bundle data and functionality together
Attributes are attached to maintain the state, and methods to modify
In OOP, What is encapsulation?
Bundling data and methods to restrict direct access of data to object
In OOP, What is inheritance
A child class is based on the parent class and has access to methods from the parent class