Unit 1 Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

Algorithms

A

A set of step-by-step instructions to solve a problem or complete a task.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Model

A

A representation of the relationships and patterns found in data to make predictions or analyze complex systems, retaining essential elements needed for analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Outliers

A

When a data point or points occur significantly outside of most of the other data in a data set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Quantitative Analysis

A

A systematic approach using mathematical and statistical analysis is used to interpret numerical data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Structured Data

A

Data is organized and formatted into a predictable schema, usually related tables with rows and columns.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Unstructured Data

A

This data type often includes text, images, videos, and other content that does not fit neatly into rows and columns.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

CSV/TSV

A

Commonly used format for storing tabular data as plain text, where either the comma or the tab separates each value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Data File Types

A

A computer file configuration is designed to store data in a specific way.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Data Format

A

How data is encoded so it can be stored within a data file type.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Data Visualization

A

A way of representing data in a readily understandable way makes it easier to see trends in the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Delimited Text File

A

A plain text file where a specific character separates the data values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Extensible Markup Language (XML)

A

A language designed to structure, store, and enable data exchange between various technologies.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Hadoop

A

An open-source framework designed to store and process large datasets across clusters of computers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

JavaScript Object Notation (JSON)

A

A data format compatible with various programming languages for two applications to exchange structured data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Jupyter Notebooks

A

A computational environment that allows users to create and share documents containing code, equations, visualizations, and explanatory text.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Nearest Neighbor

A

A machine learning algorithm that predicts a target variable based on its similarity to other values in the dataset.

17
Q

Neural Networks

A

A computational model used in deep learning that mimics the structure and functioning of the human brain’s neural pathways. It takes an input, processes it using previous learning, and produces an output.

18
Q

Pandas

A

An open-source Python library that provides tools for working with structured data is often used for data manipulation and analysis.

19
Q

R

A

An open-source programming language used for statistical computing, data analysis, and data visualization.

20
Q

Python Notebooks

A

Computational environment allows users to create and share documents containing code, equations, visualizations, and explanatory text.

21
Q

Recommendation Engine

A

A computer program that analyzes user input, such as behaviors or preferences, and makes personalized recommendations based on that analysis.

22
Q

Regression

A

A statistical model that shows a relationship between one or more predictor variables with a response variable.

23
Q

Tabular Data

A

Data that is organized into rows and columns.

24
Q

XLSX

A

The Microsoft Excel spreadsheet file format.

25
Q

Open Source

A

Refers to any software whose source code is made available free for any third party to review and modify.