Data analytics Flashcards

1
Q

data analytics

A

process and practice of analyzing data to answer questions, extract insights, and identify trends

2
Q

data science

A

process of building, cleaning, and structuring datasets to analyze and extract meaning

3
Q

big data

A

data sets whose size or type is beyond the ability of traditional relational databases to capture, manage and process the data with low latency.
Sources of data are becoming more complex than those for traditional data because they are being driven by artificial intelligence (AI), mobile devices, social media and the Internet of Things (IoT). For example, the different types of data originate from sensors, devices, video/audio, networks, log files, transactional applications, web and social media — much of it generated in real time and at a very large scale.

4
Q

big data analytics

A

analyzing big data can ultimately fuel better and faster decision-making, modeling and prediction of future outcomes, and enhanced business intelligence

5
Q

types of analytics

A

https://online.hbs.edu/Documents/a-beginners-guide-to-data-and-analytics.pdf
Analytics is used to extract meaningful
insights from data that can drive decision-making and strategy formulation. There
are four types of analytics you can leverage
depending on the data you have and the type
of knowledge you’d like to gain.
1. Descriptive analytics looks at data
to examine, understand, and describe
something that’s already happened.
2. Diagnostic analytics goes deeper
than descriptive analytics by seeking
to understand the “why” behind what
happened.
3. Predictive analytics relies on historical
data, past trends, and assumptions to answer
questions about what will happen in the
future.
4. Prescriptive analytics identifies specific
actions an individual or organization should
take to reach future targets or goals.
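
As a rough illustration of the first and third types, the sketch below summarizes past values (descriptive) and extrapolates a straight-line trend one step ahead (predictive). The sales figures and the function names `describe` and `predict_next` are made up for this example; a linear trend is only one of many possible forecasting assumptions.

```python
from statistics import mean

# Hypothetical monthly sales figures (illustrative data, not from the source).
monthly_sales = [100, 110, 120, 130]


def describe(values):
    """Descriptive analytics: summarize what has already happened."""
    return {"mean": mean(values), "min": min(values), "max": max(values)}


def predict_next(values):
    """Predictive analytics: extend a least-squares linear trend by one period."""
    n = len(values)
    xs = list(range(n))
    x_bar, y_bar = mean(xs), mean(values)
    slope = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, values)) / sum(
        (x - x_bar) ** 2 for x in xs
    )
    intercept = y_bar - slope * x_bar
    # The next period has index n.
    return intercept + slope * n


summary = describe(monthly_sales)
forecast = predict_next(monthly_sales)
print(summary)
print(forecast)
```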

6
Q

data analytics in business

A

The main goal of business analytics is to extract meaningful insights from
data that an organization can use to inform its strategy and, ultimately,
reach its objectives. Business analytics can be used for:
* Budgeting and forecasting: By assessing a company’s historical
revenue, sales, and costs data alongside its goals for future growth,
an analyst can identify the budget and investments required to make
those goals a reality.
* Risk management: By understanding the likelihood of certain
business risks occurring—and their associated expenses—an analyst
can make cost-effective recommendations to help mitigate them.
* Marketing and sales: By understanding key metrics, such as lead-to-customer conversion rate, a marketing analyst can identify the
number of leads their efforts must generate to fill the sales pipeline.
* Product development (or research and development): By
understanding how customers reacted to product features in the
past, an analyst can help guide product development, design, and
user experience in the future.
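
The marketing and sales bullet can be made concrete with a small calculation: if the lead-to-customer conversion rate is known, the number of leads required is the customer target divided by that rate, rounded up. This is a minimal sketch; the function name `leads_needed` and the numbers are assumptions for illustration only.

```python
import math
from fractions import Fraction


def leads_needed(target_customers, conversion_rate):
    """Return how many leads are needed to reach a customer target.

    conversion_rate is the share of leads that become customers (0 < rate <= 1).
    """
    if not 0 < conversion_rate <= 1:
        raise ValueError("conversion_rate must be in the interval (0, 1]")
    # Fraction avoids floating-point surprises before rounding up.
    rate = Fraction(str(conversion_rate))
    return math.ceil(target_customers / rate)


# Hypothetical example: 50 new customers at a 4% conversion rate.
print(leads_needed(50, 0.04))
```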

7
Q

data ecosystem

A

The term data ecosystem refers to the
programming languages, packages,
algorithms, cloud-computing services, and
general infrastructure an organization uses to
collect, store, analyze, and leverage data. No
two organizations leverage the same data in
the same way. As such, each organization has
a unique data ecosystem.

8
Q

data life cycle

A

While the data ecosystem encompasses
everything that handles, organizes, and
processes data, the data life cycle describes
the path data takes from when it’s first
generated to when it’s interpreted into
actionable insights. This life cycle can be
split into eight steps: generation, collection,
processing, storage, management, analysis,
visualization, and interpretation.

9
Q

data privacy

A

Data privacy, also known as information privacy, is a
subcategory of data protection that encompasses the ethical and legal obligation to protect access to personally identifiable information (PII), which is any information that can be linked to a specific individual. Some examples of PII include full name, address, Social Security number, and passport number.

10
Q

data integrity

A

Data integrity is the accuracy, completeness, and quality of data as it’s maintained
over time and across formats. Preserving the integrity of your company’s data is a
constant process.
Threats to a dataset’s integrity include:
* Human error: For instance, accidentally deleting a row of data in a spreadsheet.
* Inconsistencies across formats: For instance, a dataset in Microsoft Excel that
relies on cell referencing may not be accurate in a different format that doesn’t
allow those cells to be referenced.
* Collection error: For instance, data collected is inaccurate or lacking
information, creating an incomplete picture of the subject.
* Cybersecurity or internal privacy breaches: For instance, someone hacks into
your company’s database with the intent to damage or steal information, or an
internal employee damages data with malicious intent.
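
One common way to notice threats such as an accidentally deleted row is to store a checksum of the dataset and compare it later. The sketch below uses SHA-256 from Python's standard library; the helper name `fingerprint` and the sample rows are assumptions, and a checksum only detects a change, it does not say what changed or restore the data.

```python
import hashlib


def fingerprint(rows):
    """Return a SHA-256 hex digest over all rows, in order."""
    digest = hashlib.sha256()
    for row in rows:
        digest.update(",".join(row).encode("utf-8"))
        digest.update(b"\n")  # row separator so row boundaries matter
    return digest.hexdigest()


# Hypothetical dataset and a copy with one row accidentally deleted.
original = [["id", "amount"], ["1", "100"], ["2", "250"]]
damaged = original[:-1]

stored = fingerprint(original)
print(fingerprint(original) == stored)  # unchanged data matches
print(fingerprint(damaged) == stored)   # missing row is detected
```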

11
Q

data analytics skills: critical thinking

A
  1. Critical Thinking
    If you’re interested in using data to solve business problems, you need to be adept at thinking critically about challenges and solutions. While data can provide many answers, it’s nothing without a human’s discerning eye.
    “From the first steps of determining the quality of a data source to determining the success of an algorithm, critical thinking is at the heart of every decision data scientists—and those who work with them—make,” Tingley says in the
    Harvard Online course Data Science Principles. “Data science is a discipline that’s built on a foundation of critical thinking.”
12
Q

data analytics skills: hypothesis formation and testing

A
  1. Hypothesis Formation and Testing
    At the heart of data and analytics is the desire to answer questions. The proposed explanations for these leading questions are called hypotheses, which must be formed before analysis takes place. An example of a hypothesis is, “I predict that a person’s likelihood of recommending our product is directly proportional to their reported satisfaction with the product.” You predict the data will show this trend and must prove or disprove the hypothesis through analysis. Without a hypothesis, your analysis has no clear direction.
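
A first check of the example hypothesis could be a correlation between satisfaction and likelihood to recommend. The sketch below computes a Pearson correlation by hand; the survey numbers are invented, and a strong correlation only supports the hypothesis, it does not by itself establish proportionality or causation.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equally long sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    if spread_x == 0 or spread_y == 0:
        raise ValueError("correlation is undefined for constant data")
    return cov / (spread_x * spread_y)


# Hypothetical survey responses (satisfaction 1-5, likelihood to recommend 0-10).
satisfaction = [1, 2, 3, 4, 5]
recommend = [2, 3, 5, 6, 9]

r = pearson(satisfaction, recommend)
print(round(r, 3))
```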
13
Q

data analytics skills: data wrangling

A
  1. Data Wrangling
    Data wrangling is the process of cleaning raw data in preparation for analysis. It involves identifying and resolving mistakes, filling in missing data, and organizing and transferring it into an easily understandable format.
    This is an important skill for anyone dealing with data to acquire because it leads to a more efficient and organized data analysis process. You can extract valuable insights from data more quickly when it’s cleaned and in its optimal viewing format.
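
The wrangling steps named above (resolving mistakes, filling in missing data, organizing the result) might look like this on a tiny dataset. The records, the field names, and the choice to fill missing ages with the rounded mean are all assumptions made for the sketch; real projects pick a fill strategy that suits the data.

```python
from statistics import mean

# Hypothetical raw records with stray whitespace, a duplicate, and a missing age.
raw = [
    {"name": " Alice ", "age": "34"},
    {"name": "bob", "age": ""},
    {"name": "Alice", "age": "34"},
    {"name": "Carol", "age": "40"},
]


def wrangle(records):
    """Normalize names, parse ages, drop duplicates, fill missing ages."""
    cleaned = []
    seen = set()
    for record in records:
        name = record["name"].strip().title()
        age = int(record["age"]) if record["age"].strip() else None
        key = (name, age)
        if key not in seen:  # keep only the first copy of each record
            seen.add(key)
            cleaned.append({"name": name, "age": age})
    known = [r["age"] for r in cleaned if r["age"] is not None]
    if known:
        fill = round(mean(known))  # assumption: mean imputation
        for r in cleaned:
            if r["age"] is None:
                r["age"] = fill
    return cleaned


tidy = wrangle(raw)
print(tidy)
```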
14
Q

data analytics skills: mathematical ability

A
  1. Mathematical Ability
    You don’t have to be a mathematician to become data literate, but strong math skills become increasingly important as you deal with more complex analyses.
    A seasoned data professional needs a solid understanding of statistics, probability, linear algebra, and multivariable calculus. Data scientists often call on statistical methods to find structure in data and make predictions, and linear
    algebra and calculus can make machine-learning algorithms easier to comprehend.
    If you’re not a data scientist or analyst, your work may not require you to understand the more complex mathematical concepts, but having a basic understanding of statistics can go a long way.
15
Q

data analytics skills: data visualization

A
  1. Data Visualization
    It’s crucial to know how to transform
    raw data into compelling visuals that
    tell a story. Rather than simply presenting a list of
    values to your stakeholders, it’s more
    effective to visually communicate data
    in a way that’s easily digestible. Some
    popular data visualization techniques
    that all business professionals should
    know include pie charts, bar charts,
    and histograms.
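
To show the idea of a bar chart without any plotting library, the sketch below counts category values and draws one text bar per category. The function name `text_bar_chart` and the sample data are assumptions; in practice a BI tool or a plotting package would produce the actual visual.

```python
from collections import Counter


def text_bar_chart(values):
    """Return one line per category, with a bar of '#' marks for its count."""
    counts = Counter(values)
    width = max(len(str(label)) for label in counts)
    return [
        f"{str(label).ljust(width)} | {'#' * count}"
        for label, count in sorted(counts.items())
    ]


# Hypothetical product categories from a list of orders.
orders = ["A", "B", "A", "C", "A", "B"]
chart = text_bar_chart(orders)
print("\n".join(chart))
```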
16
Q

data analytics skills: programming

A
  1. Programming
    Programming languages, like Python and R, are commonly used to solve
    complex statistical problems with data. Proficiency in a database querying
    language, like SQL, can also help you more easily extract and change data
    in a database.
    While programming skills are immensely valuable, they’re not necessary
    for beginners dabbling in data. It’s more important to focus on effectively
    analyzing and visualizing data to draw conclusions.
17
Q

data analytics skills: machine learning

A
  1. Machine Learning
    As artificial intelligence grows in popularity,
    machine learning is a highly valuable skill for
    professionals working with big data.
    Machine learning refers to the use of computer
    algorithms that automatically learn from and adapt
    in response to data. Some business applications
    of machine learning include risk management,
    performance analysis, trading, and automation.
    Even if you’re not responsible for writing code,
    knowing the basics of machine learning can
    help you gain a deeper understanding of your
    organization and boost efficiency through
    automation.
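
A very small example of an algorithm that "learns from data" is fitting a single weight with gradient descent. This sketch assumes a model of the form y = w * x and made-up training data; the learning rate and step count are arbitrary choices for the illustration, not recommendations.

```python
def fit_weight(xs, ys, learning_rate=0.01, steps=1000):
    """Learn w in y = w * x by gradient descent on the mean squared error."""
    w = 0.0
    n = len(xs)
    for _ in range(steps):
        # Gradient of mean((w*x - y)^2) with respect to w.
        gradient = (2 / n) * sum((w * x - y) * x for x, y in zip(xs, ys))
        w -= learning_rate * gradient
    return w


# Hypothetical training data that follows y = 2 * x exactly.
xs = [1, 2, 3, 4]
ys = [2, 4, 6, 8]

w = fit_weight(xs, ys)
print(round(w, 4))
```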
18
Q

tools and programming languages

A

SQL, Python
a BI tool such as Qlik Sense or Power BI

19
Q

Venn diagram

A

A Venn diagram is an illustration that uses circles to show the relationships among things or finite groups of things. Circles that overlap have a commonality while circles that do not overlap do not share those traits.

Venn diagrams help to visually represent the similarities and differences between two concepts.
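
The regions of a two-circle Venn diagram correspond to set operations. In the sketch below the two skill sets are invented for illustration: the overlap is the intersection, and the non-overlapping parts are the set differences.

```python
# Hypothetical skill sets for two roles.
analyst_skills = {"SQL", "statistics", "visualization"}
engineer_skills = {"SQL", "pipelines", "cloud"}

shared = analyst_skills & engineer_skills         # overlapping region
only_analyst = analyst_skills - engineer_skills   # left circle only
only_engineer = engineer_skills - analyst_skills  # right circle only

print(sorted(shared))
print(sorted(only_analyst))
print(sorted(only_engineer))
```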

20
Q

functions of data analytics tools

A

sorting and filtering
presentation and evaluation options
model calculations such as forecasts and scenarios
optimization algorithms
etc.
Humans handle the interpretation, assessment, and evaluation of the information, as well as the decision-making

21
Q

bits and bytes

A

bit: smallest unit of information
e.g. power off = 0, power on = 1

8 bits = 1 byte

one byte can represent 2^8 = 256 states
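
The arithmetic can be checked directly: 8 bits give 2^8 = 256 distinct patterns, so a byte holds the values 0 to 255. The helper name `to_bits` is an assumption for this sketch.

```python
BITS_PER_BYTE = 8
STATES_PER_BYTE = 2 ** BITS_PER_BYTE  # number of distinct bit patterns


def to_bits(value):
    """Return the 8-bit pattern of a byte value (0-255) as a string."""
    if not 0 <= value < STATES_PER_BYTE:
        raise ValueError("a byte holds values from 0 to 255")
    return format(value, "08b")


print(STATES_PER_BYTE)
print(to_bits(5))
print(to_bits(255))
```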

22
Q

bytes

A

bytes, or bit patterns, are converted into human-readable characters on output
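
A short sketch of that conversion, assuming the ASCII character encoding (other encodings map bit patterns to characters differently):

```python
# Two byte values and their bit patterns.
raw = bytes([72, 105])
patterns = [format(b, "08b") for b in raw]

# Decoding turns the bit patterns into readable characters.
text = raw.decode("ascii")

print(patterns)
print(text)
```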

23
Q

1 kilobyte (KB)

A

= 1024 bytes (binary convention; the SI kilobyte is 1000 bytes, and 1024 bytes is formally a kibibyte, KiB)

and 1 megabyte (MB) is 1024 kilobytes
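
A minimal conversion sketch using the 1024-based convention from this card; the constant and function names are assumptions for illustration.

```python
BYTES_PER_KB = 1024
BYTES_PER_MB = 1024 * BYTES_PER_KB  # 1,048,576 bytes


def to_kilobytes(num_bytes):
    """Convert a byte count to kilobytes (1 KB = 1024 bytes)."""
    return num_bytes / BYTES_PER_KB


def to_megabytes(num_bytes):
    """Convert a byte count to megabytes (1 MB = 1024 KB)."""
    return num_bytes / BYTES_PER_MB


print(to_kilobytes(2048))
print(to_megabytes(1048576))
```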

24
Q

structured data and SQL

A

data organized in a structure

SQL (Structured Query Language)
can be used to read out parts of the information that are based on rows and columns
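
As an illustration of reading out part of a row-and-column structure, the sketch below builds a small in-memory SQLite table and queries it. The table name `customers`, its columns, and the rows are invented for the example.

```python
import sqlite3

# In-memory database with a hypothetical customers table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (name TEXT, city TEXT, revenue REAL)")
conn.executemany(
    "INSERT INTO customers VALUES (?, ?, ?)",
    [("Ada", "Berlin", 1200.0), ("Bo", "Berlin", 800.0), ("Cy", "Hamburg", 950.0)],
)

# Select two columns (name, revenue) from the rows that match a condition.
rows = conn.execute(
    "SELECT name, revenue FROM customers WHERE city = ? ORDER BY revenue DESC",
    ("Berlin",),
).fetchall()
conn.close()

print(rows)
```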

25
Q

structured data include:

A

CSV, an exchange format
spreadsheet file formats, e.g. Excel
XML data is also in a structured format
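
Because CSV keeps data in rows and named columns, it can be read field by field. A minimal sketch with Python's standard `csv` module; the CSV content itself is invented for illustration.

```python
import csv
import io

# Hypothetical CSV content: a header row followed by data rows.
csv_text = "name,city,revenue\nAda,Berlin,1200\nBo,Berlin,800\nCy,Hamburg,950\n"

# DictReader uses the header row to name each column.
records = list(csv.DictReader(io.StringIO(csv_text)))
total_revenue = sum(float(record["revenue"]) for record in records)

print(records[0]["name"])
print(total_revenue)
```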

26
Q

unstructured data

A

audio data, images, videos, blogs

27
Q

decision support systems

A

MIS (management information system), DSS (decision support system), EIS (executive information system), DWH (data warehouse), BI (business intelligence)

28
Q

business intelligence

A

tools for distributing information within the company in the form of KPI dashboards, reports, and analyses

umbrella term that covers, e.g., DWH and others

29
Q

3 characteristics of big data

A

volume (large and complex)
variety (data types, sources, structured and unstructured)
velocity (speed of analysis, generation, and further processing)

30
Q

CEP

A

streaming and complex event processing