Data Identifying, Gathering, & Importation Process Flashcards

1
Q

Process of Identifying Data

A

Step 1: Determine the information you want to collect.
Step 2: Define a plan for collecting Data.
Step 3: Determine your data collection methods.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Step 1: Determine the information you want to collect

A

By making decision regarding the specific information you need.
And the possible sources for this data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Step 2: Define a plan for collecting Data

A
  • Stablish a timeframe for collecting the data you need. Some of the data needs a time frame or a real-time track.
  • How much data is sufficient for a credible analysis. It can be the volume or a dataset (statistical/limit number).
  • Define the dependencies, risks, and mitigation plan.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Step 3: Determine your data collection methods

A
  • How you will collect the data from the data source you identified, being internal systems or social media sites.
  • Type of data.
  • Timeframe over which you need the data.
    Volume of data.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Data Quality

A

Working with data without considering how it measures against the quality metric can lead to failure.
In order to be reliable, data needs to be:
- Free of errors.
- Accurate.
- Complete.
- Relevant.
- Accessible.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Data Governance

A

Data Governance policies and procures relate to the usability, integrity, and availability of data
Issues pertaining to data governance include:
- Security.
- Regulation.
- Compliances.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Data Privacy

A

Loss of trust in the data used for analysis can compromise the process, result in suspect findings, and invite penalties.
Data privacy includes issues such as:
- Confidentiality.
- License for use.
- Compliance to mandated regulations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Data Sources

A
  • Primary Data.
  • Secondary Data.
  • Third-party Data.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Primary Data

A

Primary data refers to information obtained directly from the source, this can be from:
- Data from the organization’s CRM, HR, or workflow application.
- Data you gather directly through surveys, interviews, discussions, observations, and focus groups.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Secondary Data

A

Secondary data refers to information retrieved from existing sources, like:
- External databases.
- Research articles, publications, training material, internet searches, or financial records available as public data.
- Data collected through externally conducted surveys, interviews, discussions, observations, and focus groups.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Third-party Data

A

Third-party data refers to data purchased from aggregators who collect data from various sources and combine it into comprehensive datasets for purpose of selling the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Sources for Gathering Data

A
  • Databases.
  • Web.
  • Social media sites and Interactive platforms.
  • Sensor data.
  • Data exchange.
  • Surveys.
  • Census.
  • Interviews.
  • Observation studies.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What to use to Gather and import Data?

A
  • APIs.
  • Web Scraping.
  • Sensor Data.
  • Data Exchange.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

APIs

A
  • Used for extracting data from a variety of data sources.
  • Used for Data validation. A Data analyst may utilize an API to validate postal addresses and zip codes.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Web Scraping

A
  • Used for downloading specific data from web pages based on defined parameters.
  • Used to extract data such as text, contact information, images, videos. Podcasts, and product items from a web property.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q
A