EDA Flashcards
collection of raw facts from which conclusions may be drawn
Data
Who creates data?
◦ Individuals
◦ Businesses
CATEGORIES OF DATA
- STRUCTURED
- UNSTRUCTURED
STRUCTURED
- Databases
- Spreadsheets
UNSTRUCTURED
- Forms
- Images
- Audio
- Movies
What do individuals/businesses do with the data they collect?
They turn it into “information”
“____ is the intelligence and knowledge derived from data”
Information
___ analyze raw data in order to identify meaningful trends
Businesses
Value of Information to a Business
- Creating a competitive advantage
- Identifying new business opportunities
- Identifying patterns that lead to changes in existing business
- New services
- Targeted marketing campaigns
___ used depends on the type of data and its creation/usage rate.
Type of storage
Storage
* Data created by individuals/businesses must be ____
stored for further processing.
Storage Examples:
* __ : Digital camera, Cell phone, DVDs, Hard disk
Individuals
Storage Examples:
* __ : Hard disk, External disk arrays, Tape library
Businesses
Storage Model: An Evolution
Centralized & Decentralized
Mainframe computers
Centralized
- __ : __ (Data spread across many servers)
Decentralized: Client-server model
- __ : __ (Huge repositories)
Centralized: Storage networking
for order entry
Application
to store customer and product information
Database Management System (DBMS)
on which the application and database programs are run
Server/Operating System (OS)
database is stored on physical disks in the ___
Storage Array
The Core Elements
- Applications
- Databases – Database Management System (DBMS) and the physical and logical storage of data
- Servers/Operating systems
- Networks (LAN and SAN)
- Storage arrays
- A customer order is entered via the _ on a client
Application User Interface
- The client accesses the server over a _
Local Area Network
- A __ uses the operating system on the server to read and write this data to the physical location on a disk
DBMS
A dedicated __ provides the communication link between the server and the storage array, and transports the read/write commands and data between the server and the storage array
Storage Area Network
A ___ receives the read/write commands and data from the server and performs the necessary operations to store the data on the physical disks
storage array
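The order-entry flow in the cards above can be sketched as an ordered list of hops. This is only a study aid under stated assumptions: the names ORDER_ENTRY_PATH and trace_order are hypothetical, and the role text paraphrases the cards.

```python
# Hypothetical names; each tuple is (component, role in the order-entry flow).
ORDER_ENTRY_PATH = [
    ("Application user interface", "the customer order is entered on a client"),
    ("Local Area Network", "the client accesses the server"),
    ("DBMS", "uses the server's operating system to read and write the data"),
    ("Storage Area Network", "carries read/write commands and data to the array"),
    ("Storage array", "stores the data on the physical disks"),
]


def trace_order(path=ORDER_ENTRY_PATH):
    """Return the hops in order, numbered from 1."""
    return [
        f"{step}. {component}: {role}"
        for step, (component, role) in enumerate(path, start=1)
    ]


for line in trace_order():
    print(line)
```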
KEY REQUIREMENTS FOR DATA CENTER ELEMENTS
- Data Integrity
- Availability
- Security
- Performance
- Scalability
- Capacity
- Manageability (placed in the middle)
Challenges in Managing Information
- Exploding digital universe
- Increasing dependency on information
- Changing value of information
Multifold increase of information growth
- Exploding digital universe
The strategic use of information plays an important role in the success of a business
- Increasing dependency on information
Information that is valuable today may become less important tomorrow.
- Changing value of information
Some Constraints to Meeting the Requirements
- Cost
- Physical environment
- Maintenance and support
- Compliance – regulatory and legal
- Hardware and software infrastructure
- Interoperability and compatibility
- Applications run on __
Host
__ can range from simple laptops to complex server clusters
Hosts
PHYSICAL COMPONENTS of host:
- CPU
- Storage
- I/O device
Disk device and internal memory
Storage
I/O device
- Host to host communications (H2H)
- Host to storage device communications (H2SD)
- Host to host communications (H2H)
Network Interface Card (NIC)
- Host to storage device communications (H2SD)
- Host Bus Adapter (HBA)
I/O Devices
- Human interface
- Computer-computer interface
- Computer-peripheral interface
- Human interface
- Keyboard
- Mouse
- Monitor
- Computer-computer interface
- Network Interface Card (NIC)
- Computer-peripheral interface
- USB (Universal Serial Bus) port
- Host Bus Adapter (HBA)
Logical Components of the Host
- Application
- Operating system
- Interface between user and the host
Application
- Three-tiered architecture
Application
- Three-tiered architecture
- Application UI, computing logic and underlying databases
- Application data access can be classified as:
- Block-level access
- File-level access
Data stored and retrieved in blocks, specifying the LBA
- Block-level access
Data stored and retrieved by specifying the name and path of files
- File-level access
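A minimal sketch of the two access types, assuming a 512-byte block size and using an ordinary temporary file as a stand-in for a disk. The function names are hypothetical; real block-level access goes to a raw device rather than a file.

```python
import os
import tempfile

BLOCK_SIZE = 512  # assumed block size in bytes


def read_file_level(path):
    """File-level access: name the file by its path and read its contents."""
    with open(path, "rb") as f:
        return f.read()


def read_block_level(device_path, lba, block_size=BLOCK_SIZE):
    """Block-level access: seek to byte offset LBA * block size, read one block."""
    with open(device_path, "rb") as dev:
        dev.seek(lba * block_size)
        return dev.read(block_size)


# Demo: three blocks filled with A, B and C; the temp file stands in for a disk.
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write(b"A" * BLOCK_SIZE + b"B" * BLOCK_SIZE + b"C" * BLOCK_SIZE)
    demo_path = tmp.name

whole_file = read_file_level(demo_path)        # everything, addressed by path
block_one = read_block_level(demo_path, lba=1)  # only the second block
os.remove(demo_path)
```

The file-level call never mentions an address, while the block-level call never mentions a file name inside the "disk"; that difference is the point of the two cards.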
Controls the environment
Operating system
Resides between the applications and the hardware
Operating system
It is a systematic grouping of units according to their common characteristics.
Classification of data
Classification of data
Functions:
● Simplifies and makes data more comprehensible
● Condense the data
● Brings out the points of similarity and dissimilarity
● Comparison of characteristics
● Brings out the cause and effect relationship.
● Prepare the data for tabulation
On the basis of nature of Variable
● Quantitative data
● Qualitative data
● Discrete data
● Continuous data
● Chronological or temporal data
● Geographical or spatial data
On the basis of Source of Collection
● Primary data
● Secondary data
On the basis of Presentation
● Grouped data
● Ungrouped data
On the basis of content
● Simple Classification
● Manifold Classification
Classification of data according to __ characteristics such as age, weight, height, marks etc.
Quantitative data
quantitative
Classification of data according to __ characteristics such as sex, honesty, intelligence, literacy, colour, religion, marital status etc.
Qualitative data
Classification of data which takes exact numerical values (whole numbers)
Discrete data
Eg: Number of children in a family, shoe size
Discrete data
Classification of data which takes numerical values within a certain range
Continuous data
Eg: Weight of a one-month-old baby girl is given as 3.8 kg, but the exact weight could be anywhere between 3.2 and 5.4 kg
Continuous data
When data are classified or arranged by their time of occurrence, such as years, months, weeks, days etc.
Chronological or temporal data
Eg: Time series data
Chronological or temporal data
When data are classified by __ regions or location, like states, provinces, cities, countries etc.
Geographical or spatial data
geographical
data which is directly collected by the researcher/investigator
Primary data
Primary Quantitative Data:
- Questionnaires
- Structured Interviews
Primary Qualitative Data:
- Participant Observation
- Unstructured Interviews
data which is not directly collected by the researcher/investigator.
Secondary data
Secondary Quantitative Data:
- Official statistics
Secondary Qualitative Data:
- Letters, articles, newspapers
data which is presented in groups
Eg: Age: 20-25 (12 persons), 25-30 (8 persons)
Grouped data
data which is presented individually
Ungrouped data
Eg: Age: 28 years, 27 years, 23 years, 25 years, 26 years
Ungrouped data
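The ungrouped ages in the example above can be turned into grouped data with a short sketch. The helper group_into_classes is hypothetical, and it assumes each class includes its lower limit and excludes its upper limit, so an age of 25 falls in 25-30.

```python
from collections import Counter

ages = [28, 27, 23, 25, 26]  # ungrouped data from the example card


def group_into_classes(values, start=20, width=5):
    """Count values per class interval [lower, lower + width)."""
    counts = Counter()
    for value in values:
        lower = start + ((value - start) // width) * width
        counts[f"{lower}-{lower + width}"] += 1
    return dict(sorted(counts.items()))


grouped = group_into_classes(ages)
print(grouped)
```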
Classification of data with one characteristic
Simple Classification
Classification of data with more than one characteristic
Manifold Classification
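A small sketch of the difference, using made-up records purely for illustration (the field names and values are assumptions, not data from the deck). Simple classification counts by one characteristic; manifold classification counts by several at once.

```python
from collections import Counter

# Made-up illustrative records; not real data.
people = [
    {"sex": "F", "literate": True},
    {"sex": "F", "literate": False},
    {"sex": "M", "literate": True},
    {"sex": "M", "literate": True},
]

# Simple classification: one characteristic (sex).
simple = Counter(p["sex"] for p in people)

# Manifold classification: more than one characteristic (sex and literacy).
manifold = Counter((p["sex"], p["literate"]) for p in people)
```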
Incorporate computers to help manage data and achieve business objectives
Information System
Combinations of hardware, software, and telecommunications networks that people build and use to collect, create, and distribute useful data, typically in organizational settings
Information System
Interrelated components working together to collect, process, store, and disseminate information to support decision making, coordination, control, analysis, and visualization in an organization
Information System
is a raw fact and can take the form of a number or statement such as a date or a measurement.
Data
is data that have been processed so that they are meaningful.
Information
Components of Information System
hardware
software
networks
people
data
- Collection of related files
Database
- A computer-based database offers the advantage of powerful search facilities which can be used to locate and retrieve information many times faster than by manual methods.
Database
enables a user to locate, sort, update or extract records from the database.
Query
can be used to locate and display any records meeting a set of specified conditions.
Selection Query
There are two types of query called
selection queries and update queries
can be used to modify records in a variety of ways, for example according to a set of conditions specified by the user.
Update Query
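The two query types can be tried out with Python's built-in sqlite3 module. This is a sketch under stated assumptions: the customers table, its columns, and its rows are all hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # throwaway in-memory database
conn.execute(
    "CREATE TABLE customers "
    "(id INTEGER PRIMARY KEY, name TEXT, city TEXT, credit INTEGER)"
)
conn.executemany(
    "INSERT INTO customers (name, city, credit) VALUES (?, ?, ?)",
    [("Ana", "Manila", 100), ("Ben", "Cebu", 250), ("Cara", "Manila", 400)],
)

# Selection query: locate and display records meeting a condition.
manila_names = conn.execute(
    "SELECT name FROM customers WHERE city = ? ORDER BY name", ("Manila",)
).fetchall()

# Update query: modify the records that meet a condition.
cursor = conn.execute(
    "UPDATE customers SET credit = credit + 50 WHERE credit < ?", (300,)
)
updated_rows = cursor.rowcount

credits = [row[0] for row in conn.execute("SELECT credit FROM customers ORDER BY id")]
conn.close()
```

The selection query leaves the table unchanged; only the update query alters stored records.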
Combination of devices connected to each other through communication links to provide the channels for information to flow continuously between people.
Networks
Helps a business connect with its customers, suppliers, and collaborators.
Networks
Basic Network Components
(Component of Computer Networking)
- Internet
- Router
- Switch
- Printer
- Server
Benefits of Information System
- New Products and Services
- Information Storage
- Simplified Decision Making
- Behavioral Change
What can IS do for business?
- Store and analyze information
- Assist with making decisions
- Assist with business processes
Sophisticated and comprehensive databases, sometimes cloud-based, are used to store and analyze information pertaining to business functions, customers, transaction data, and both employee and customer activity.
Store and analyze information
Information systems can compare in- __ to external sources to, for example, compare internal insights to information about the general state of the economy or competitors’ financial reports. Decision-makers use these insights to review the adequacy and quality of their strategic decisions.
Assist with making decisions
house analyses
Information systems are used to _ for business functions. Business processes can be simplified, and unnecessary activities can be streamlined using information systems adapted to common business tasks, such as manufacturing, supply chain, and employee processes.
- Assist with business processes
develop value-added systems
Why are information systems key for pandemic response?
They provide essential evidence for taking action, making the most informed decisions possible, and adjusting policies to allow for better intelligence on actions to improve health.
Why are information systems key for pandemic response?
With properly disaggregated health data, it is possible to plan actions that reduce potential health inequities at all levels of care and facilitate the implementation of strategies to address such inequities.
What are the main areas to prioritize?
- Governance of information systems
- Technological infrastructure
- Automation and interoperability of electronic health records
- Data privacy, confidentiality, and security
- Data and information processing
- Knowledge management and sharing
- Innovation
Establish or strengthen mechanisms and processes connected with the effective use of information technology, and with the production, management, and processing of the data needed for response; infrastructure for Internet access; regulations and standards for the development or adoption of computer applications and databases; a process for capacity building, and the review and updating of legislation.
- Governance of information systems
Have secure technological infrastructure that meets the needs. At a minimum, this should allow for: data capture and analysis platforms; real-time dissemination of information; electronic health records; patient portals; and the establishment of appropriate communication channels for teleconsultation (workstations and Internet access with sufficient bandwidth for multimedia services). Better information systems for health mean better health outcomes and continuity of care, so that all people receive the best possible medical care over time.
- Technological infrastructure
Automate or boost the capacity of the various existing systems to communicate with each other; to accurately, effectively, and systematically exchange data; and to make immediate use of information in an appropriate format.
- Automation and interoperability of electronic health records
Strengthen technological infrastructure and regulations to improve data confidentiality, security, and privacy, prevent unauthorized access to and improper use of patient information, and ensure data integrity and compliance with standards and regulations on data protection.
- Data privacy, confidentiality, and security
Implement or strengthen the national platform for sharing health information in order to promote effective and rapid data collection, prioritization, and mapping, using an automated, systematic process that can be adapted to differing information needs.
- Data and information processing
Facilitate the participation of the scientific and academic community as well as civil society in real-time data production and analysis through timely access to accurate information in the appropriate format.
- Knowledge management and sharing
To the extent possible, incorporate tools and applications that can improve data access and availability, and real-time analysis and presentation of data, using different analytical approaches and developing predictive models that enable better planning, response, and decision-making in health services and systems.
- Innovation
- What is the primary reason for converting most data into a digital format?
a) To reduce data processing requirements
b) To lower peripheral costs
c) To slow down storage speed
d) Due to the lack of user demand
a
- Which of these best exemplifies unstructured data?
a) Spreadsheets
b) Databases
c) Images
d) Data tables
c
- What does the term “information” signify in the context of data management?
a) Raw data
b) Conclusions derived from data
c) Digital data format
d) User demand for data
b
- Why is information valuable to a business?
a) To generate raw data
b) To reduce data processing requirements
c) To identify new business prospects
d) To lower customer satisfaction
c
- What kind of access entails specifying the file’s name and path when retrieving data?
a) Block-level access
b) File-level access
c) Database-level access
d) Application-level access
b
- Which part of a host computer manages the interface between applications and hardware?
a) CPU
b) Network Interface Card (NIC)
c) Operating System
d) Storage Array
c
- What is the primary function of a Network Interface Card (NIC)?
a) Managing storage arrays
b) Handling input/output operations
c) Connecting the host to the internet
d) Managing the operating system
c
- Which of the following is NOT a crucial requirement for data center components?
a) Availability
b) Data Integrity
c) Hardware Compatibility
d) Scalability
c
- What is one of the challenges discussed regarding information management?
a) Decreasing data growth
b) Reduced reliance on information
c) Consistent information value
d) Compliance and regulatory issues
d
- In a data center infrastructure, what role is typically fulfilled by a Storage Area Network (SAN)?
a) Managing applications
b) Providing connectivity between clients and servers
c) Handling data entry
d) Storing data on physical disks
- What is the primary purpose of a Hard Disk Drive (HDD) in a host computer?
a) Managing network communications
b) Handling data storage
c) Running applications
d) Processing user interface commands
b
- Which component of a host computer handles user input like keyboard and mouse interactions?
a) CPU
b) Network Interface Card (NIC)
c) I/O Device
d) Host Bus Adapter (HBA)
c
- What is the primary role of a Network Interface Card (NIC)?
a) Managing storage arrays
b) Handling input/output operations
c) Connecting the host to the internet
d) Managing the operating system
c
- What type of data access involves specifying the Logical Block Address (LBA) when retrieving data?
a) Block-level access
b) File-level access
c) Database-level access
d) Application-level access
a
- Which component of a host computer manages the interface between user applications and the hardware?
a) CPU
b) Network Interface Card (NIC)
c) Operating System
d) Storage Array
c
- In a data center infrastructure, what role does a Storage Area Network (SAN) typically play?
a) Managing applications
b) Providing connectivity between clients and servers
c) Handling data entry
d) Storing data on physical disks
- Which component of a host computer is responsible for handling user input like keyboard and mouse interactions?
a) CPU
b) Network Interface Card (NIC)
c) I/O Device
d) Host Bus Adapter (HBA)
c
- Why is proper filing of information essential?
a) To maintain an organized appearance
b) To ensure information safety
c) To maximize storage space
d) To create duplicates
- In what two primary forms can information be stored?
a) Manual and digital
b) Physical and virtual
c) Hard copy and soft copy
d) Printed and handwritten
c
- What characterizes an effective filing system?
a) It occupies a significant amount of space
b) It is situated in a remote location
c) It is challenging to use
d) It can meet future requirements
- How is information electronically stored on a CD-ROM?
a) By writing on it with a pen
b) By burning data onto the disc
c) By using a scanner
d) By photocopying it
b
- What is the storage capacity of a standard DVD (Digital Versatile Disc)?
a) 1.4/2 MB
b) 650 MB
c) 17 GB
d) 1 TB
- In an information system, what does the “software” component refer to?
a. Physical devices
b. Computer programs and applications
c. People managing the system
d. Data storage methods
b
- Which type of query is used to modify records in a database?
a. Selection Query
b. Update Query
c. Retrieval Query
d. Data Query
b
- What do networks in information management provide channels for?
a. Continuous data flow
b. Storing information
c. Data encryption
d. Data deletion
a
- What is the benefit of having a computer-based database for retrieving information?
a. Slower data retrieval
b. Limited search capabilities
c. Faster data retrieval
d. Manual data retrieval
c
- How do information systems assist with making decisions in a business?
a. By comparing external insights to internal data
b. By automating all decision-making processes
c. By outsourcing decision-making to external sources
d. By ignoring external data
a
- What is the primary goal of data privacy and security in information management?
a. To prevent data collection
b. To ensure data integrity
c. To make data public
d. To comply with data protection standards
- What is the main purpose of knowledge management and sharing in information management?
a. To restrict data access
b. To automate data collection
c. To facilitate real-time data analysis
d. To provide timely access to accurate information
d
- In the context of information systems, what does “data processing” involve?
a. Storing data in physical files
b. Collecting data manually
c. Converting raw data into meaningful information
d. Distributing data to external sources
c
- What is one of the benefits of information systems in business processes?
a. Slowing down business operations
b. Complicating business tasks
c. Simplifying business processes
d. Outsourcing all tasks
c
- What is the primary purpose of governance of information systems in pandemic response?
a. Automating data collection
b. Strengthening data privacy
c. Establishing effective mechanisms for IT use
d. Developing predictive models
- What is the primary goal of technological infrastructure in pandemic response?
a. Preventing data breaches
b. Automating data analysis
c. Providing secure communication channels
d. Meeting essential needs like data capture and analysis
d
- Why are information systems considered essential for pandemic response?
a. To slow down the response process
b. To complicate decision-making
c. To provide evidence for informed decisions
d. To eliminate health data disparities
- What are the main areas to prioritize in pandemic response?
a. Governance of information systems
b. Technological infrastructure
c. Automation and interoperability of electronic health records
d. Data privacy, confidentiality, and security
d
- What is essential for a secure technological infrastructure in pandemic response?
a. Real-time dissemination of information
b. Patient portals
c. Communication channels for teleconsultation
d. All of the above
d
- What does interoperability of electronic health records aim to achieve in pandemic response?
a. Automate data exchange between systems
b. Prevent data breaches
c. Improve data privacy
d. Strengthen regulations
a
- What does data privacy, confidentiality, and security aim to prevent in pandemic response?
a. Data collection
b. Unauthorized access and improper use of patient information
c. Data integrity
d. Compliance with data protection standards
b