L 3.2 Information Systems Development + Big Data & Business Intelligence Flashcards
data drive organisations have
access to data
access to analytics
analytics and domain knowledge collaboration
data informed decision making
Big data types
- structured data: transactions data
- semi-structured data: clickstreams or sensor data
- unstructured data: text, audio and video data, comments on socials
bid data characteristics
- volume: amount of users
- velocity: how fast, posts per day
- variety: how many types of data, follows, comments, uploads
data warehouse
stores data form different business processes, quick to access historical information for business intelligence
databases
- places where things are stored like wikipedia, Spotify, amazon
- store data and make it accessible where and when needed
- can store organisational data ranging from inventory to demand forecasts
relational databases (RDSMS)
- type of database
- most common, easy to implement and us
- relatively rigid, hard to scale
- attempts to balance efficiency of storage needs ease on retrieval and other factors by storing data in tablets linked via relationships
NoSQL databse
-type of database
- flexible/scalable, often have ad-hoc depending on company needs
- easier than RDBMS therefore becoming more popular
business intelligence and advanced analytics
- refers to tools and techniques for analysing and visualising past data
- improve business with data
why do organisation need business intelligence and advanced analytics
- most businesses are data driven
- can better respond to ongoing threat and opportunities and plan for the future
- humans need to translate the output and turn it into action
visual analytics
- combines data and human ability to spot patterns that are hard to express/questions systematically
- databases → BI software → visual analytics
clustering
involves grouping similar data points together based on their features or attributes
goal is to discover hidden pattern or structures in the data
spurious correlations
not a meaningful correlation
advanced data analytics and AI
- through this we want to explain why things happen and predict what will happen in the future
- gain competitive advantage by innovation
machine learning
using statistical tools to model relationships between variables
text mining - NLP
- NLP: natural language processing
- a number of statistical methods to extract content from text data
- another way of reinforcement learning
- the more the internet is used the better is gets at understanding behaviours