AWS Analytics Flashcards
An interactive query service that makes it easy to analyze data in Amazon S3 using standard SQL.
Amazon Athena
What is Amazon Athena integrated with?
Athena is out-of-the-box integrated with AWS Glue Data Catalog,
A managed service in the AWS Cloud that makes it simple and cost-effective to set up, manage, and scale a search solution for your website or application
Amazon CloudSearch
How many languages does AWS CloudSearch support?
34 languages
What is Amazon Elasticsearch Service?
Deploy, secure, operate, and scale Elasticsearch to search, analyze, and visualize data in real-time.
USE CASE: log analytics, full-text search, application monitoring, and clickstream analytics, with enterprise-grade availability, scalability, and security.
Industry-leading cloud big data platform for processing vast amounts of data using open source tools such as Apache Spark, Apache Hive, Apache HBase, Apache Flink, Apache Hudi, and Presto.
Amazon EMR
Run petabyte-scale analysis at less than half of the cost of traditional on-premises solutions and over 3x faster than standard Apache Spark.
Amazon EMR
A data management and analytics service purpose-built for the financial services industry (FSI)
Amazon FinSpace
What is AWS Kinesis?
Makes it easy to collect, process, and analyze real-time, streaming data so you can get timely insights and react quickly to new information
AWS Service that process and analyze data as it arrives and respond instantly instead of having to wait until all your data is collected before the processing can begin
Amazon Kinesis
What are the four services of AWS Kinesis?
Kinesis Data Firehose, Kinesis Data Analytics, Kinesis Data Streams, and Kinesis Video Streams
AWS Kinesis service: The easiest way to reliably load streaming data into data stores and analytics tools. It can capture, transform, and load streaming data into S3
Amazon Kinesis Data Firehose
AWS Kinesis service: Analyze streaming data, gain actionable insights, and respond to your business and customer needs in real-time
Amazon Kinesis Data Analytics
AWS Kinesis service: massively scalable and durable real-time data streaming service. Can continuously capture gigabytes of data per second from hundreds of thousands of sources such as website clickstreams, database event streams, financial transactions, social media feeds, IT logs, and location-tracking events.
Amazon Kinesis Data Streams
The data collected is available in milliseconds to enable real-time analytics use cases such as real-time dashboards, real-time anomaly detection, dynamic pricing, and more.
AWS Kinesis service: makes it easy to securely stream video from connected devices to AWS for analytics, machine learning (ML), playback, and other processing.
Amazon Kinesis Video Streams
What is AWS Redshift?
Most widely used cloud data warehouse. It makes it fast, simple and costeffective to analyze all your data using standard SQL and your existing Business Intelligence (BI) tools.
A fast, cloud-powered business intelligence (BI) service that makes it easy for you to deliver insights to everyone in your organization as receive answers in seconds through natural langauge queries and create and publish interactive dashboards that can be accessed from browsers or mobile devices.
Amazon QuickSight
What is AWS Data Exchange?
Makes it easy to find, subscribe to, and use third-party data in the cloud.
Qualified data providers include category-leading brands such as Reuters, who curate data from over 2.2 million unique news stories per year in multiple languages; Change Healthcare, who process and anonymize more than 14 billion healthcare transactions and $1 trillion in claims annually; Dun & Bradstreet, who maintain a database of more than 330 million global business records; and Foursquare, whose location data is derived from 220 million unique consumers and includes more than 60 million global commercial
venues
USE CASE: academic researchers can conduct studies on climate change by subscribing to data on carbon dioxide emissions; and healthcare professionals can subscribe to aggregated data from historical clinical trials to accelerate their research activities
A web service that helps you reliably process and move data between different AWS compute and storage services, as well as on-premises data sources, at specified intervals
AWS Data Pipeline
You can regularly access your data where it’s stored, transform and process it at scale, and efficiently transfer the results to AWS services
A fully managed extract, transform, and load (ETL) service that makes it easy for customers to prepare and load their data for analytics
AWS Glue
What is ETL?
Extract, Transform, Load
A service that makes it easy to set up a secure data lake in days.
AWS Lake Formation
What is a data lake?
Centralized, curated, and secured repository that stores all your data, both in its original form and prepared for analysis.
A fully managed service that makes it easy for you to build and run applications that use Apache Kafka to process streaming data.
Amazon Managed Streaming for Apache Kafka (Amazon MSK)
What is Apache Kafka?
Open-source platform for building real-time streaming data pipelines and applications.