Professional Cloud Architect Flashcards
What is a GCP Service that handles streaming and batch data?
Cloud DataFlow
What does DLP stand for and how is it used?
Data Loss Prevention and it is used to sanitize data and remove sensitive information
App Engine is what type of service?
PAAS Platform as a Service
Compute Engine (GCE) is what type of service?
IAAS Infrastructure as a Service
What are the FireStore Components?
Field
Collection Group
Document
Document ID
What are the Cloud DataStore Components?
Kind
Entity
Property
Key
If a Compute Engine Application exists in a single VPC across three regions and your application must communicate over VPN to your company’s on-premise network then how many VPN Gateways are required?
3 Cloud VPN gateways are required.
Cloud VPN Gateways are bound to a single region.
Create a Cloud VPN Gateway in each region
What type of migration model does Dress4Win state in their business requirements?
Lift and Shift
What are the 5 sequential steps for cloud migration?
1 Assess 2 Pilot 3 Move Data 4 Move Applications 5 Cloudify & Optimize
Dynamic Routing uses a _________ to automatically discover new subnet routes
Cloud Router
The 4 layers of the GCP Cloud Resource Hierarchy
1 Organization
2 Folders
3 Projects
4 Resources
Which network interconnect method connects your network to a GCP VPC over a public internet encrypted tunnel?
Cloud VPN
Command to create a new storage bucket
gsutil mb -l {location} -c {storage class} gs://BucketName
Cloud Router uses this protocol to handle dynamic routing between locations
BGP Border Gateway Protocol
Where can you export Stackdriver logs to (not counting customer locations)
1 Cloud Storage
2 Cloud Pub/Sub
3 BigQuery
What is the max speed of a single Cloud VPN tunnel (non-peered)
1.5 Gbps
Every load balancer must have a ___ and a ____
Frontend || Backend
Role necessary to link a project to a billing account
Billing Account User
How many VPN tunnels can you create in a single Cloud VPN gateway
8
What is the default, implied status of all egress traffic in a VPC firewall
Allow All
Google Cloud Storage holds what type of data?
Unstructured
This service is required to setup dynamic routing over a Cloud VPN Service
Cloud Router
Where does Cloud Dataaprep load data from?
Cloud Storage and BigQuery
The two methods of permissions for Google Cloud Storage
1 IAM: Identity and Access management
2 ACL: Access control list
This database service is ideal for low-latency storage of time-series data
Cloud BigTable
Relational Databases
Cloud SQL
Cloud Spanner
Non-Relational Databases
Cloud DataStore
Cloud FireStore
Cloud BigTable
DataWareHouse
BigQuery
This managed database is a no-ops petabyte-scale data warehouse that queries data in standard SQL Format
Big Query
Retention period for data access logs
30 days
______ Roles apply to the entire project.
Primitive
An HTTP load balancer can forward traffic by ____ and ____
location
content
Which GCP load balancers are multi-regional in scope?
1 HTTP Load Balancer
2 TCP Proxy
3 SSL Proxy
VPC subnets can exist in more than one _____
zone (in the same region)
Which connection protocol does the Cloud VPN service use?
IPSEC
This IAM member allows public/anonymous access to a resource
allUsers
Google account type for members of an organization WITHOUT access to Google apps
Cloud Identity Domain
What type of managed database is ideal for web and mobile applications?
Cloud DataStore
More lightweight container image option to run on GKE
Alpine Linux
The name for the modular components of a Cloud Deployment Manager Configuration
Templates
GCP Service for Providing a ‘single pane of glass’ for monitoring resources and alerts across projects in AWS
StackDriver Monitoring
VPC firewall rules are applied on a per-instance basis
True
What layer of the Cloud Resource Hierarchy are chargeable resources hosted in?
Projects
Which networking interconnect option connects your business directly to Google, but not directly to GCP VPC?
Peering
The 3 Primitive Roles and the types of access they give:
1 Owner: Full Project Access (Billing and Assigning IAM Roles)
2 Editor: Full Access minus- Billing and IAM access
3 Viewer: View only
Google account type for a collection of individual Google Accounts
Google Groups
When to use Dataproc over Data Flow
When using Hadoop/Spark workflows
Another term for mapping Cloud Identity to Active Directory to duplicate account information.
Federation
What is a pod on GKE?
Smallest deployable unit. Contains one or more containers that run on nodes
The three IAM Role Types
1 Primitive
2 Predefined
3 Custom
Two format options for Cloud Deployment Manager template files
Jinja
Python
The five (non-beta) Stackdriver services
1) . Logging
2) . Trace
3) . Monitoring
4) . Error Reporting
5) . Debug
Cloud Storage can act as a block-level SAN replacement (True/False)
False; you would need to use a persistent disk for a direct SAN replacement
The two Memcache service levels
1 Dedicated
2 Shared
GCP service for asynchronous messaging, used for streaming data ingest
Cloud Pub/Sub
In a Shared VPC network, the ____ project hosts the VPC components, and the ___ project uses hosted VPC resources
Host
Service
This managed database is ideal for NoSQL purposes, is NoOps in setup/maintenance, and is ideal for mobile save game state
Cloud DataStore
What is a service account?
1 Assigned to an application or a server
2 Authenticated with a service account key
3 Both a member and a resource
How to easily apply VPC firewall rules to individual instances instead of the entire network
Network Tags
Admin Activity Logs are ____ by default
Enabled
When are un-managed instance groups useful?
Migrating grouped servers to the cloud with minimal disruption in workflow
____ provides a direct physical connection to connect your on-premises network to a Google Cloud VPC network.
Cloud Interconnect
How to optimize your CDN cache performance:
Configure Cache Hit Ratio
Collection of statements that define who has access to what resource on GCP
IAM Policy
This application is required to configure a Cloud Storage bucket as a mounted disk on a GCE instance.
Google Cloud Storage Fuse (gcs-fuse)
a managed instance group is created from an ____
Instance Template
Permissions for working with VPC networks fall under this service.
Compute Engine
What are the 5 load balancer options in GCP
1) Internal
2) Network
3) HTTP(s)
4) TCP Proxy
5) SSL Proxy
How to add subnets in other regions to the same VPC network:
No configuration necessary
What are the two database structure formats we discussed in this course?
Relational (SQL) || Non-Relational (NoSQL)
An export in Stackdriver Logging requires what components to setup?
A filter to select log entries
A destination to export filtered logs
Sink: Select which filtered logs to send to which destination
Format of Deployment Manager configuration files
YAML format
GCP’s service that is build on Apache Beam, used for processing both batch and streaming data
Cloud DataFlow
Retention period for admin activity logs
400 days
This type of disk is directly connected to a GCE instance and must be set up on instance creation
Local SSD
Where can billing data be exported?
1 Cloud Storage
2 Big Query
Which are the benefits of quotas?
Protection of unexpected spikes in resource usage
Prevent runaway consumption due to error or malicious intent
What could be the cause if an Instance Group VMs keep restarting every minute?
1 Failing Health Check
2 Configure the firewall to allow proper access to instance group VM’s (subnet, tag) from load balancer IP
MountKirk Games is looking to migrate how many environments to the cloud?
(2) environments different storage for each service
1 Game BackEnd on Google Cloud Compute Engine (GCE)
2 Analytics
What would fulfill the MountKirk technical requirement for “connecting a trans-actional database service to manage user profiles and game state”?
Cloud Datastore - NoSQL transactional database - perfect for game user-profiles and game states
What would fulfill the MountKirk technical requirement “Store game activity in a timeseries database service for future analysis”?
Store in BigQuery
BigQuery vs BigTable
BigQuery a lot more managed
No requirement for low latency analytics response time (Big Table)
BigQuery has a response measured in seconds, scales efficiently
BigQuery reading from BigTable possible response as well
What would fulfill the MountKirk technical requirement “As the System scales, ensure that data is not lost due to processing backlogs. “?
1 HTTP Load Balancer- Automatically scales to meet demand
2 Managed Instance Groups - also auto-scales
3 Pub/Sub - Buffers late/slow data
What would fulfill the MountKirk technical requirement “Run hardened Linux Distro”?
Managed Instance groups with custom images
What would fulfill the MountKirk technical requirement “Process incoming (streaming) data on the fly directly from the game servers?
Connect services (stackdriver logs metrics, gce game serverss) with Pub/Sub Process with DataFlow
What would fulfill the MountKirk technical requirement “Process data that arrives late because of slow mobile networks” ?
Pub/Sub: Scales and Buffers messages
DataFlow: Accounts for late/out of order data
What would fulfill the MountKirk technical requirement “Allow queries to access at least 10 TB of historical data.”?
BigQuery - SQL Queries against data
What would fulfill the MountKirk technical requirement “Process files that are regulary uploaded by users’ mobile devices. ?
Upload to Cloud Storage
Process via DataFlow
What would fulfill the Dress4Win technical requirement equivalent of “MySQL”?
DataCenter»_space; GCP
MySQL»_space; Cloud SQL (Lift . Shift)
5TB»_space; 10 TB Size Limit
Single Region - no global footprint requirement
Migration -
1 Create replica server managed by Cloud SQL
2 Once replica is synced: Update applications to point to replica
3 Promote replica to stand-alone instance
What would fulfill the Dress4Win technical requirement “Redis 3 server Cluster” ?
Two options
1) Run Redis server on Compute Engine
2) Use new Memorystore managed Redis database
What would fulfill the Dress4Win technical requirement “40 Web Application servers providing micro-services based APIs and static content. “Tomcat - Java”, “Nginx”, “4 core CPUs”,”32 GB of RAM”?
The existing environment has lots of idle time
- Managed instance groups - autoscaling using custom machine types (Fits Lift . Shift)
Alternatively - can re-architect for GKE/GAE for microservices deployments for future phases
What would fulfill the Dress4Win technical requirement “20 Apache Hadoop/Spark servers:”?
Cloud Dataproc connecting to Cloud Storage
What would fulfill the Dress4Win technical requirement “3 RabbitMQ servers for messaging, social notifications, and events:”?
Pub/Sub likely replacement
Can also deploy same environment on Compute engine instance group (lift and shift)
What would fulfill the Dress4Win technical requirement “Jenkins, monitoring, bastion hosts, security scanners”?
No managed service equivalents
Use GCE instances - custom machine types
Think about using the Market Place as well
What would fulfill the Dress4Win technical requirement “iSCSI for VM hosts/Fiber channel SAN - Backup for MySQL databases” ?
SAN/iSCSI requires block storage
Persistent disks working in a SAN Cluster
What would fulfill the Dress4Win technical requirement “NAS - image storage, logs, backups”?
Cloud Storage - direct replacement
Infinite scale in a single bucket
Persistent also an option
What would fulfill the TerramEarth business requirement “Decrease unplanned vehicle downtime to less than 1 week”?
Convert to 100% cellular connectivity
What would fulfill the TerramEarth business requirement “Support the dealer network with more data on how their customers use their equipment to better position new products and services”?
Share insights with Data Studio
What would fulfill the TerramEarth business requirement “Have the ability to partner with different companies – especially with seed and fertilizer suppliers in the fast-growing agricultural business – to create compelling joint offerings for their customers”?
- Share insights with Data Studio
- BigQuery / ML analytics to predict customer needs
- Tech lead will enable partnerships
What would fulfill the TerramEarth technical requirement “expand beyond a single datacenter to decrease latency to American midwest and east coast”?
Multi-regional/global services
What would fulfill the TerramEarth technical requirement “create a backup strategy”?
Regular BigQuery Exports to Cloud Storage
What would fulfill the TerramEarth technical requirement “Increase the security of data transfer from equipment to the datacenter”?
- Cloud Endpoints - manage and protect APIs
- Cloud IoT Core - also managed security
- Customer supplied encryption keys
What would fulfill the TerramEarth technical requirement “Improve data warehouse”?
- Cloud dataflow - transform incoming streaming data to the preferred format
- Alternatively, stage in Cloud Storage, clean with Cloud Dataprep, and run job backed by DataFlow into BigQuery
What would fulfill the TerramEarth technical requirement “Use Customer and equipment data to anticipate customer needs”?
Pair BigQuery with machine learning services for predictive analytics
_______ provides visual notebooks for working with BigQuery/Cloud ML Engine data for ML/analytics?
Datalab
What does CSEKs stand for?
Customer-supplied encryption keys