GCP Certification Flashcards
S: Cloud Storage (GCS)?
Data lake storage: a scalable service for storing unstructured data as objects of up to 5 TB each, designed for availability, durability, and scalability.
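Name the 4 storage classes in Cloud Storage and define each.
=> Standard Storage: frequently accessed ("hot") data, no minimum storage duration
=> Nearline Storage: data accessed about once a month or less, 30-day minimum storage duration
=> Coldline Storage: data accessed about once a quarter or less, 90-day minimum storage duration
=> Archive Storage: data accessed less than once a year (backups, long-term retention), 365-day minimum storage duration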
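As a memory aid, the mapping from access frequency to storage class can be sketched in a few lines. This is only an illustration: `pick_storage_class` is a hypothetical helper, not part of any Google Cloud library, and the thresholds assume the minimum storage durations of 30, 90, and 365 days for Nearline, Coldline, and Archive.

```python
def pick_storage_class(days_between_reads: int) -> str:
    """Hypothetical helper: suggest a storage class from the expected
    number of days between reads of an object."""
    if days_between_reads < 0:
        raise ValueError("days_between_reads must be non-negative")
    # Thresholds assume the minimum storage durations of each class.
    if days_between_reads >= 365:
        return "ARCHIVE"
    if days_between_reads >= 90:
        return "COLDLINE"
    if days_between_reads >= 30:
        return "NEARLINE"
    return "STANDARD"


# Example: daily-read assets, monthly reports, yearly compliance archives.
suggestions = [pick_storage_class(d) for d in (1, 30, 120, 400)]
```

Real decisions also weigh retrieval fees and early-deletion charges, which this sketch ignores.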
What is Cloud SQL?
Google Cloud’s fully managed relational database service for MySQL, PostgreSQL, and SQL Server.
What is AlloyDB?
A high-performance, fully managed, PostgreSQL-compatible database service.
What is Spanner?
A fully managed relational database service with strong consistency and horizontal scalability.
Ingestion: Pub/Sub
An asynchronous messaging service that ingests data from external systems and delivers it to subscribers.
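To picture what "asynchronous" means here, the toy model below may help. It is an in-memory sketch built on Python's standard library, not the Pub/Sub client library; the `Topic` class and the subscription names are hypothetical. The publisher returns immediately after enqueueing, and each subscription receives its own copy of every message.

```python
import queue
import threading


class Topic:
    """Toy in-memory topic: every subscription gets its own message queue."""

    def __init__(self):
        self._subscriptions = {}

    def subscribe(self, name):
        q = queue.Queue()
        self._subscriptions[name] = q
        return q

    def publish(self, message):
        # Fan out: the publisher does not wait for any consumer.
        for q in self._subscriptions.values():
            q.put(message)


topic = Topic()
analytics = topic.subscribe("analytics")
audit = topic.subscribe("audit")

received = []


def worker():
    # Consume messages until the None sentinel arrives.
    while True:
        msg = analytics.get()
        if msg is None:
            break
        received.append(msg)


consumer = threading.Thread(target=worker)
consumer.start()

topic.publish("order-1")
topic.publish("order-2")
analytics.put(None)  # stop only the analytics consumer
consumer.join()
```

The `audit` subscription still holds both messages after the run, since nothing has consumed them yet; that buffering is what decouples producers from slow or offline consumers.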
S: data sink
A data sink is the final stop in the data journey: the place where processed and transformed data is stored for future use, analysis, and decision-making. Think of it as the reservoir at the end of the river, where valuable information collects and stays readily available.
S: BigQuery
A fully managed, serverless data warehouse for analytics, suited to OLAP workloads.
Name 3 built-in features of BigQuery and 3 use cases.
=> Features: ML, BI, and geospatial analysis
=> Use cases: Online Analytical Processing (OLAP) workloads, business reporting, and big data processing
S: Bigtable
A high-performance, highly scalable NoSQL database for fast key-value lookups with sub-10 millisecond latency.
What is IAM in BigQuery?
Identity and Access Management (IAM) controls access at the dataset, table, view, or column level
What is the key difference between a data lake and a data warehouse?
Data lakes store raw, unprocessed data, while data warehouses store processed and organized data
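The contrast can be made concrete with a small sketch. It assumes a hypothetical two-column schema (`user`, `amount`) and uses plain Python lists as stand-ins for the lake and the warehouse: the lake keeps every record exactly as it arrived, while the warehouse only accepts records that parse and fit the schema.

```python
import json

# Raw events as they might arrive from a source system (illustrative data).
raw_events = [
    '{"user": "a", "amount": "12.50"}',
    '{"user": "b"}',
    'not json at all',
]

# Data lake: store everything as-is, structure is applied later on read.
data_lake = list(raw_events)


def to_warehouse_row(line):
    """Parse and type one record; return None if it does not fit the schema."""
    try:
        record = json.loads(line)
        return {"user": str(record["user"]), "amount": float(record["amount"])}
    except (ValueError, KeyError, TypeError):
        return None


# Data warehouse: only processed, schema-conforming rows are loaded.
warehouse = [row for row in map(to_warehouse_row, raw_events) if row is not None]
```

Here the lake holds all three records, while the warehouse holds only the one that could be cleaned and typed.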
What is Firestore?
A serverless, NoSQL document database designed for high performance and automatic scaling.
Data Gov: 1. What is Dataplex? 2. What are the key benefits of using Dataplex?
1. Dataplex is a comprehensive data management solution that allows organizations to centrally discover, manage, monitor, and govern distributed data.
2. Dataplex helps break down data silos, centralize security and governance, enable distributed ownership, and make data easily searchable by business context.