TB2 Flashcards

Question

Encryption

Answer 1

Encryption is a security method in which information is encoded so that only authorized parties can access it. It transforms readable data (plaintext) into an unreadable format (ciphertext) using an encryption algorithm and a key. Even if a data breach occurs, the stolen information remains unreadable without the key.
Importance of Encryption:
- Data Security: Protects sensitive data such as personal information, financial records, and confidential corporate data from breaches and unauthorized access.
- Privacy Compliance: Many industries are subject to regulations on the protection of sensitive information, such as GDPR and HIPAA, which require or strongly encourage the use of encryption.
- Trust: Maintains the trust of customers and stakeholders by ensuring that their data is handled securely and responsibly.
- Data Integrity: When combined with integrity checks (for example, authenticated encryption), helps ensure that data is not altered or tampered with during storage or transmission, maintaining its accuracy and reliability.

Answer 2

- Personally Identifiable Information (PII): Names, addresses, social security numbers, etc.
- Financial Information: Credit card numbers, bank account details, transaction records.
- Confidential Business Information: Trade secrets, proprietary technology, strategic plans.

Answer 3

Databases use various encryption techniques to protect data at different levels, from individual cells to entire databases. These techniques fall into two broad categories, symmetric and asymmetric encryption, each with its own advantages and use cases.
Symmetric Encryption: Uses a single key for both encryption and decryption. It is efficient and fast, which makes it suitable for encrypting large volumes of data. Algorithms: Advanced Encryption Standard (AES), Triple Data Encryption Standard (3DES, now considered legacy), and Blowfish. Ideal for encrypting data at rest, such as entire database files or backups, where encryption and decryption speed is crucial.
Asymmetric Encryption: Uses a pair of keys: a public key for encryption and a private key for decryption. It removes the need to share a secret key in advance, but it is far more computationally intensive. Algorithms: RSA (Rivest-Shamir-Adleman) and Elliptic Curve Cryptography (ECC); the Digital Signature Algorithm (DSA) belongs to the same public-key family but is used for signing rather than encrypting. Best suited for establishing secure connections, for example when protecting data in transit or setting up remote database access.

Answer 4

At Rest: Protects data stored within the database or on disk. Techniques include Transparent Data Encryption (TDE) and file-system level encryption. It's crucial for preventing data breaches resulting from physical theft or unauthorized access to storage media.
In Transit: Secures data as it moves between the database and applications or between servers. Implemented through TLS (Transport Layer Security), the successor to the now-deprecated SSL (Secure Sockets Layer), it safeguards against interception and eavesdropping.

Answer 5

Application-Level Encryption: The application encrypts data before sending it to the database. This approach provides fine-grained control over what data is encrypted and allows for application-specific encryption schemes. Database-Level Encryption: The database system itself manages encryption, offering a more straightforward implementation that requires less modification to existing applications. However, it may provide less flexibility in terms of which data is encrypted.

Answer 6

Data retrieval in the context of encrypted databases involves fetching the requested data and converting it from its encrypted form (ciphertext) back into a readable format (plaintext). This process is crucial for maintaining the confidentiality and integrity of sensitive information while ensuring it remains accessible to authorized users.
The Retrieval Process:
- Request Authentication: Verifies the identity of the user or system requesting data, ensuring that only authorized parties can initiate data retrieval.
- Decryption: Once access is granted, the database decrypts the requested data using the appropriate decryption key. This step requires careful management of encryption keys to ensure they are accessible to legitimate users while being protected from unauthorized access.
- Data Presentation: The decrypted data is then presented to the user or application in a readable format. This step often involves additional security measures, such as secure transmission protocols and session management, to protect data as it's delivered to the end user.

Answer 7

Performance Overhead: Encryption and decryption processes can introduce latency, affecting the performance of database queries and data retrieval operations. Key Management Complexity: Managing the lifecycle of encryption keys, including generation, storage, rotation, and revocation, poses significant challenges. Mismanagement can lead to data loss or breaches. Access Control and Authentication: Implementing robust access control mechanisms is essential to prevent unauthorized data access. This includes managing permissions and roles within the database management system.

Answer 8

Implement Role-Based Access Control (RBAC): Define roles and permissions clearly to ensure users have access only to the data they are authorized to view. Use Secure Transmission Protocols: When transmitting data, especially over public networks, use protocols like TLS (Transport Layer Security) to protect the data in transit. Regular Audits and Monitoring: Conduct regular security audits and monitor access logs to detect and respond to unauthorized access attempts promptly. Encryption Key Management: Utilize dedicated key management solutions to automate key rotation, securely store keys, and ensure they are only accessible to authorized applications and users. Data Masking: For extra security, especially in development and testing environments, use data masking techniques to obscure sensitive information, ensuring that even if data is accessed, it cannot be misused.
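
As a rough illustration of how role-based access control and data masking can work together, the sketch below lets one role see a full card number and another see only a masked version. The role names, permission names, and functions are all hypothetical; in practice these rules would normally be enforced by the database management system itself.

```python
# Hypothetical role-to-permission mapping, for illustration only.
ROLE_PERMISSIONS = {
    "analyst": {"read_masked"},
    "dba": {"read_masked", "read_full"},
}


def mask_card_number(card_number: str) -> str:
    """Hide all but the last four characters of a card number."""
    return "*" * (len(card_number) - 4) + card_number[-4:]


def read_card_number(role: str, card_number: str) -> str:
    """Return the value a given role is allowed to see, or raise if none."""
    permissions = ROLE_PERMISSIONS.get(role, set())
    if "read_full" in permissions:
        return card_number
    if "read_masked" in permissions:
        return mask_card_number(card_number)
    raise PermissionError(f"role {role!r} may not read card numbers")
```

An unknown role has no permissions at all, so access is denied by default rather than granted by default.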

Answer 9

Homomorphic Encryption: Allows computations on encrypted data, providing results without ever decrypting the data. This advanced technique is promising for secure data processing and analysis. Blockchain for Data Integrity: Leveraging blockchain technology to ensure the integrity and immutability of transaction logs and sensitive data records.

Answer 10

The pgcrypto extension adds cryptographic functions to PostgreSQL, enabling encryption, decryption, and secure storage of data directly within the database. It supports symmetric encryption and a limited form of asymmetric (public-key) encryption, and it allows secure password storage through hashing.
For symmetric encryption, pgcrypto offers the PGP symmetric functions (PGP_SYM, that is, pgp_sym_encrypt and pgp_sym_decrypt) and raw AES. Remember: symmetric encryption means the same key is used to encrypt and decrypt data.
- PGP_SYM: Versatile and well suited to encrypting text data with a passphrase, which makes it convenient when data must be shared securely between parties who both hold the passphrase.
- AES: Known for its speed and robust security, and suitable for encrypting large volumes of data efficiently. Recommended where high performance and strong security are both required, such as storing sensitive user information or financial records.

Answer 11

In secure password storage practices, passwords are hashed, not encrypted. The distinction is crucial.
- Hashing is a one-way process, also used for verifying the integrity of data. The same input always produces the same output, but it is computationally infeasible to recover the original input from the hash output. This is why it's impossible to "decrypt" and view the original password from a hash stored in the database.
- Encryption is a two-way process: data is made unreadable via encryption and can be returned to its original readable form via decryption, using a specific key.
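
The properties of hashing described above can be observed with a short sketch using Python's standard hashlib module. SHA-256 is used here only because it is a convenient general-purpose hash; it is not a password-hashing algorithm, and the input strings are made up for illustration.

```python
import hashlib


def sha256_hex(data: bytes) -> str:
    """Return the SHA-256 digest of data as a hexadecimal string."""
    return hashlib.sha256(data).hexdigest()


digest_a = sha256_hex(b"correct horse battery staple")
digest_b = sha256_hex(b"correct horse battery staple")
digest_c = sha256_hex(b"correct horse battery stapl3")

print(digest_a == digest_b)  # same input, same digest
print(digest_a == digest_c)  # one changed character, different digest
print(len(digest_a))         # fixed length regardless of input size
```

Note that hashlib offers no function that takes a digest and returns the input: checking a value means hashing it again and comparing the results.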

Answer 12

The purpose of hashing passwords before storing them is to protect user credentials. Even if a database is compromised, the attackers obtain only the hashes, not the actual passwords. Algorithms designed specifically for password storage, such as bcrypt, scrypt, and Argon2, are deliberately slow (and in some cases memory-intensive), which makes guessing passwords by brute force computationally impractical.

Answer 13

A PostgreSQL function used to hash passwords. Utilizes a cryptographic hash function to convert plaintext passwords into a secure, fixed-size hash. Ensures that stored passwords are not kept in plain text, enhancing security.

Answer 14

BF (Blowfish): This is the algorithm used by bcrypt, which is well-regarded for its security due to its adaptive cost factor. It allows you to scale the algorithm's complexity and resistance to brute-force attacks as hardware capabilities improve. MD5: An older hashing algorithm that is much faster but significantly less secure than Blowfish. It's generally not recommended for new systems due to vulnerabilities to collision attacks and its susceptibility to fast brute-force attacks. XDES (Extended DES): An extension of the traditional DES (Data Encryption Standard) algorithm, offering better security through a configurable number of encryption rounds. Like Blowfish, it can be set to be computationally intensive, though it's generally less used than bcrypt. DES: The original Data Encryption Standard algorithm. It's considered obsolete for most purposes due to its short key length, which makes it vulnerable to brute-force attacks.

Answer 15

A salt is a random sequence of characters added to the input of a hash function along with the password, so the same password hashed with different salts produces different hashes. In pgcrypto, a salt is typically generated with the gen_salt function, which supports multiple hashing algorithms, and then passed to crypt.
Purpose:
- Uniqueness: By adding a salt to each password before it is hashed, even identical passwords will produce unique hash values, thus preventing attackers from using pre-computed hash tables (rainbow tables) to crack the passwords.
- Security Enhancement: Because each salted hash must be attacked separately, a set of hashed passwords becomes much harder to crack. This is particularly important in a database breach scenario where attackers gain access to hashed passwords.
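
The effect of a salt can be sketched with Python's standard library, using hashlib.scrypt as the password-hashing function. This is an illustration under assumed settings rather than a recommended configuration: the cost parameters, the sample password, and the helper names are all hypothetical.

```python
import hashlib
import hmac
import os

# Illustrative scrypt cost parameters (assumed values, not a recommendation).
SCRYPT_PARAMS = {"n": 2**14, "r": 8, "p": 1}


def hash_password(password: str) -> tuple:
    """Hash a password with a fresh random salt; return (salt, hash)."""
    salt = os.urandom(16)
    digest = hashlib.scrypt(password.encode(), salt=salt, **SCRYPT_PARAMS)
    return salt, digest


def verify_password(password: str, salt: bytes, expected: bytes) -> bool:
    """Re-hash the candidate with the stored salt and compare in constant time."""
    candidate = hashlib.scrypt(password.encode(), salt=salt, **SCRYPT_PARAMS)
    return hmac.compare_digest(candidate, expected)


salt_1, hash_1 = hash_password("hunter2")
salt_2, hash_2 = hash_password("hunter2")
print(hash_1 != hash_2)  # same password, different salts, different hashes
```

The salt is stored next to the hash because verification needs it; it is not a secret, it only has to be unique per password.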

Answer 16

Database performance refers to the effectiveness of database systems in managing data operations, measured by the system's response time, throughput, and resource utilization. High performance means that the database can handle queries and transactions quickly and efficiently, with minimal delays and optimal use of hardware resources.
Why is tuning important?
- User Experience: The speed at which a database processes and returns information can significantly affect the user's interaction with an application. Faster responses improve user satisfaction and engagement.
- Resource Optimization: Efficient database operations consume less CPU, memory, and disk I/O, which not only improves the current system's responsiveness but also scales better with increased load, delaying or eliminating the need for costly hardware upgrades.
- Consistency and Reliability: Well-tuned databases handle peak loads effectively, maintain consistent performance levels under varying loads, and ensure data integrity and security.

Answer 17

Redundancy: Poor design often leads to unnecessary duplication of data across the database. This not only wastes storage space but also complicates updates, as the same data may need to be updated in multiple places. Over time, redundancy can lead to significant inefficiencies and increased likelihood of data inconsistencies. Update Anomalies: A database that hasn't been properly normalized is prone to update anomalies. This means that changes to data in one part of the database can inadvertently lead to inconsistencies elsewhere. For example, if duplicate data exists in multiple tables, updating it in one place but not the others can lead to discrepancies, making the database unreliable. Inefficiency: Inefficient data organization can slow down query performance, especially as the volume of data grows. For instance, without proper indexing or separation of frequently accessed data from less frequently used information, queries can become slower due to the need to scan large amounts of irrelevant data. Scalability Issues: Databases designed without considering future growth can encounter scalability issues. This may manifest as performance degradation under increased load, difficulty in implementing necessary schema changes, or challenges in optimizing queries to meet evolving business requirements. Loss of Flexibility: A rigidly structured database can significantly hinder the implementation of new features or adjustments to the business logic. When the database schema is too closely tied to the current application logic, any change in business requirements can require extensive modifications to the database, leading to higher development costs and potential downtime.

Answer 18

Query optimization is a crucial aspect of database management aimed at reducing the time and resources required to execute SQL queries. It involves rewriting queries, choosing the most efficient execution paths, and employing database features like indexes and partitioning to speed up data retrieval. Effective query optimization can significantly improve application performance and responsiveness, especially in databases with large volumes of data.

Answer 19

Execution Plan: Databases use execution plans to determine how to carry out a query. Understanding these plans, which can be viewed using commands like EXPLAIN in PostgreSQL, is fundamental to identifying bottlenecks and optimizing queries. Index Usage: Proper use of indexes is one of the most effective ways to improve query performance. However, indexes should be used strategically, as unnecessary indexes can degrade write performance and consume additional storage. Query Rewriting: Often, the way a query is written can impact performance. Rewriting queries to be more efficient, such as minimizing subqueries, using joins appropriately, and filtering data as early as possible in the query, can lead to significant improvements. Statistics and Cardinality Estimates: Modern DBMSs collect statistics about table sizes and data distributions, which are used to estimate the "cost" of different query plans. Keeping these statistics up to date is crucial for the optimizer to make accurate decisions. Leverage Database Features: Use database-specific features like partitioning to improve query performance on large tables, and consider using materialized views to cache complex queries.
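
The link between execution plans and index usage can be seen in a small, self-contained sketch. It uses SQLite (through Python's standard sqlite3 module) as a stand-in, so the command is EXPLAIN QUERY PLAN rather than PostgreSQL's EXPLAIN and the output format differs; the table, column, and index names are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, total REAL)"
)
conn.executemany(
    "INSERT INTO orders (customer_id, total) VALUES (?, ?)",
    [(i % 100, i * 1.5) for i in range(1000)],
)

QUERY = "SELECT id, total FROM orders WHERE customer_id = ?"


def plan(sql: str) -> str:
    """Return SQLite's query plan for sql as a single string."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, (42,)).fetchall()
    # The last column of each row holds the human-readable plan step.
    return "; ".join(row[-1] for row in rows)


plan_before = plan(QUERY)
conn.execute("CREATE INDEX idx_orders_customer ON orders (customer_id)")
plan_after = plan(QUERY)

print(plan_before)  # the plan without an index: a scan of the whole table
print(plan_after)   # the plan with the index: a search via idx_orders_customer
```

Comparing the plan before and after a change is the basic loop for query tuning: the query text stays the same, and only the access path chosen by the optimizer changes.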

Answer 20

MONITORING AND CONTINUOUS IMPROVEMENT
Continuous monitoring helps identify performance bottlenecks, inefficient queries, and potential data integrity issues. By tracking performance metrics and patterns over time, database administrators and developers can proactively address issues before they impact the user experience or lead to more significant problems.
Key Areas for Monitoring:
- Query Performance: Identify slow-running queries that may need optimization.
- Resource Utilization: Monitor CPU, memory, and disk I/O to ensure the database server has sufficient resources.
- Index Usage and Efficiency: Ensure indexes are being used effectively and identify opportunities for additional indexing.
- Errors and Warnings: Track error logs for any unusual activity or recurrent issues that need attention.

Answer 21

Continuous Improvement Cycle:
- Assess: Regularly review performance metrics and logs.
- Plan: Identify issues and prioritize fixes based on impact.
- Implement: Apply optimizations, such as query rewriting, index adjustments, or configuration changes.
- Review: Assess the impact of changes and document lessons learned.

Answer 22

Design Challenges: Creating an efficient database requires careful planning and understanding of the data's nature. A poorly designed database can lead to redundancy, inconsistency, and inefficient data retrieval. Management Overhead: Databases need ongoing maintenance to ensure they run smoothly. This includes tasks like updating systems, backing up data, and optimizing performance. Such management tasks require skilled administrators and can become time-consuming and complex, especially for large databases.

Answer 23

Initial Setup Costs: The cost of setting up a database system can be high, especially for large-scale operations. This includes hardware, software licenses, and the cost of hiring skilled personnel to design and implement the database. Ongoing Operational Costs: Beyond the initial setup, databases incur ongoing operational costs, including maintenance, security measures, and updates. For complex and large-scale systems, these costs can be significant.

Answer 24

Handling Large Data Volumes: As the amount of data grows, databases can experience slowdowns if not properly optimized or scaled. Performance tuning and scaling solutions are necessary but can add to the complexity and cost. Scalability Limitations: While databases are designed to scale, doing so effectively requires careful planning and additional resources. Scaling challenges can arise from hardware limitations, software architecture, or the database model used.

Answer 25

Data Breaches and Attacks: Despite robust security measures, databases are constant targets for cyberattacks, leading to potential data breaches. The consequences of such breaches can be severe, including loss of sensitive information, financial loss, and damage to reputation. Complex Security Management: Ensuring a database is secure involves managing access controls, encrypting data, and monitoring for suspicious activity. This complexity can be overwhelming, especially for organizations without dedicated security experts.

Answer 26

SQL Injection: A technique used by attackers to execute malicious SQL commands by exploiting vulnerabilities in the database layer. This can lead to unauthorized access to sensitive information. Data Breaches: Unauthorized access to the database can result in sensitive data being stolen, including personal information, financial records, and intellectual property. Insider Threats: Sometimes, the threat comes from within an organization. Employees with access to databases might misuse their privileges, intentionally or accidentally exposing data.
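
To make the SQL injection risk concrete, the sketch below runs the same malicious input through a query built by string concatenation and through a parameterized query. It uses an in-memory SQLite database via Python's standard sqlite3 module purely for illustration; the table, its contents, and the input string are made up.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (name TEXT, secret TEXT)")
conn.executemany(
    "INSERT INTO users VALUES (?, ?)",
    [("alice", "a-secret"), ("bob", "b-secret")],
)

malicious_input = "nobody' OR '1'='1"

# Vulnerable: the input is spliced into the SQL text and changes its meaning.
unsafe_sql = "SELECT name FROM users WHERE name = '" + malicious_input + "'"
unsafe_rows = conn.execute(unsafe_sql).fetchall()

# Safer: the input is passed as a bound parameter and treated purely as data.
safe_rows = conn.execute(
    "SELECT name FROM users WHERE name = ?", (malicious_input,)
).fetchall()

print(len(unsafe_rows))  # every row in the table is returned
print(len(safe_rows))    # no user has that literal name
```

The concatenated query returns every user because the injected OR clause is always true, while the parameterized query looks for a user literally named with the whole input string and finds none.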

Answer 27

Encryption: Encrypting data stored in databases is fundamental to protecting it from unauthorized access. Both data at rest and in transit should be encrypted. Access Controls: Implement strict access control policies to ensure that only authorized personnel can access sensitive data. Use role-based access control to minimize the risk of insider threats. Regular Audits and Monitoring: Conduct regular security audits to check for vulnerabilities and monitor database activity to detect any suspicious behaviour promptly.

Answer 28

Keep Software Up to Date: Regularly update database management software to protect against known vulnerabilities. Use Strong Authentication Mechanisms: Implement strong password policies and consider multi-factor authentication to enhance security. Backup and Recovery Plans: Maintain regular backups of data and have a recovery plan in place to deal with data loss or corruption incidents.

Answer 29

Full Backup: Copies all data from the database. It provides the foundation for other types of backups but requires more storage space and time to complete. Example: Performing a full backup weekly to ensure a complete copy of all data at that point in time. Incremental Backup: Only backs up the data that has changed since the last backup (either full or incremental). It's faster and requires less storage than a full backup but depends on the previous backups for a complete restore. Example: Daily incremental backups to capture the changes made each day, reducing the time and storage space needed. Differential Backup: Captures the changes made since the last full backup. Unlike incremental backups, each differential backup grows in size, as it accumulates all changes since the last full backup. Example: Using differential backups mid-week to quickly recover data without needing to process all daily increments.
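
The difference between the three backup types is easiest to see in what each one needs at restore time. The following toy model treats the database as a Python dictionary and ignores deletions; it is a simplified sketch of the idea, not how any real backup tool stores data.

```python
def changed_since(current: dict, baseline: dict) -> dict:
    """Return the entries whose values differ from baseline (deletions ignored)."""
    return {k: v for k, v in current.items() if baseline.get(k) != v}


# Day 0: the full backup copies everything.
db = {"a": 1, "b": 2}
full = dict(db)

# Day 1: one row changes.
db["a"] = 10
incremental_1 = changed_since(db, full)   # changes since the last backup (the full)
differential_1 = changed_since(db, full)  # changes since the last full backup
state_after_day_1 = dict(db)

# Day 2: another row is added.
db["c"] = 3
incremental_2 = changed_since(db, state_after_day_1)  # only day 2's change
differential_2 = changed_since(db, full)              # day 1 and day 2 changes

# Restoring from incrementals needs the full backup plus every increment, in order.
restored_incremental = {**full, **incremental_1, **incremental_2}
# Restoring from differentials needs the full backup plus only the latest one.
restored_differential = {**full, **differential_2}

print(restored_incremental == db, restored_differential == db)
```

The second differential is larger than the second incremental because it repeats day 1's change, which is the storage cost paid for a shorter restore chain.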

Answer 30

Regular Scheduling: Implementing a regular schedule for full, incremental, and differential backups to balance between protection and resources. Storage Solutions: Utilizing various storage solutions, including on-site servers and cloud storage, to protect against site-specific disasters. Testing and Verification: Regularly testing backups to ensure data can be effectively restored and verifying backup integrity to confirm data is not corrupted.

Answer 31

Assessment: Quickly assess the extent of the data loss or corruption to determine the appropriate recovery method. Choosing a Recovery Point: Decide on the most appropriate point to restore the data from, considering the data loss event and the state of available backups. Preparation: Ensure the recovery environment is ready, which may include setting up hardware, configuring software, or preparing the network for data restoration. Restoration: Execute the recovery process, which may involve restoring from a full backup, applying incremental backups, or using point-in-time recovery techniques. Validation: After restoration, validate the integrity and completeness of the recovered data to ensure it meets operational requirements. Review: Conduct a post-recovery review to understand the cause of the data loss, evaluate the effectiveness of the recovery process, and identify improvements for future recovery plans.

Answer 32

- ACID Compliance: Ensures reliable transaction processing and maintains data integrity in complex transactions.
- Extensive Data Type Support: Handles a wide array of data types, including geometric types, custom types, and JSON, facilitating flexible data storage and manipulation.
- Advanced Replication: Built-in support for streaming replication and logical replication, allowing for high availability and flexible replication strategies.
- Point-in-Time Recovery (PITR): Supports continuous archiving of transaction logs, enabling restoration of data to any point in time covered by the archived logs.
- Robust Security Features: Offers strong security features like role-based access control, SSL-encrypted connections, and row-level security to protect sensitive data.
- Flexible Backup Options: Offers a range of backup options, including SQL dump, file-system-level backup, and continuous archiving, catering to different recovery point objectives (RPOs).
- Efficient Disaster Recovery: WAL (Write-Ahead Logging, a standard method used in database systems to ensure data integrity) archiving and streaming replication facilitate efficient disaster recovery strategies, minimizing downtime.
- Tooling and Extensions: A wealth of third-party tools and extensions like pgBackRest, Barman, and WAL-E enhance PostgreSQL's native backup and recovery capabilities, providing automation, efficiency improvements, and added flexibility.
- Performance and Scalability: Backups can be taken while the database remains online, and PostgreSQL's architecture is designed to limit the impact of backup and recovery processes on normal operations.

Answer 33

Regular Backup Schedules: Automate backups to occur during off-peak hours to minimize performance impact. Decide on the frequency of full, incremental, and differential backups based on data change rate and recovery objectives. Backup Types and Rotation: Use a combination of full, differential, and WAL backups. Implement a retention policy that aligns with business requirements and storage capacity. Disaster Recovery Plan: Develop a comprehensive disaster recovery plan that includes backup, recovery procedures, roles, and responsibilities.

Answer 34

- Backup Automation: Utilize tools such as cron jobs, pgAdmin, or custom scripts to automate the backup process. Ensure notifications are in place for any failures or issues.
- Automated Testing: Schedule regular automated recovery drills to validate backup integrity and the restoration process.
- Monitoring and Alerts:
  - Monitoring Systems: Implement monitoring systems to track backup processes. Monitor disk usage, error logs, and the completion status of scheduled backups.
  - Alerting Mechanisms: Set up alerts for backup failures, storage capacity thresholds, and other critical events that might require immediate attention.

Answer 35

Documentation: Maintain thorough documentation of the backup and recovery procedures, including configurations and step-by-step guides. Training: Regularly train staff involved in backup and recovery processes to ensure they are familiar with procedures and best practices.

Answer 36

Test Restores: Periodically perform test restores from backup to ensure that data can be recovered successfully and within the required time frame. Data Integrity Checks: Perform checksums and data integrity checks post-recovery to ensure that the data is consistent and intact.
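
One simple form of integrity check is to record a checksum when a backup is written and compare it again before or after a restore. The sketch below does this with SHA-256 from Python's standard library; the file name and contents are hypothetical, and a real setup would store the recorded checksum separately from the backup file.

```python
import hashlib
import tempfile
from pathlib import Path


def file_sha256(path: Path, chunk_size: int = 1 << 20) -> str:
    """Compute a file's SHA-256 digest, reading in chunks to bound memory use."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


with tempfile.TemporaryDirectory() as tmp:
    backup = Path(tmp) / "backup.dump"  # hypothetical backup file
    backup.write_bytes(b"example backup contents")
    recorded = file_sha256(backup)  # checksum recorded at backup time

    intact = file_sha256(backup) == recorded  # unchanged file: check passes

    backup.write_bytes(b"example backup c0ntents")  # simulate corruption
    corrupted = file_sha256(backup) != recorded     # changed file: check fails

print(intact, corrupted)
```

A matching checksum shows only that the file is unchanged since it was written; a test restore is still needed to confirm that the backup itself is usable.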

Answer 37

Offsite and Onsite Storage: Store backups both onsite for quick recovery and offsite to protect against local disasters. Backup Storage Management: Monitor and manage storage space to ensure that backups do not fail due to insufficient disk space.

(61 cards)