low latency Flashcards

Question

What is CAP theroem

Answer 1

http://ksat.me/a-plain-english-introduction-to-cap-theorem In a system which essentially does a read-write operation and is distributed ( so several node separated by network and heterogenous nodes ) . The network in this case is asynchronous that is the nodes can get disconnected( aka partitoned) you always have to choose between consistency and availability Consistency : When a read is done it will always get the value of the last write . or an Atomic read. If the system cant provide you the data it will respond with an error ( which contradicts Availability) Availability : The System is always available and you will always get a response . The response may not be the most updated data ( since that data is on another node and due to partition(that node being disconnected from you) has not reached you . You choose to respond in all cases. C is important Banking and Financial Systems Explanation: Banking systems must ensure that all transactions are consistently recorded. For instance, if a bank transaction is processed (e.g., a transfer of funds), it’s critical that the most recent account balance is reflected in all subsequent reads across the system. Partition Handling: During a network partition, such a system would rather deny access or delay responses than return outdated data, ensuring that the integrity of data is maintained. Drawback: This approach may impact availability if the system needs to enforce consistency during network partitions, potentially blocking operations until partition issues are resolved Another example Inventory Management in E-commerce Explanation: Real-time inventory management systems need to ensure that products are not oversold. When an item is sold, it’s important that the inventory count is consistent across the entire system to prevent selling more items than available. Systems Prioritizing Availability (A > C) When uninterrupted service is critical, even at the cost of possibly returning slightly stale data, systems prioritize Availability over Consistency. Social media platforms like Twitter, Instagram, or Facebook focus on providing fast, real-time access to updates and notifications, allowing users to continue interacting with the platform even if some data might not reflect the latest update. Why Availability?: In this context, it’s better for users to have some data available (even if slightly stale) than to block access until the latest updates are fully synchronized. Partition Handling: During a network partition, each partition may operate independently, allowing users to continue reading and posting content with the expectation that the system will eventually achieve consistency. Another A Caching Systems (e.g., CDN for Video Streaming) Explanation: Content delivery networks (CDNs) prioritize availability, as their primary goal is to serve content quickly to users worldwide. Data is often cached across distributed servers to minimize latency. Why Availability?: Video streaming platforms like Netflix, YouTube, or Hulu cache popular content at various edge locations to ensure fast access, even if the data is not always the absolute latest (e.g., metadata like the number of views might lag slightly).

Answer 2

dont provide setters for any field Make all fields final and private. they are initialized by the constructor or before hand. Make the class Final so subclassed cant over ride this. Better way is to make the constructor private and construct instances in factory methods. If the instance fields include references to mutable objects, don't allow those objects to be changed: Don't provide methods that modify the mutable objects. Don't share references to the mutable objects. Never store references to external, mutable objects passed to the constructor; if necessary, create copies, and store references to the copies. Similarly, create copies of your internal mutable objects when necessary to avoid returning the originals in your methods.

Answer 3

These objects can be used as Key for Maps and be stored in Sets correctly If not immutable . the hash value for a mutable object may change and finding them in map or set will fail leading to new objects being added to the collection Immutable objects are threadsafe because their state never changes and they make for much more predictable code behavior . easier to read and understand the code.

Answer 4

Practical Use Cases for Immutable Objects in Multi-Threading Shared Configuration Data: As shown in the example above, immutable objects are ideal for storing application-wide configuration data shared across threads. Caching: Immutable objects are often used in caching scenarios where multiple threads read cached data without needing synchronization. Message Passing: In producer-consumer systems or message queues, immutable objects ensure that messages passed between threads cannot be modified. Functional Programming: Immutable objects are foundational in functional programming paradigms (e.g., Java Streams API), where operations like mapping and filtering do not modify the original data but create new instances.

Answer 5

Garbage Collection (GC): The introduction of ZGC in Java 11 and G1GC enhancements in Java 17 have made GC pauses extremely low and consistent, which is a key factor for low latency applications. ZGC is designed specifically for low latency, scalability, and ease of use, allowing a Java application to continue running while it performs all GC operations except thread stack scanning. Immutable Objects: Making objects immutable can help reduce the amount of synchronization required and can improve performance, especially in highly concurrent applications. Concurrency Utilities: Java 5 introduced several concurrency utilities and locks that can help improve performance in multi-threaded applications. Examples include the Lock interface, CountDownLatch, and Semaphore classes. Profiling: Using a profiler can help identify performance bottlenecks and optimize code for low latency. Tools like JProfiler, YourKit, and VisualVM can provide insights into CPU usage, memory usage, and thread behavior. Amdahl's Law: Amdahl's Law can be used to identify the sequential execution path in a program and optimize it for performance. By improving the performance of the critical path, overall performance can be improved. Warmup: JVM warmup can be used to improve the performance of Java applications by allowing the JIT compiler to optimize code before it is executed. This can reduce the time required for JIT compilation during runtime and improve overall performance.

Answer 6

is a scalable low latency garbage collector ZGC performs all expensive work concurrently, without stopping the execution of application threads for more than a few milliseconds .. under milleseconds in some cases Pause times are independent of heap size that is being used works well for most heap size ( 8 MB to 16 TB) . The most important tuning option for ZGC is setting the max heap size (-Xmx) ZGC is a concurrent collector think live set ( currently active objects ) and the object allocation rate ( new objects being created) The second tuning option one might want to look at is setting the number of concurrent GC threads (-XX:ConcGCThreads) ZGC has heuristics to automatically select this number ZGC uncommits unused memory, returning it to the operating system. This is useful for applications and environments where memory footprint is a concern

Answer 7

Garbage 1st collector It divides the entire heap into regions . say of 1 MB each . Any region maybe used for eden/ survivor or old gen The marking of live objects happens concurrently with the running application (there are slight pauses when they start ) The evacuation or collection of regions is done by other threads which run parallel with the application ( they do cause a pause) The evacuation will pick regions with the highest garbage and move the live objects out to another region ( young -> survivor ) or survivor o old or old to old The region that got cleaned up is now made available for any assignment The tuning is in terms of specifying the maxGCpausetime in milliseconds The GC is adaptive to see how many regions it can clean up while staying within the gc pause time limit. Its not always guaranteed but it tries Z1GC does provide balanced low latency ( 200 ms to seconds) and high throughput.. Its quite balanced. Tuning can be done to the number of concurrent threads doing marking and parallel threads doing evacuation It has a young phase collection ( young to old) .. more tuned to faster runtime Mixed Collection : Old and new .. more tuned to clearing up maximum memory. does have small pause. There is also the full GC that can happen when there isnt space to allocate new objects .. like an allocation failure

Answer 8

[Eden: 3072.0K(194.0M)->0.0B(201.0M) Survivors: 0.0B->0.0B Heap: 3727.1M(4022.0M)->3612.0M(4022.0M)], [Metaspace: 2776K->2776K(1056768K 2015-09-14T12:32:24.398-0700: 0.356 – Here 2015-09-14T12:32:24.398-0700indicates the time at which this GC event fired. Here 0.356 indicates that 356 milliseconds after the Java process was started this GC event was fired. GC pause (G1 Evacuation Pause) — Evacuation Pause is a phase where live objects are copied from one region (young or young + old) to another region. (young) – indicates that this is a Young GC event. GC Workers: 8 – indicates the number of GC worker threads. Times: user=0.08, sys=0.00, real=0.02 secs – here note the ‘real’ time, it indicates this GC event took 0.02 seconds to complete. If you are wondering what is the purpose of ‘user’ and ‘sys’ times, please refer to this article.

Answer 9

You can cache Objects and return existing objects rather than creating one each time. For example all a-b and A-Z Character are already cached . public class CharacterCacheExample { public static void main(String[] args) { Character c1 = Character.valueOf('A'); // Cached Character c2 = Character.valueOf('A'); // Same cached instance Same for Boolean Boolean.TRUE and Boolean.FALSE are predefined static instances of Boolean objects. If b is true, it returns Boolean.TRUE (a cached instance). If b is false, it returns Boolean.FALSE (a cached instance

Answer 10

Builder pattern is a good choice when designing classes whose constructors or static factories would have more than a handful of parameters, especially if many of the parameters are optional or of identical type. Client code is much easier to read and write with builders than with telescoping constructors, and builders are much safer than JavaBeans.

Answer 11

Operating systems limit the number of open file descriptors per process (e.g., 1024 on many Linux systems). If too many files are left open (due to delayed GC cleanup), attempts to open new files will fail with errors like: Open files consume memory and system resources. Not closing files immediately can lead to memory leaks and degraded performance.

Answer 12

The single most important factor that distinguishes a well-designed component from a poorly designed one is the degree to which the component hides its internal data and other implementation details from other components. A well-designed component hides all its implementation details, cleanly separating its API from its implementation. Components then communicate only through their APIs and are oblivious to each others’ inner workings.

low latency Flashcards

(36 cards)