`
leonzhx
  • 浏览: 810292 次
  • 性别: Icon_minigender_1
  • 来自: 上海
社区版块
存档分类
最新评论

Parallel and concurrent garbage collectors

    博客分类:
  • JVM
阅读更多

http://chaoticjava.com/posts/parallel-and-concurrent-garbage-collectors/

In the last post, I've been talking about the basics of garbage collection and the generational garbage collector. While the generational garbage collector brought huge performance benefits by getting the large, old generation memory area to be infrequently visited by the collector, it still wasn't enough for the new era of faster processors and memory sizes which spawned larger applications with multiple threads running concurrently, creating loads of garbage. The original generational collector was single threaded, and was called the serial collector. A parallel collector was required, and since the young and old generation memory spaces are different in the way the objects residing in them are used, so are the collectors implemented for them.

Since explaining multi-threaded implementations is difficult, and more-so when it's with the topic of garbage collection, I've used illustrations that depict the state of a one or two threads at a time; in reality, the amount of threads running is equal to the number of processors the machine has, which can easily be a two-digit number. I hope the explanations are clear enough, and as always questions are welcomed!

 

Parallel scavenge

The parallel implementation for the young generation memory space is simple and is as follows: the root objects are divided among the garbage collection threads, making the marking phase much shorter. As soon as an object is found to be live, it is copied to the To area immediately (or to the old generation space, if he survived enough iterations). This can be accomplished by allocating a segment in the To area per thread, allowing the different threads to copy objects to the To area without the need for thread synchronization. Since allocation is done in Eden anyway, the To area is allowed to be a bit fragmented without causing harm.

The following image illustrates how a single thread claims a single object. Obviously, in reality all threads allocated for the collection task are sweeping in a non-synchronized way, unaware of each other's work.



 

Parallel compacting collector

The implementation for the old generation memory space is much more complex. Remember that this memory is significantly larger and is occupied by objects that are less likely to be garbage. This operation is a three steps procedure: first, the memory space is divided into regions. Then, the parallel marking starts, with the root objects being divided among collector threads. When a live object is encountered, the region it is located in is updated so that at the end of the process, each region contains information stating how much live data is available in it, and where live objects are located in it.

After that‘s done, the summary step comes along where each region is examined for its density – the amount of live bytes compared to the total bytes in the region. An assumption is now made: since we know that previous collections had been made which compacts the bytes “to the left”, and we know that new objects moved into the old generation are allocated on its “right” side, we can assume that the left side of the memory is filled with older objects than the right side, and in respect with the generational collection idea, the more “to the left” an object is the less likely for it to be dead. The garbage collection uses this assumption for performance optimization: starting at the left most region it looks for the first region with a density which is worthwhile for collection. All regions to the left of that region are not collected and are called the dense prefix.

In addition, during summary the collector can tell what amount of space is going to remain in each region after its compacting so it can already choose which regions are going to be filling other regions (typically, regions on the “right” are filling regions on the “left”). It can also note for each region which regions will be filling it, and for each filling region where it’s going to be located after sweeping, or where the first live byte is going to be located.

Now the sweeping step itself starts. Regions that can be collected without synchronization with other threads are being divided between collector threads. These are regions that do not fill other regions, so they‘re compacted into themselves, or regions that are already completely empty already and can just be claimed by the collector. After a thread finishes claiming a region, it immediately starts filling that region from the regions destined to fill it – this information was determined previously by the summary step. Since it knows no other thread is reading from that source region or writing to the destination region (as the thread is in charge of both) it can make the filling and the claiming without need of synchronization.

Note that in the following image the collection occurs over only 60% of the heap (making the collection twice as fast as if it would collect everything), and reclaiming 82% of the garbage. That means that in order to collect the additional 18%, the garbage collector would have to work just as hard as it did for the first 82%! Obviously, these are not precise calculations, but surely the point is made: this is the reason the dense prefix is an important optimization step.



 

Concurrent mark-sweep collector

While both prior collectors mentioned are parallel and take a lot less time to operate, they still stop the application threads while working. Some applications might have a different requirement though: they might require a more deterministic Java, and are willing to sacrifice some performance in order to get a shorter to non-existent waiting times caused by the garbage collector. For those applications the Java VM supplies the Concurrent Mark-Sweep garbage collector.

This collector uses four steps: first, the initial mark step stops the application for a very short period of time while a single collection thread marks the first level of objects directly connected to the root objects. After that, the application continues to work normally while the collector performs a concurrent mark, marking the rest of the objects while the application keeps running. Because the application might change references to the object graph by the time the concurrent marking is finished, the collector performs an additional step: the remarking, which stops the application for another brief period. When the application changes an object it Is marked as changed. During the remarking step the application checks only those changed objects, and to make things faster it distributes the load between multiple threads. Finally, the collector performs a concurrent sweep which does not compact the memory area in order to save time and operates concurrently with the application threads.

Since the memory area is not compacted, the memory allocator is different in two ways: first, it needs to keep multiple pointers to free memory blocks categorized by their size to allow allocation within the fragments of free memory. Second, it keeps count of the requested lengths to determine the popular object sizes within the application; then, it uses this information to split and merge the free block fragments to match future requirements by the application, assuming that the popular sizes will continue to be popular and often requested.

Final words

This covers everything I’ve managed to gather about the way garbage collectors work. Note that for some applications (typically small, client-side ones) the parallel collectors are not the best. The Java VM selects which collectors to use automatically when its loading according to the properties of the machine and operating system it‘s running on. You can read all about it on Sun’s Java VM ergonomics document.

  • 大小: 76.9 KB
  • 大小: 112.7 KB
分享到:
评论

相关推荐

    细述 Java垃圾回收机制→Types of Java Garbage Collectors 1

    3. CMS(Concurrent Mark Sweep)Garbage Collector:此垃圾回收器旨在减少应用程序的暂停时间,适合响应时间敏感的应用。CMS使用多线程并发地进行大部分垃圾回收工作,仅在特定阶段暂停应用程序。它在处理大量数据...

    openjdk8u60+jvm jdk源码+jvm源码

    2. **垃圾收集器(Garbage Collectors)**:OpenJDK 8 包含了几种不同的垃圾收集器,如 Serial GC、Parallel GC、CMS (Concurrent Mark Sweep) 和 G1 (Garbage-First)。每种GC策略都有其适用场景,通过调整这些参数...

    对象分配过程详解、常用垃圾回收器

    常见的垃圾回收器有Serial GC、Parallel GC(也称为Throughput GC)、CMS GC(Concurrent Mark Sweep GC)和G1 GC(Garbage-First GC)。Serial GC使用单线程进行垃圾回收,适用于单核处理器或者小内存的环境。...

    JDK10(JDK10底层C++源码及hotspot虚拟机源码)

    2. **垃圾收集器(Garbage Collectors)**:HotSpot支持多种GC策略,如Parallel GC、Concurrent Mark Sweep (CMS) 和G1。JDK10的G1 Full GC并行化是HotSpot的一个重要改进。 3. **内存模型(Memory Model)**:...

    HotSpot GC官网文档截图 - 20200917

    3. **CMS(Concurrent Mark Sweep)收集器** (8 Concurrent Mark Sweep (CMS) Collector .png) - CMS收集器是一种低暂停时间的垃圾收集策略,它在大部分时间与应用程序线程并发运行,只有在最后的整理阶段会短暂...

    JVM详解与学习

    - **跟踪收集器 (Tracing Garbage Collectors)**:基于可达性分析来确定哪些对象不再被引用,从而进行回收。 ##### 3.3 JVM的垃圾收集策略 Sun JVM支持多种垃圾收集策略,包括: - **串行收集器 (Serial Collector)...

Global site tag (gtag.js) - Google Analytics