Performance Tuning

Introduction

Ehcache is fast. Really fast. It was thought and made to be. For example, a straight get(key) from the cache should be under 500ns. That’s fast and way sufficient for most purpose.

Keep that in mind. Especially when comparing caching frameworks with a benchmark. If something looks 10% faster it actually means 50ns faster. Do you care that much about these 50ns to lose your precious time in benchmarking?

That said, the way you configure Ehcache can have an impact on performance. This document is a work in progress. It will give you the performance impact of classical Ehcache configuration. It will also give you some advance tuning possibilities.

In the future, we will add some figures of what "slower" means. However, always do your own benchmark.

Stores

We probably know that the fastest store is on-heap. Until you overwhelm the garbage collector.

Your next best bet is off-heap.

Try to avoid disk. Use a remote drive or even an HDD at your own risk.

We won’t talk about clustering here because it’s a different realm and its performance is based on many factors.

The next question would be: "Should I use a single tier?" Is using a single-tier off-heap faster than two-tiers? The answer depends on what you do with it. Having two tiers is a bit slower on writing. It is also a bit slower on reading when the data is not found in the caching tier (on-heap). However, it will be faster for an entry that is indeed found in a higher tier.

So again, it depends. The more you follow the caching hypothesis that the same data is always reused (and so in the caching tier), the more interesting having two or more tiers will be. As a general rule of thumb, a tier that has a hit ratio of 50% or less is going to slow down the cache. Either enlarge such tier or completely get rid of it.

Byte Sizing

A on-heap tier can be limited to a number of entries or a number of bytes. When using bytes, we need to calculate the size of every object added to the cache. This is of course much slower than calculating the number of entries.

Size calculation is done using the SizeOf library. This library uses multiple magic tricks to do so. It selects the fastest one for a given environment. Make sure of what is used to confirm you can’t use a faster way on your platform.

Serialization

Off-heap, disk and clustering need to serialize keys and values before storing them. By default, custom super-fast serializers are used for Long, Integer, Character, Double, Float, byte[] and String objects. A slightly faster, customized Java serialization is used for other object types. It is well-known for not being the fastest thing around. Ehcache uses it because it is supported out of the box. However, you can increase performance by providing your own serializers.

Copier

By default, on-heap storage stores the entries by reference. You might want to use a copier to store entries by value for whatever reason. Out of the box, Ehcache bundles a copier that makes use of its serialization mechanism: org.ehcache.impl.copy.SerializingCopier. This can be much slower so watch out.

Loader-Writer

Loader-writer is interesting for many reasons. First, it protects you against the Thundering Herd. However, it needs to pass through more complicated code to do so.

We are expecting it to be a tiny bit slower. But nothing noticeable enough to prevent you from using it.

Expiration

A cache with no expiration will always be faster.

Time to Live

If you need to set an expiration time, TTL will be the faster one. This is because the expiration time of an entry is calculated and updated ony when the entry is inserted or updated in the cache. But it still requires an expiration check at access time.

So you can expect a 2% drop in performance when using TTL.

Time to Idle

TTI is slower than TTL. We need to recalculate and update the expiration time each time the entry is accessed.

Custom

In general, using a custom ExpiryPolicy will be the slowest. Ehcache had optimised the handling to handle other cases. When using a custom policy, you are on your own.

Allocation rate

Ehcache won’t allocate any object during a simple on-heap get(). However, keep in mind that your configuration might do so.

For instance, let’s say you define an expiry policy like this.

new ExpiryPolicy<Object, Object>() {
  @Override
  public Duration getExpiryForCreation(Object key, Object value) {
    return null;
  }

  @Override
  public Duration getExpiryForAccess(Object key, Supplier<?> value) {
    return Duration.ofSeconds(10); (1)
  }

  @Override
  public Duration getExpiryForUpdate(Object key, Supplier<?> oldValue, Object newValue) {
    return null;
  }
};

1	Will instantiate a `Duration` every time an entry is accessed

In this case, putting the Duration as a constant would solve the problem.

Time Source

By default, Ehcache uses a TimeSource that will retrieve the system time at every call. It is fast but not super duper fast. But it is super duper accurate.

You can trade accuracy for speed by using a TickingTimeSource. Please read the javadoc for details but the concept is that a timer will increase time instead of always retrieving system time.

Switching to TickingTimeSource, even with a granularity of 1ms, can improve the performance of a get as high as 30%.

The drawback is that a timer will continuously run. Also, time might drift from the real time a bit. Especially if the granularity of the systemUpdatePeriod is big. Is your expiration needs to be really tightly linked with real time, it can be a problem. But in most cases, the drifting doesn’t matter much.