memcache

An in-memory cache with expiration and eviction policies.

Introduction

MemCache is a fast in-memory cache that provides a mapping of arbitrary keys to values.

Caches act as temporary holding areas for frequently used data, bridging the gap between main storage (e.g., a hard drive or application database) and processing units (e.g., the CPU) or clients (e.g., web browsers making requests to servers). By keeping frequently needed information closer to where it is consumed, a cache makes that information much faster to access. Caching occurs at multiple levels between client machines and servers to optimize resource delivery.

(Figure: caching at the network, proxy, and application levels)

Author: Sudip Bose ([email protected])

Features

  • Basic cache operations, i.e. put(key, value), get(key), remove(key).
  • All cache operations are atomic.
  • Efficient cache replacement algorithm.
  • Time-To-Live (TTL) for cache entries.
  • Supports heterogeneous data types.
  • Cache resizing and dynamic capacity management.
  • Automatic data compression to improve memory usage.
  • Users can create a transient snapshot to get a consistent view of data.
  • Data is stored sorted by key to provide an ordered mapping service.
  • Users can provide a custom comparison function to override the sort order.

Cache eviction algorithms and TTL optimize cache usage by removing inactive data, ensuring crucial information is readily available for faster application execution. These algorithms prioritize active data within limited cache space, leading to significant performance improvements in various software systems.
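The sketch below illustrates how these operations might look from calling code. It is a hypothetical sketch: the MemCache template name, the header path, the constructor, and the TTL overload are assumptions rather than the verified API, so check the project headers for the actual signatures.

#include <iostream>
#include <string>

#include "memcache.h" // assumed header name

int main() {
    // Assumed constructor: a capacity-bounded cache of string keys and values.
    MemCache<std::string, std::string> cache(1024);

    cache.put("user:42", "alice");                  // basic insertion
    cache.put("session:7", "token", /*ttl_s=*/30);  // assumed TTL overload
    std::cout << cache.get("user:42") << "\n";      // retrieval
    cache.remove("session:7");                      // explicit removal
    return 0;
}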

Strategy and Considerations

Cache eviction policies vary widely (see here). For this cache, we’re implementing LFRU (Least Frequently Recently Used), a hybrid of LFU (Least Frequently Used) and LRU (Least Recently Used).

Implementing LFRU in real cache systems is complex due to cache partitioning into privileged and unprivileged sections. Popular content moves to the privileged section. When full, LFRU evicts from the unprivileged section, shifts content between partitions, and inserts new content into the privileged section. LRU manages the privileged partition, while an approximated LFU (ALFU) handles the unprivileged section.

Our cache avoids multiple partitions. It is designed to access the least-frequently-used and least-recently-used keys in O(1) time: if multiple keys share the lowest access frequency, the least recently used among them is evicted.

Before 2010, LFU policies were typically built on data structures like heaps and hash tables, resulting in O(log n) operations. These weren't scalable for modern needs, like internet proxy servers handling thousands of requests per second. As the number of internet users grew from 1.97 billion in 2010 to more than 5 billion in 2024, efficient scalability became crucial. The linked whitepaper introduces a strategy for O(1) LFU operations.

Cache Replacement Algorithm Overview

The constant O(1) runtime is achieved by combining a nested doubly linked list with a hash table that locates entries by key.

The outer doubly linked list links together frequency nodes (squares), one for each access frequency currently present in the cache. Each frequency node owns an inner doubly linked list of entry nodes (circles) that share that access frequency. Every entry node keeps a pointer to its parent frequency node, so frequencies can be reached directly during updates. A separate hash map stores each key and maps it to its entry node, its value, and other metadata dictated by implementation choices. The LFU key lives under the left-most square, and the LRU key among its entries is the bottom-most circle of that square.

(Figure: the nested doubly linked list, with frequency nodes as squares and entry nodes as circles)

This approach offers efficient cache eviction in constant time, making it suitable for high-throughput systems such as proxy servers and CDNs.
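
As a concrete illustration, here is a compact C++ sketch of the same technique, using hash-indexed frequency buckets in place of the outer linked list. It shows only the eviction bookkeeping and is not the library's actual implementation, which follows the nested-list layout described above.

#include <list>
#include <string>
#include <unordered_map>
#include <utility>

// Sketch of an O(1) LFU index with LRU tie-breaking (illustration only).
class LfruIndex {
    // frequency -> keys at that frequency, most recently used at the front
    std::unordered_map<long, std::list<std::string>> buckets;
    // key -> (its frequency, its position inside that bucket)
    std::unordered_map<std::string,
        std::pair<long, std::list<std::string>::iterator>> table;
    long minFreq = 1;

public:
    void insert(const std::string& key) {
        buckets[1].push_front(key);
        table[key] = {1, buckets[1].begin()};
        minFreq = 1; // a fresh key always has the lowest frequency
    }

    // Record an access: move the key to the next frequency bucket in O(1).
    void touch(const std::string& key) {
        auto it = table.find(key);
        if (it == table.end()) return;
        auto [freq, pos] = it->second;
        buckets[freq].erase(pos);
        if (buckets[freq].empty()) {
            buckets.erase(freq);
            if (minFreq == freq) minFreq = freq + 1;
        }
        buckets[freq + 1].push_front(key);
        it->second = {freq + 1, buckets[freq + 1].begin()};
    }

    // Evict the least-frequent key; among ties, the least recently used.
    std::string evict() {
        if (table.empty()) return {};
        // minFreq can go stale after an eviction; advance it lazily.
        while (buckets.find(minFreq) == buckets.end()) ++minFreq;
        auto& bucket = buckets[minFreq];
        std::string victim = bucket.back(); // back = least recently used
        bucket.pop_back();
        if (bucket.empty()) buckets.erase(minFreq);
        table.erase(victim);
        return victim;
    }
};

A full cache would also store each entry's value and TTL metadata alongside this index; only the replacement bookkeeping is sketched here.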

Build Process

Generate build files

$ mkdir -p build && cd build
$ cmake ..

Compile the project

$ make     

Running tests

Compiling the project generates the test executables automatically.

$ ./runTests 
[==========] Running 1 test from 1 test suite.
[----------] Global test environment set-up.
[----------] 1 test from MemCacheTest
[ RUN      ] MemCacheTest.InsertionAndRetrieval
[       OK ] MemCacheTest.InsertionAndRetrieval (0 ms)
[----------] 1 test from MemCacheTest (0 ms total)

[----------] Global test environment tear-down
[==========] 1 test from 1 test suite ran. (0 ms total)
[  PASSED  ] 1 test.
$

To rerun the tests:

$ make clean && make 
$ ./runTests

See here for more on make commands.

Getting the Source

$ git clone --recurse-submodules https://github.com/s-bose7/memcache.git

Performance

Here is a performance report from a run of the included cache_benchmark program. The numbers give a ballpark estimate.

Setup

The benchmark inserts one million entries, each with a 16-byte key and a 100-byte value; the cache size and raw data size are listed in the report below.

memcache:	version 1.1 (Latest)
Date:		Sun Jun 16 21:35:16 2024
CPU:		Intel(R) Core(TM) i3-6006U CPU @ 2.00GHz
RAM:		2 GB (Available)
Keys:		16 bytes each
Values:		100 bytes each
Entries:	1000000
Cache Size:	1000000
Raw Size:	~206.90 MB

Write Performance

writeseq: 	3.37372 micros/op; (approximately 296408 writes per second)
writerandom: 	
overwrite:   	2.23422 micros/op;

Read Performance

readseq: 	1.71238 micros/op; (approximately 583981 reads per second)
readrandom: 	

Limitations

  • No persistence to disk.
  • Only a single process (possibly multi-threaded) can access the cache at a time.
  • Callers cannot query data by parameters, e.g. time, max, min.
  • Not distributed, and consequently no horizontal scaling.
  • No client-server support; applications that need such support will have to
    wrap their own server around the library.