perf(make_idempotent): introduce row lock to improve 2PC for idempotent atomic writes by empiredan · Pull Request #2214 · apache/incubator-pegasus

empiredan · 2025-03-19T04:30:07Z

Previously in #2198, we implemented
idempotence for each atomic write by blocking the entire mutation queue until the 2PC
pipeline was drained. However, this significantly affects performance since the pipeline
is stalled and all write requests get stuck in the mutation queue.

To address this performance issue, we introduce a row-lock mechanism: each hash key
and the highest decree currently in the 2PC phase are recorded in a hash table. For each
atomic write request, if the maximum decree associated with its hash key has not yet been
applied to the storage engine, the request is blocked in the mutation queue. Otherwise,
the hash key is considered unlocked and the request can proceed into the 2PC phase at
any time.

To avoid the performance overhead of deserialization, we directly use the partition_hash
(an unsigned 64-bit integer) from the client instead of the hash key. This also makes memory
usage more predictable, as the partition_hash has a fixed size.

Additionally, to mitigate the performance impact caused by frequent insertions and deletions
in the row-lock hash table, we introduce an LRU strategy: keys are only evicted when the hash
table exceeds a certain size threshold, and only the least recently used keys with no active
usage are removed.

Give a concrete example to illustrate how atomic write requests are handled after introducing
row locks. Suppose a client issues an incr request to a primary replica. If the primary replica
has been configured to make all atomic write requests idempotent, then:

A mutation will be created as a blocking candidate to hold this atomic write request and then
appended to the mutation queue.
This mutation will be blocked and cannot get popped once the hash key contained in it is
locked(i.e. the maximum decree associated with the hash key has not been applied to the
storage engine).
This mutation can get popped only after its hash key becomes unlocked.
Popped from the mutation queue, the current base value 100 is read from the storage
engine, and create a single put request to store the final value 101.
Another mutation is then created to hold this idempotent single put request.
Subsequently the new mutation enters 2PC phase, appended to plog and broadcast to
secondary replicas.

…e performance

empiredan added 3 commits March 18, 2025 23:20

perf(make_idempotent): introduce row lock for each hash key to improv…

134097b

…e performance

format and fix compilation

e1b8f47

refactor

c7d10c8

github-actions Bot added cpp build labels Mar 19, 2025

empiredan mentioned this pull request Mar 19, 2025

Support making atomic write requests idempotent #2197

Open

15 tasks

fix clang-tidy and IWYU

80c8f92

github-actions Bot added the scripts label Mar 19, 2025

empiredan added 4 commits March 19, 2025 17:40

fix

43c5c10

fix clang-tidy and IWYU

bb7046f

fix tests

b08a894

fix core dump for absl::flat_hash_map ASan

d1dcf6b

github-actions Bot added the thirdparty label Mar 20, 2025

empiredan added 2 commits March 20, 2025 12:39

add absl_sanitize_address.h

d6a4af7

debug for asan

39a2a31

github-actions Bot added the github label Mar 20, 2025

empiredan added 2 commits March 20, 2025 15:27

remove absl_sanitize_address.h

08227df

turn to boost::unordered_map

65bc30a

github-actions Bot removed the github label Mar 20, 2025

remove absl::flat_hash_map from cmake and format thirdparty cmake

b0d8948

github-actions Bot removed the thirdparty label Mar 20, 2025

empiredan added 5 commits March 21, 2025 15:20

introduce boost::unordered_flat_map

56744f2

fix IWYU and add comments for mutation.h

bf78481

add comments

5ee5595

add comments

34419cc

fix dorny/paths-filter

6b0e49b

github-actions Bot added the github label Apr 2, 2025

empiredan added 3 commits April 2, 2025 20:11

upgrade dorny/paths-filter to 3.0.2

f95803a

use last decree and lru

11ee395

refactor and add comments for lru

ee588ea

empiredan added 3 commits April 8, 2025 19:01

fix IWYU and refactor

bb7a732

fix clang-tidy

09731b9

fix IWYU

e40e89e

empiredan marked this pull request as ready for review April 9, 2025 03:06

acelyc111 approved these changes Apr 13, 2025

View reviewed changes

foreverneverer approved these changes Apr 14, 2025

View reviewed changes

empiredan merged commit cebbfae into apache:master Apr 14, 2025
82 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf(make_idempotent): introduce row lock to improve 2PC for idempotent atomic writes#2214

perf(make_idempotent): introduce row lock to improve 2PC for idempotent atomic writes#2214
empiredan merged 24 commits intoapache:masterfrom
empiredan:idempotent-rowlock

empiredan commented Mar 19, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

empiredan commented Mar 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

empiredan commented Mar 19, 2025 •

edited

Loading