Add a metric to capture write enqueue latency by jasonk000 · Pull Request #193 · Netflix/EVCache

jasonk000 · 2026-05-14T17:21:25Z

This captures from when the caller tries to send a request until the request is finally sent on the wire, primarily so we can see when the enqueue time is increasing (a signal for pressure on the IO loop).

akhaku

technically not sent on the wire but passed to the non-blocking socket, right?

Generally LGTM

akhaku · 2026-05-15T21:23:50Z

+            if (wc <= 0L || wc < operationAttachedNs) return;
+            loopEnqueueToWriteLatency.record(wc - operationAttachedNs, TimeUnit.NANOSECONDS);
+        } catch (Throwable t) {
+            if (log.isDebugEnabled()) log.debug("recordLoopEnqueueToWriteLatency failed", t);


we don't really need the isDebugEnabled check here right since the expression in log.debug is cheap

Yes. Updated the comments to make that more clear, and dropped that catch there since the onyl thing we have is a record() call. -> a6d34e2

This captures from when the caller tries to send a request until the request is finally sent on the wire, primarily so we can see when the enqueue time is increasing (a signal for pressure on the IO loop).

jasonk000 · 2026-06-26T22:18:34Z

I think this needs a little work. Even though per-op latency is OK, when we multiply it by 100x for a 100-element bulk call the latency can stack up to 10's of microseconds. Not a lot, but, if we can reorder the PR i think we can do a bit better.

(TODO note for myself - this code should, instead of writing directly to metrics, accumulate metrics in a TimerBatchUpdater). Likely this batching can be applied to other evcache batch query metrics too.

akhaku approved these changes May 15, 2026

View reviewed changes

jasonk000 marked this pull request as ready for review May 18, 2026 19:24

jasonk000 added 2 commits June 26, 2026 21:11

Add a metric to capture write enqueue latency

df88ad1

This captures from when the caller tries to send a request until the request is finally sent on the wire, primarily so we can see when the enqueue time is increasing (a signal for pressure on the IO loop).

fix: flakey test and improve comments

e6f6073

jasonk000 force-pushed the jkoch/loop-enqueue-to-write-latency-metric branch from a6d34e2 to e6f6073 Compare June 26, 2026 21:12

jasonk000 marked this pull request as draft June 29, 2026 16:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add a metric to capture write enqueue latency#193

Add a metric to capture write enqueue latency#193
jasonk000 wants to merge 2 commits into
jkoch/loop-cpu-utilization-metricfrom
jkoch/loop-enqueue-to-write-latency-metric

jasonk000 commented May 14, 2026

Uh oh!

akhaku left a comment

Uh oh!

akhaku May 15, 2026

Uh oh!

jasonk000 May 18, 2026 •

edited

Loading

Uh oh!

jasonk000 commented Jun 26, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

jasonk000 commented May 14, 2026

Uh oh!

akhaku left a comment

Choose a reason for hiding this comment

Uh oh!

akhaku May 15, 2026

Choose a reason for hiding this comment

Uh oh!

jasonk000 May 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jasonk000 commented Jun 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

jasonk000 May 18, 2026 •

edited

Loading

jasonk000 commented Jun 26, 2026 •

edited

Loading