fix: accumulate agentstats until reported and fix insights DAU offset #15832

mafredri · 2024-12-11T15:32:02Z

This PR addresses a flake in TestDeploymentInsights caused by missing agent network stats. It also corrects the assumption that agent network stats should be discarded rather than accumulated when reporting can't keep up. Without accumulation, we risk losing data.

Fixes coder/internal#259

Fixes #15824

mafredri · 2024-12-11T15:33:36Z

coderd/insights.go

@@ -89,7 +89,7 @@ func (api *API) returnDAUsInternal(rw http.ResponseWriter, r *http.Request, temp
 	}
 	for _, row := range rows {
 		resp.Entries = append(resp.Entries, codersdk.DAUEntry{
-			Date:   row.StartTime.Format(time.DateOnly),
+			Date:   row.StartTime.In(loc).Format(time.DateOnly),


Review: Drive-by fix: the date could be off by one day depending on the timezone.
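To illustrate the off-by-one, here is a minimal sketch using only the standard `time` package. The helper names `dauDate` and `demo` and the fixed UTC+9 zone are assumptions made for illustration; they are not taken from `coderd/insights.go`.

```go
package main

import (
	"fmt"
	"time"
)

// dauDate formats a bucket start time as a calendar date in loc.
// Hypothetical helper mirroring the row.StartTime.In(loc).Format(...) change.
func dauDate(start time.Time, loc *time.Location) string {
	return start.In(loc).Format(time.DateOnly)
}

// demo returns the same instant formatted without and with the location applied.
func demo() (utcDate, localDate string) {
	// Hypothetical zone nine hours east of UTC, chosen for illustration only.
	loc := time.FixedZone("UTC+9", 9*60*60)
	// Midnight on 2024-12-11 in that zone is 15:00 on 2024-12-10 in UTC.
	start := time.Date(2024, 12, 11, 0, 0, 0, 0, loc).UTC()
	return start.Format(time.DateOnly), dauDate(start, loc)
}

func main() {
	utcDate, localDate := demo()
	fmt.Println(utcDate)   // 2024-12-10
	fmt.Println(localDate) // 2024-12-11
}
```

A timestamp stored in UTC keeps its UTC calendar date when formatted directly, so a day bucket that starts at local midnight east of UTC shows up as the previous day unless it is converted with `In(loc)` first.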

mafredri · 2024-12-11T15:35:18Z

agent/stats.go

+	} else {
+		s.networkStats = maps.Clone(virtual)
+		s.unreported = true
+	}


Review: If the callback was called multiple times before a report was sent, we lost data, because each update is a snapshot covering only the period since the previous one.

This can happen if:

- The interval is short (tests)
- Report takes a long time

I believe the assumption was that "ConnStatsCallback" reports a realistic count for "now". What it actually returns is closer to an additive diff between this and the previous report. Thus, if two callbacks happen in quick succession, we're effectively zeroing the actual data.

Great catch!
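The accumulation described in this thread can be sketched as follows. This is a simplified illustration, not the code in `agent/stats.go`: the `counts` and `collector` types, the string connection key, and the `report` method are hypothetical stand-ins, and only the `networkStats` and `unreported` field names come from the diff. The sketch also omits locking, so it is not safe for concurrent use.

```go
package main

import (
	"fmt"
	"maps"
)

// counts is a hypothetical stand-in for netlogtype.Counts.
type counts struct {
	TxBytes, RxBytes uint64
}

// add returns the element-wise sum of two counts.
func (c counts) add(o counts) counts {
	return counts{TxBytes: c.TxBytes + o.TxBytes, RxBytes: c.RxBytes + o.RxBytes}
}

// collector is a hypothetical, simplified stats holder.
type collector struct {
	networkStats map[string]counts
	unreported   bool
}

// callback receives a snapshot that covers only the traffic since the
// previous callback, so unreported snapshots are summed, not replaced.
func (s *collector) callback(virtual map[string]counts) {
	if s.unreported {
		if s.networkStats == nil && virtual != nil {
			s.networkStats = make(map[string]counts, len(virtual))
		}
		for conn, c := range virtual {
			s.networkStats[conn] = s.networkStats[conn].add(c)
		}
		return
	}
	s.networkStats = maps.Clone(virtual)
	s.unreported = true
}

// report hands back the accumulated stats and resets the state.
// The reset behaviour here is an assumption made for this sketch.
func (s *collector) report() map[string]counts {
	out := s.networkStats
	s.networkStats = nil
	s.unreported = false
	return out
}

func main() {
	c := &collector{}
	// Two callbacks arrive before any report is sent.
	c.callback(map[string]counts{"a": {TxBytes: 10, RxBytes: 20}})
	c.callback(map[string]counts{"a": {TxBytes: 1, RxBytes: 2}, "b": {TxBytes: 5, RxBytes: 5}})
	got := c.report()
	fmt.Println(got["a"], got["b"]) // {11 22} {5 5}
}
```

If the second callback simply overwrote the map, connection "a" would be reported with only the second snapshot's small diff, which is the data loss the review comment describes.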

mafredri · 2024-12-11T15:36:44Z

coderd/insights_test.go

@@ -76,7 +86,7 @@ func TestDeploymentInsights(t *testing.T) {
 	workspace := coderdtest.CreateWorkspace(t, client, template.ID)
 	coderdtest.AwaitWorkspaceBuildJobCompleted(t, client, workspace.LatestBuild.ID)

-	ctx := testutil.Context(t, testutil.WaitLong)
+	ctx := testutil.Context(t, testutil.WaitSuperLong)


Review: In race mode, propagating the agent connection stats can take a while.

dannykopping

LGTM

dannykopping · 2024-12-11T16:46:28Z

agent/stats.go

+	// Accumulate stats until they've been reported.
+	if s.unreported {
+		if s.networkStats == nil && virtual != nil {
+			s.networkStats = make(map[netlogtype.Connection]netlogtype.Counts)


Nit: let's save some allocations.

Suggested change

-			s.networkStats = make(map[netlogtype.Connection]netlogtype.Counts)
+			s.networkStats = make(map[netlogtype.Connection]netlogtype.Counts, len(virtual))

I've never actually benchmarked how much of a difference a size hint makes for maps, especially ones that don't hold a lot of data. Is there a significant difference?

Your suggestion made me realize this had a better fix 😄.
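One way to answer the size-hint question empirically is to count allocations with `testing.AllocsPerRun` from the standard library. The sketch below is an assumption-laden illustration: the `fill` helper, the `int` key and value types, and the chosen sizes are hypothetical, and the results will vary with the Go version and map implementation, so no numbers are claimed here.

```go
package main

import (
	"fmt"
	"testing"
)

// fill builds a map with n entries, optionally passing n as a size hint.
// Hypothetical helper; the real map uses netlogtype keys and values.
func fill(n int, hint bool) map[int]int {
	var m map[int]int
	if hint {
		m = make(map[int]int, n)
	} else {
		m = make(map[int]int)
	}
	for i := 0; i < n; i++ {
		m[i] = i
	}
	return m
}

func main() {
	for _, n := range []int{4, 64, 1024} {
		// AllocsPerRun reports the average number of heap allocations per call.
		noHint := testing.AllocsPerRun(100, func() { _ = fill(n, false) })
		withHint := testing.AllocsPerRun(100, func() { _ = fill(n, true) })
		fmt.Printf("n=%d allocs/run: no hint=%.0f, hint=%.0f\n", n, noHint, withHint)
	}
}
```

For small maps the difference is likely to be modest, since a map without a hint only needs to grow a few times; a proper `go test -bench` benchmark with the real key and value types would be needed before drawing conclusions.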

dannykopping · 2024-12-11T16:47:20Z

agent/stats.go

+	} else {
+		s.networkStats = maps.Clone(virtual)
+		s.unreported = true
+	}


Great catch!

github-actions bot assigned mafredri Dec 11, 2024

fix: accumulate agentstats until reported and fix insights DAU offset

e97f3a9

Fixes #15824

mafredri force-pushed the mafredri-fix-agentstats-acc-and-dau-flake branch from a1757f0 to e97f3a9 Compare December 11, 2024 15:32

fix test

bd4ae11

mafredri changed the title ~~fix: accumulate agentstats until reported and fix insights DAU offset~~ fix: fix insights DAU offset by accumulating agentstats until reported Dec 11, 2024

mafredri changed the title ~~fix: fix insights DAU offset by accumulating agentstats until reported~~ fix: accumulate agentstats until reported and fix insights DAU offset Dec 11, 2024

mafredri marked this pull request as ready for review December 11, 2024 16:06

mafredri requested review from dannykopping and spikecurtis December 11, 2024 16:06

fix comment

53f3275

dannykopping approved these changes Dec 11, 2024

mafredri and others added 5 commits December 11, 2024 19:07

add comment from PR review

57fde09

refactor stats accumulation

7f7df65

Update coderd/insights_test.go

ff236f0

Co-authored-by: Danny Kopping <danny@coder.com>

fix format

08dea24

fix failf

3fc95da

dannykopping approved these changes Dec 12, 2024

Merge branch 'main' into mafredri-fix-agentstats-acc-and-dau-flake

02bafec

ethanndickson mentioned this pull request Dec 18, 2024

flake: TestDeploymentInsights coder/internal#237

Closed

mafredri merged commit 4c5b737 into main Dec 18, 2024
33 checks passed

mafredri deleted the mafredri-fix-agentstats-acc-and-dau-flake branch December 18, 2024 09:26

github-actions bot locked and limited conversation to collaborators Dec 18, 2024
