chore: add benchmarks for read_gbq_colab #1860

tswast · 2025-06-26T18:13:53Z

Follow-up to #1846

Thank you for opening a Pull Request! Before submitting your PR, there are a few things you can do to make sure it goes smoothly:

Make sure to open an issue as a bug/issue before writing your code! That way we can discuss the change, evaluate designs, and agree on the general idea
Ensure the tests and linter pass
Code coverage does not decrease (if any source code was changed)
Appropriate docs were updated (if necessary)

Towards internal issue b/420984164 🦕

tswast · 2025-06-26T18:15:06Z

Marking as do not merge until the percentile_99 table finishes writing, but I think it's ready to review at least. Ran locally.

TrevorBergeron · 2025-06-26T23:02:26Z

tests/benchmark/read_gbq_colab/aggregate_output.py

+    group_column = "col_int64_1"
+    if group_column not in df.columns:
+        group_column = "col_bool_0"


Not sure I follow what is going on here?

We need some column to group by, and some tables with tiny rows can only fit a boolean column. I can add a comment.
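
For readers skimming the thread, the fallback in the diff above can be sketched in isolation. This is a minimal illustration in plain Python, not the benchmark's actual code: the helper name pick_group_column and the two example column lists are hypothetical, and only the column names col_int64_1 and col_bool_0 come from the diff.

```python
from typing import Iterable


def pick_group_column(columns: Iterable[str]) -> str:
    """Prefer the integer column; fall back to the boolean one.

    Hypothetical helper that mirrors the three lines in the diff.
    """
    columns = list(columns)
    group_column = "col_int64_1"
    if group_column not in columns:
        # Tables with tiny rows may only have room for a boolean column.
        group_column = "col_bool_0"
    return group_column


# Assumed example schemas, for illustration only.
wide_table_columns = ["col_bool_0", "col_int64_0", "col_int64_1"]
tiny_table_columns = ["col_bool_0"]

wide_choice = pick_group_column(wide_table_columns)
tiny_choice = pick_group_column(tiny_table_columns)
```

With these assumed schemas, the wide table groups by col_int64_1 and the tiny table groups by col_bool_0.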

TrevorBergeron · 2025-06-26T23:19:53Z

tests/benchmark/read_gbq_colab/filter_output.py

+
+    # Simulate the user filtering by a column and visualizing those results
+    df_filtered = df[df["col_bool_0"]]
+    df_filtered.shape


These .shape calls are going to be pretty brutal; they are going to double-execute.
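
To make the concern concrete, here is a toy model of a lazily evaluated frame in which every access re-runs the underlying work. It is a sketch under stated assumptions, not how bigframes is implemented: the LazyFrame class, its executions counter, and its to_rows method are all hypothetical, and the thread does not say whether any caching applies in the real library.

```python
class LazyFrame:
    """Toy stand-in for a lazily evaluated DataFrame (hypothetical)."""

    def __init__(self, rows):
        self._rows = rows
        self.executions = 0  # how many times the "query" has run

    def _execute(self):
        # Each call models one round trip to the query engine.
        self.executions += 1
        return list(self._rows)

    @property
    def shape(self):
        rows = self._execute()
        ncols = len(rows[0]) if rows else 0
        return (len(rows), ncols)

    def to_rows(self):
        return self._execute()


frame = LazyFrame([(True, 1), (True, 2)])
dimensions = frame.shape   # first execution, only to learn the size
rows = frame.to_rows()     # second execution of the same work
```

In this toy model, reading the shape and then fetching the rows costs two executions, which is the kind of doubling the comment is warning about.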

chore: add benchmarks for read_gbq_colab

77c9061

tswast requested review from a team as code owners June 26, 2025 18:13

tswast requested a review from jialuoo June 26, 2025 18:13

blunderbuss-gcf bot assigned jiaxunwu Jun 26, 2025

product-auto-label bot added the "size: l" (pull request size is large) and "api: bigquery" (issues related to the googleapis/python-bigquery-dataframes API) labels Jun 26, 2025

tswast added the "do not merge" label (indicates a pull request not ready for merge, due to either quality or timing) Jun 26, 2025

Merge branch 'main' into b420984164-bench-methods

5c21699

correct project id

b14171c

tswast requested review from TrevorBergeron and removed request for jialuoo June 26, 2025 18:16

tswast assigned TrevorBergeron and unassigned jiaxunwu Jun 26, 2025

tswast added 3 commits June 26, 2025 13:18

Merge remote-tracking branch 'origin/b420984164-bench-methods' into b…

526356a

…420984164-bench-methods

exclude error too

615a76a

Delete tests/benchmark/read_gbq_colab/first_page.py_percentile_99.error

1c963c3

TrevorBergeron previously approved these changes Jun 26, 2025

tswast removed the "do not merge" label (indicates a pull request not ready for merge, due to either quality or timing) Jun 27, 2025

explain column selection for groupby

8a36a32

tswast dismissed TrevorBergeron’s stale review via 8a36a32 June 27, 2025 20:14

tswast enabled auto-merge (squash) June 27, 2025 20:41

TrevorBergeron approved these changes Jun 27, 2025

tswast merged commit ed75cd9 into main Jun 27, 2025
24 of 25 checks passed

tswast deleted the b420984164-bench-methods branch June 27, 2025 21:24
