chore: add array operators to SQLGlot compiler #1852

sycai · 2025-06-25T02:25:34Z

In addition, I polished the OpRegistration class such that full type hints are supported for each compilation function. This is achieved by relaxing the constraints of function signatures.

chelsea-lin · 2025-06-25T17:49:45Z

bigframes/core/compile/sqlglot/expressions/unary_compiler.py

+
+@UNARY_OP_REGISTRATION.register(ops.ArrayIndexOp)
+def _(op: ops.ArrayIndexOp, expr: TypedExpr) -> sge.Expression:
+    offset = sge.Anonymous(


The compile_explode also need SAFE_OFFSET but present as different way. Maybe we should define them in the same way:

python-bigquery-dataframes/bigframes/core/compile/sqlglot/sqlglot_ir.py

Lines 367 to 371 in bc885bd

sge.Bracket(

this=column,

expressions=[unnested_offset_alias],

safe=True,

offset=False,

Can we use sge.func to call a specific function instead of a generic one currently referred to as sge.Anonymous?

safe_offset prefer upper case for any SQL preserved keywords.

Good call. Done!

chelsea-lin · 2025-06-25T17:59:30Z

bigframes/core/compile/sqlglot/expressions/unary_compiler.py

+
+@UNARY_OP_REGISTRATION.register(ops.ArraySliceOp)
+def _(op: ops.ArraySliceOp, expr: TypedExpr) -> sge.Expression:
+    slice_idx = sqlglot.to_identifier("slice_idx")


the sql ops use next(self.uid_gen.get_uid_stream("bfcol_")) to generate any new columns but this uid_gen instance is not passed to the scalar op compiler yet.

That feels like an overkill. I don't think we need to worry about name collisions in this subquery of unnesting an array with offsets , because none of these fields inherently has a name. Plus, the "slice_idx" and "el" are more meaningful names.

As for injecting uid_gen to the scalar compiler in the general sense, let's only do that when it's necessary.

you're right. Both slice_idx and el are both local naming and won't be joined with other column names yet, according to the generated SQL below. Yes, I am okay to keep as current so far.

ARRAY( SELECT el FROM UNNEST(`bfcol_1`) AS el WITH OFFSET AS slice_idx WHERE slice_idx >= 1 AND slice_idx < 5 ) AS `bfcol_4`

sycai and others added 6 commits June 23, 2025 22:38

[WIP] Add array operators. Need to finish tests

4dacef0

Merge branch 'main' into sycai_scalar_compiler

55cf35f

Merge branch 'main' into sycai_scalar_compiler

41dcdd6

add tests

3c6d8a9

fix lint

63fedea

Merge branch 'main' into sycai_scalar_compiler

91e35b4

product-auto-label bot added size: l Pull request size is large. api: bigquery Issues related to the googleapis/python-bigquery-dataframes API. labels Jun 25, 2025

fix typos

419e906

sycai added the kokoro:force-run Add this label to force Kokoro to re-run the tests. label Jun 25, 2025

yoshi-kokoro removed the kokoro:force-run Add this label to force Kokoro to re-run the tests. label Jun 25, 2025

sycai requested a review from chelsea-lin June 25, 2025 17:36

sycai marked this pull request as ready for review June 25, 2025 17:36

sycai requested review from a team as code owners June 25, 2025 17:36

blunderbuss-gcf bot assigned shobsi Jun 25, 2025

chelsea-lin reviewed Jun 25, 2025

View reviewed changes

Use sge.Bracket() for safe_offset

4041e63

sycai requested a review from chelsea-lin June 25, 2025 18:38

chelsea-lin approved these changes Jun 25, 2025

View reviewed changes

sycai enabled auto-merge (squash) June 25, 2025 18:51

sycai merged commit c88a825 into main Jun 25, 2025
18 of 25 checks passed

sycai deleted the sycai_scalar_compiler branch June 25, 2025 18:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

chore: add array operators to SQLGlot compiler #1852

chore: add array operators to SQLGlot compiler #1852

sycai commented Jun 25, 2025

Uh oh!

chelsea-lin Jun 25, 2025

Uh oh!

chelsea-lin Jun 25, 2025

Uh oh!

chelsea-lin Jun 25, 2025

Uh oh!

sycai Jun 25, 2025

Uh oh!

chelsea-lin Jun 25, 2025

Uh oh!

sycai Jun 25, 2025

Uh oh!

chelsea-lin Jun 25, 2025

Uh oh!

Uh oh!

Uh oh!

	sge.Bracket(
	this=column,
	expressions=[unnested_offset_alias],
	safe=True,
	offset=False,

chore: add array operators to SQLGlot compiler #1852

chore: add array operators to SQLGlot compiler #1852

Conversation

sycai commented Jun 25, 2025

Uh oh!

chelsea-lin Jun 25, 2025

Choose a reason for hiding this comment

Uh oh!

chelsea-lin Jun 25, 2025

Choose a reason for hiding this comment

Uh oh!

chelsea-lin Jun 25, 2025

Choose a reason for hiding this comment

Uh oh!

sycai Jun 25, 2025

Choose a reason for hiding this comment

Uh oh!

chelsea-lin Jun 25, 2025

Choose a reason for hiding this comment

Uh oh!

sycai Jun 25, 2025

Choose a reason for hiding this comment

Uh oh!

chelsea-lin Jun 25, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!