
ENH avoid np.square(X) in enet_coordinate_descent to save memory #31665


Merged
merged 4 commits into main from cd_fast_avoid_square_X on Jun 27, 2025

Conversation

lorentzenchr (Member) commented Jun 26, 2025

Reference Issues/PRs

None

What does this implement/fix? Explain your changes.

This PR replaces np.square(X).sum(axis=0) with np.einsum("ij,ij->j", X, X) to avoid a memory allocation of the size of X (usually the largest object).
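
As a minimal sketch of the equivalence (the array and variable names here are only illustrative, not from the PR):

    import numpy as np

    rng = np.random.default_rng(0)
    X = rng.standard_normal((100, 1_000))

    # Both expressions compute the per-column sum of squares of X, but
    # np.square(X) materializes a temporary the size of X, while einsum
    # accumulates the products directly into a length-n_features result.
    norms_square = np.square(X).sum(axis=0)
    norms_einsum = np.einsum("ij,ij->j", X, X)
    assert np.allclose(norms_square, norms_einsum)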

Any other comments?

This also improves timing a bit.

We might even consider writing the loop explicitly, like in _sqeuclidean_row_norms64_dense:

cdef float64_t[::1] _sqeuclidean_row_norms64_dense(
    const float64_t[:, ::1] X,
    intp_t num_threads,
):
    """Compute the squared euclidean norm of the rows of X in parallel.

    This is faster than using np.einsum("ij, ij->i") even when using a single thread.
    """
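
As a rough illustration only (pure Python, not the Cython helper quoted above, and the function name is hypothetical), an explicit loop for the column-wise norms used in this PR could look like:

    import numpy as np

    def sqeuclidean_col_norms_loop(X):
        # Accumulate the squared euclidean norm of each column by looping over
        # rows; only row-sized temporaries are created, never one the size of X.
        out = np.zeros(X.shape[1], dtype=X.dtype)
        for i in range(X.shape[0]):
            out += X[i] * X[i]
        return out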

github-actions bot commented Jun 26, 2025

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

Generated for commit: 47d0663. Link to the linter CI: here

thomasjpfan (Member) left a comment

Do you have a quick memory benchmark of einsum vs np.square(X).sum(axis=0)?

lorentzenchr (Member, Author) commented Jun 27, 2025

import numpy as np

rng = np.random.default_rng(42)
X = rng.standard_normal((100, 10_000))
print(f"X allocates {X.nbytes * 1e-6} MB of memory")
# X allocates 8.0 MB of memory

%timeit np.square(X).sum(axis=0)
# 1.09 ms ± 53.2 μs per loop (mean ± std. dev. of 7 runs, 1,000 loops each)

%timeit np.einsum("ij,ij->j", X, X)
# 631 μs ± 29.3 μs per loop (mean ± std. dev. of 7 runs, 1,000 loops each)

%load_ext memory_profiler

%memit np.square(X).sum(axis=0)
# peak memory: 87.97 MiB, increment: 7.98 MiB

%memit np.einsum("ij,ij->j", X, X)
# peak memory: 88.34 MiB, increment: 0.03 MiB

Results are similar with X transposed.
This shows that np.square indeed allocates an extra array of the size of X (+8 MB), while einsum does not.
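
For reference, a sketch of how the same memory comparison could be reproduced outside IPython, assuming memory_profiler is installed and using its memory_usage helper:

    import numpy as np
    from memory_profiler import memory_usage

    rng = np.random.default_rng(42)
    X = rng.standard_normal((100, 10_000))

    for label, func in [
        ("np.square(X).sum(axis=0)", lambda: np.square(X).sum(axis=0)),
        ('np.einsum("ij,ij->j", X, X)', lambda: np.einsum("ij,ij->j", X, X)),
    ]:
        # memory_usage samples the process memory (in MiB) while func runs.
        samples = memory_usage((func, (), {}), interval=0.01)
        print(f"{label}: peak {max(samples):.2f} MiB")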

OmarManzoor (Contributor) left a comment

LGTM. Thanks @lorentzenchr

@OmarManzoor OmarManzoor merged commit 687e84a into scikit-learn:main Jun 27, 2025
36 checks passed
@lorentzenchr lorentzenchr deleted the cd_fast_avoid_square_X branch June 27, 2025 20:08
lorentzenchr added a commit to lorentzenchr/scikit-learn that referenced this pull request Jun 27, 2025
jeremiedbb added a commit that referenced this pull request Jun 29, 2025
Co-authored-by: Jérémie du Boisberranger <jeremie@probabl.ai>
Co-authored-by: Virgil Chan <virchan.math@gmail.com>
3 participants