Adding equivalent to numpy.nan_to_num functionality #540

aspeers · 2023-12-08T21:56:16Z

Adding equivalent to numpy.nan_to_num functionality
(https://numpy.org/doc/stable/reference/generated/numpy.nan_to_num.html)
addressing issue 479 (#479).

Motivation for these changes

Requested in issue #479

Implementation details

Added nan_to_num in pytensor/tensor/math.py

Checklist

Explain motivation and implementation 👆
Make sure that the pre-commit linting/style checks pass.
Link relevant issues, preferably in nice commit messages.
The commits correspond to relevant logical changes. Note that if they don't, we will rewrite/rebase/squash the git history before merging.
Are the changes covered by tests and docstrings?
Fill out the short summary sections 👇

Major / Breaking Changes

...

New features

added nan_to_num

Bugfixes

...

Documentation

Example:

import pytensor
from pytensor import tensor as pt

import numpy as np

# Replace NaN
print("Replace NaN")
x = pt.dvector("x")
y = pt.log(x)

f = pytensor.function([x], y)

t = np.random.normal(size=(10,))
print("x")
print(t)
print("f = log(x)")
print(f (t))

# Test: NaN default replacement
print("\nTest: NaN default replacement")
y1 = pt.math.nan_to_num(y)
f = pytensor.function([x], y1)
print(f (t))

# Test: NaN custom replacement
print("\nTest: NaN custom replacement")
y2 = pt.math.nan_to_num(y, nan=1)
f = pytensor.function([x], y2)
print(f (t))

# Replace +INF/-INF
print("\n\nReplace +INF/-INF")
a = pt.dvector("a")
b = pt.dvector("b")
c = pt.math.true_div(a,b)

t_a = np.random.normal(size=(10,))
t_b = np.zeros((10,))

f = pytensor.function([a, b], c)
print("a")
print(t_a)
print("b")
print(t_b)
print("f = a/b")
print(f(t_a, t_b))

# Test: +INF/-INF default replacement
c1 = pt.math.nan_to_num(c)
f = pytensor.function([a, b], c1)
print("\nTest: +INF/-INF default replacement")
print(f(t_a, t_b))

# Test: +INF/-INF custom replacement
c2 = pt.math.nan_to_num(c, posinf=5, neginf=-5)
f = pytensor.function([a, b], c2)
print("\nTest: +INF/-INF custom replacement")
print(f(t_a, t_b))

Maintenance

...

(https://numpy.org/doc/stable/reference/generated/numpy.nan_to_num.html) addressing issue 479 (pymc-devs#479).

aspeers · 2023-12-08T22:04:12Z

First PR from yesterday's code sprint. Main reasons for marking as in progress included:

Had some difficulty running pytest locally. Resulted in multiple failures (most involving missing files/config parameters) which I figured the community could help by providing the relevant pytest command to run.
Could use advice on adding unit tests for the addition.
Was suggested to use ifelse() by @jessegrabowski instead of traditional if statements. ifelse is currently not imported/used in the math.py file and adding it to the imports list was causing a circular reference. Any thoughts would be welcomed!

ricardoV94

Thanks for opening the PR. Looks great!

I left some small technical suggestions. We will also need some tests

ricardoV94 · 2023-12-09T12:11:02Z

pytensor/tensor/math.py

+    x = switch(bitwise_and(isinf(x), pos), maxf, x)
+    x = switch(bitwise_and(isinf(x), ~pos), minf, x)


Let's compare directly with posinf and neginf, if we don't have a isposinf and isneginf it's a good time to add it.

Suggested change

x = switch(bitwise_and(isinf(x), pos), maxf, x)

x = switch(bitwise_and(isinf(x), ~pos), minf, x)

x = switch(bitwise_and(isinf(x), pos), maxf, x)

x = switch(bitwise_and(isinf(x), ~pos), minf, x)

ricardoV94 · 2023-12-09T12:11:30Z

pytensor/tensor/math.py

+    # Get max and min values representable by x.dtype
+    maxf = np.finfo(x.real.dtype).max
+    minf = np.finfo(x.real.dtype).min


Let's define it only when posinf and neginf are None

ricardoV94 · 2023-12-09T12:11:46Z

pytensor/tensor/math.py

@@ -2937,7 +2937,58 @@ def matmul(x1: "ArrayLike", x2: "ArrayLike", dtype: Optional["DTypeLike"] = None
    return out


+def nan_to_num(x, nan=0.0, posinf=None, neginf=None):
+    """
+    Replace NaN values with the `nan` keyword, +INF with the `posinf`


Let's just copy Numpy docstring. No point in reinventing it

ricardoV94 · 2023-12-09T12:13:42Z

pytensor/tensor/math.py

+    """
+
+    # Replace NaN's with nan keyword
+    x = switch(isnan(x), nan, x)


Instead of overriding x, let's write all the checks isnan`isposinf\isneginf` on the original array.

ricardoV94 · 2023-12-09T12:15:25Z

ifelse is not appropriate because it requires a scalar condition, and here we want to work with arbitrary tensors

Dhruvanshu-Joshi · 2024-05-30T04:30:25Z

Hi @aspeers , are you still working on this?

Adding equivalent to numpy.nan_to_num functionality

f5997cd

(https://numpy.org/doc/stable/reference/generated/numpy.nan_to_num.html) addressing issue 479 (pymc-devs#479).

ricardoV94 reviewed Dec 9, 2023

View reviewed changes

ricardoV94 added enhancement New feature or request NumPy compatibility labels Dec 9, 2023

Dhruvanshu-Joshi mentioned this pull request Jun 1, 2024

Add nan_to_num helper #796

Merged

11 tasks

ricardoV94 closed this in #796 Jul 7, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Adding equivalent to numpy.nan_to_num functionality #540

Adding equivalent to numpy.nan_to_num functionality #540

Uh oh!

aspeers commented Dec 8, 2023

Uh oh!

aspeers commented Dec 8, 2023

Uh oh!

ricardoV94 left a comment

Uh oh!

ricardoV94 Dec 9, 2023

Uh oh!

ricardoV94 Dec 9, 2023

Uh oh!

ricardoV94 Dec 9, 2023

Uh oh!

ricardoV94 Dec 9, 2023

Uh oh!

ricardoV94 commented Dec 9, 2023

Uh oh!

Dhruvanshu-Joshi commented May 30, 2024

Uh oh!

Uh oh!

		x = switch(bitwise_and(isinf(x), pos), maxf, x)
		x = switch(bitwise_and(isinf(x), ~pos), minf, x)

Adding equivalent to numpy.nan_to_num functionality #540

Adding equivalent to numpy.nan_to_num functionality #540

Uh oh!

Conversation

aspeers commented Dec 8, 2023

Motivation for these changes

Implementation details

Checklist

Major / Breaking Changes

New features

Bugfixes

Documentation

Maintenance

Uh oh!

aspeers commented Dec 8, 2023

Uh oh!

ricardoV94 left a comment

Choose a reason for hiding this comment

Uh oh!

ricardoV94 Dec 9, 2023

Choose a reason for hiding this comment

Uh oh!

ricardoV94 Dec 9, 2023

Choose a reason for hiding this comment

Uh oh!

ricardoV94 Dec 9, 2023

Choose a reason for hiding this comment

Uh oh!

ricardoV94 Dec 9, 2023

Choose a reason for hiding this comment

Uh oh!

ricardoV94 commented Dec 9, 2023

Uh oh!

Dhruvanshu-Joshi commented May 30, 2024

Uh oh!

Uh oh!