Add digitize op for Discretization layer #641
Conversation
Thank you for the PR! 👍
keras_core/ops/numpy.py (Outdated)

        return backend.numpy.digitize(x, bins)

    def compute_output_spec(self, x, bins):
        return KerasTensor(x.shape, dtype=x.dtype)
Surely the dtype should be int? What is it?
x.dtype can be float or int, so should I return KerasTensor(x.shape, dtype="int32")? Actually I wanted to make the dtype int, but I saw that there are other ops where the input can be float and the output dtype is set to the input dtype, so I thought the output dtype had to match the input.
It's not an open question where you can make a choice. The dtype you pass here should match the dtype that is actually returned when you run the op. What is that dtype?
Ok, I get it now. It's int64 for numpy.
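For reference, this is easy to check directly with numpy, whose digitize returns indices with the platform-dependent np.intp dtype (int64 on most 64-bit systems):

```python
import numpy as np

x = np.array([0.2, 6.4, 3.0, 1.6])
bins = np.array([0.0, 1.0, 2.5, 4.0, 10.0])

indices = np.digitize(x, bins)
# np.digitize returns bin indices with dtype np.intp, which is
# int64 on most 64-bit platforms (narrower on 32-bit builds).
print(indices.dtype)
```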
It must be the same dtype in all backends (if it isn't, that's a bug and we need to cast)
I have used standardize_dtype to test the return dtypes, but it changes the numpy dtype from int64 to int32 because of this line:

    if dtype == "int":

where np.dtype("int64") == "int" returns True, so the body of that condition executes and returns int32 instead of int64, which makes the test fail. I can open a pull request to fix this, if you like: I would just move the if statement that currently sits below the mentioned line above it.
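A minimal sketch of the pattern being described (hypothetical code, not the actual keras_core source): NumPy's lenient dtype equality makes the generic "int" branch swallow int64 before the int64 branch is reached, and reordering the checks avoids that.

```python
import numpy as np

def buggy_standardize(dtype):
    # On platforms whose default int is 64-bit (e.g. 64-bit Linux),
    # np.dtype("int64") == "int" is True, so int64 never reaches
    # its own branch below.
    if dtype == "int":
        return "int32"
    if dtype == "int64":
        return "int64"
    return str(dtype)

def fixed_standardize(dtype):
    # Check the exact-width name first, then the generic alias.
    if dtype == "int64":
        return "int64"
    if dtype == "int":
        return "int32"
    return str(dtype)

print(buggy_standardize(np.dtype("int64")))  # the bug: int64 becomes "int32"
print(fixed_standardize(np.dtype("int64")))
```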
Also, jax doesn't enable int64 until jax_enable_x64 is set to True via jax.config.update("jax_enable_x64", True). I think if we want to unify the dtypes we should enable x64 when the backend is set to jax.
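For context, the opt-in looks like this (config fragment; it must run before any jax arrays are created):

```python
import jax

# jax defaults to 32-bit types; 64-bit dtypes (including int64)
# are only available after setting this flag, before any arrays
# are created.
jax.config.update("jax_enable_x64", True)
```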
Got it. In that case, just pass dtype="int" in compute_output_spec, and in the unit tests check that x.dtype matches backend.standardize_dtype("int").
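The suggestion amounts to something like the following sketch, using a minimal stand-in for KerasTensor (the real class lives in keras_core and has a richer API):

```python
from dataclasses import dataclass

@dataclass
class KerasTensor:  # minimal stand-in for keras_core's KerasTensor
    shape: tuple
    dtype: str

def compute_output_spec(x, bins):
    # Bin indices are integral regardless of x.dtype; declaring the
    # generic "int" lets each backend standardize it to its own
    # default integer width (int32 vs int64).
    return KerasTensor(x.shape, dtype="int")

spec = compute_output_spec(KerasTensor((3, 4), "float32"), bins=[0.0, 1.0, 2.5])
print(spec.shape, spec.dtype)  # (3, 4) int
```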
I did that, but I had to cast the pytorch output to int32 for the test to pass. I think fixing standardize_dtype() and enabling int64 by default for jax would be a better solution. Can you give that a look?
JAX doesn't make that possible.
Thanks for the contribution!
This op will help implement the Discretization layer. The output of the op is unified to match the tensorflow output. There are some things to consider:
1. jax and numpy can digitize when the bins are monotonically decreasing, but torch and tensorflow can't: torch returns undefined output, and tensorflow raises an error.
2. jax, numpy, and torch have a right arg but tensorflow doesn't, so I didn't add it.
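To illustrate points 1 and 2 with numpy (jax.numpy behaves the same way in these cases):

```python
import numpy as np

x = np.array([0.5, 2.5, 7.5])

# 1. numpy (and jax) accept monotonically decreasing bins;
#    torch and tensorflow do not.
dec = np.digitize(x, np.array([10.0, 4.0, 2.5, 1.0, 0.0]))

# 2. `right` controls which side of each bin edge is closed;
#    tensorflow's bucketize has no equivalent flag.
inc = np.array([0.0, 1.0, 2.5, 4.0, 10.0])
left = np.digitize(x, inc, right=False)   # bins[i-1] <= x < bins[i]
right = np.digitize(x, inc, right=True)   # bins[i-1] <  x <= bins[i]

print(dec, left, right)  # note 2.5 lands in a different bin per `right`
```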