TensorFlow GPU - Fix keras/layers/merging/merging_test.py
#18567
Comments
The culprit is between Tested via: |
Thanks Ramesh for the repro; we will revisit this during the triage meeting. On a side note, I didn't find any change on the sparse side between those two dates. We will need to dig deeper for the root cause. |
Here's a small code snippet to reproduce the issue in Colab with Keras Master and TF-Nightly -
Error:
|
If I replace |
@fchollet - If you have any thoughts or suggestions to try let me know. |
The example you provided doesn't even use sparse inputs, which is different from the error on top. The error |
If you install the nightly from 09/18, it works fine.
It started failing with the 09/19 nightly.
|
There are multiple failures in |
TF Nightly 09/18 works for ALL the tests in |
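The narrowing above (09/18 passes, 09/19 fails) is a bisection over nightly build dates. For a wider window, the same search could be sketched as below. This is only an illustration: `first_failing_nightly` and the `fails` callback are hypothetical names, the callback stands in for "install that day's nightly and run the failing test", and the sketch assumes that once a nightly fails, every later one fails too.

```python
from datetime import date, timedelta
from typing import Callable


def first_failing_nightly(
    good: date, bad: date, fails: Callable[[date], bool]
) -> date:
    """Return the earliest date in (good, bad] whose nightly fails.

    Assumes the nightly for `good` passes, the one for `bad` fails, and that
    failures are monotonic (no nightly after the first bad one passes again).
    `fails` is a hypothetical callback supplied by the caller.
    """
    if good >= bad:
        raise ValueError("`good` must be earlier than `bad`")
    lo, hi = good, bad  # invariant: lo passes, hi fails
    while (hi - lo).days > 1:
        mid = lo + timedelta(days=(hi - lo).days // 2)
        if fails(mid):
            hi = mid
        else:
            lo = mid
    return hi


if __name__ == "__main__":
    # Stand-in predicate mirroring the thread; the year is a placeholder.
    culprit = first_failing_nightly(
        date(2023, 9, 1), date(2023, 9, 30), lambda d: d >= date(2023, 9, 19)
    )
    print(culprit.isoformat())
```

Each probe costs one install plus one test run, so a month-wide window needs about five probes instead of thirty.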
Somehow I wasn't able to reproduce the failure on Colab with a T4 GPU. https://colab.sandbox.google.com/drive/1_hMJieL_6DobTPUbZ6BRZIEVz0YRHhBo#scrollTo=GM2B7qEqNYqk Maybe I didn't configure the GPU properly? |
@sampathweb do you have a testable environment that I can run with? |
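When a failure reproduces in one environment but not another, a first step is to compare the installed package versions on both sides. A minimal sketch using only the standard library follows; `installed_versions` is a hypothetical helper and the package list is an assumption about what matters here. It does not report the CUDA or cuDNN versions the wheels were built against.

```python
from importlib import metadata
from typing import Dict, Iterable, Optional


def installed_versions(packages: Iterable[str]) -> Dict[str, Optional[str]]:
    """Map each distribution name to its installed version, or None if absent."""
    versions: Dict[str, Optional[str]] = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


if __name__ == "__main__":
    # Assumed list of relevant distributions; adjust to the environment.
    names = ["tf-nightly", "tensorflow", "keras", "jax", "jaxlib"]
    for name, version in installed_versions(names).items():
        print(f"{name}: {version or 'not installed'}")
```

Running this in both the failing and the passing environment and diffing the output shows whether the two are on the same nightly.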
Also seems to be failing with JAX-GPU now:
|
I will work on this tomorrow. I used a Colab V100 as my test environment. |
Seems to be a cuDNN TF compilation issue.
Tested via
|
Fix failing test -
keras/layers/merging/merging_test.py::MergingLayersTest::test_sparse_dot_2d Fatal Python error: Aborted
and update TODO in https://github.com/keras-team/keras/blob/master/keras/kokoro/github/ubuntu/gpu/build.sh#L39
https://source.cloud.google.com/results/invocations/9df9ee7e-5666-4644-abd2-01a10771faeb/targets/keras%2Fgithub%2Fubuntu%2Fgpu%2Ftensorflow%2Fpresubmit/log