
Non-zero status code returned while running ConvTranspose node. #21034

Open
Jerry-Master opened this issue Jun 13, 2024 · 0 comments
Labels
ep:CUDA issues related to the CUDA execution provider

Comments


Jerry-Master commented Jun 13, 2024

Describe the issue

When running my ONNX model from C++ on the CPU, everything works perfectly. However, when running it with the CUDA execution provider, it throws this error:

2024-06-13 16:21:29.9651844 [E:onnxruntime:InferenceEngineORT, cuda_call.cc:118 onnxruntime::CudaCall] CUDNN failure 3: CUDNN_STATUS_BAD_PARAM ; GPU=0 ; hostname=008C00014 ; file=C:\a\_work\1\s\onnxruntime\core\providers\cuda\nn\conv_transpose.cc ; line=318 ; expr=cudnnAddTensor(GetCudnnHandle(context), &alpha, s_.b_tensor, b_data, &alpha, s_.y_tensor, y_data);
2024-06-13 16:21:29.9907653 [E:onnxruntime:, sequential_executor.cc:516 onnxruntime::ExecuteKernel] Non-zero status code returned while running ConvTranspose node. Name:'_inlfunc_torch_nn_modules_conv_ConvTranspose2d_p_m_up3_0_1_ConvTranspose_7' Status Message: CUDNN failure 3: CUDNN_STATUS_BAD_PARAM ; GPU=0 ; hostname=008C00014 ; file=C:\a\_work\1\s\onnxruntime\core\providers\cuda\nn\conv_transpose.cc ; line=318 ; expr=cudnnAddTensor(GetCudnnHandle(context), &alpha, s_.b_tensor, b_data, &alpha, s_.y_tensor, y_data);

To reproduce

The model has two inputs: l_x_, a tensor of shape (1, 3, 128, 256), and l_k_, a tensor of shape (1, 1, 32, 32); the output is named p_8 and has shape (1, 3, 128, 256). The model is uploaded to drive here. It was exported from the usrnet repo using the dynamo exporter with opset 18.

The following python code helps reproduce the issue:

import argparse
import cv2
import onnxruntime
import numpy as np

def _create_parser():
    parser = argparse.ArgumentParser()
    parser.add_argument('--onnx-path', type=str, default="usrnet_128x256_32x32.onnx")
    return parser


def main():
    parser = _create_parser()
    args = parser.parse_args()

    x = np.ones((1, 3, 128, 256)).astype(np.float32)
    k = np.ones((1, 1, 32, 32)).astype(np.float32)

    # compute ONNX Runtime output prediction
    ort_session = onnxruntime.InferenceSession(args.onnx_path, providers=['CUDAExecutionProvider'])  
    ort_inputs = {'l_x_': x, 'l_k_': k}
    ort_outs = ort_session.run(['p_8'], ort_inputs)

    out = ort_outs[0].squeeze(0).transpose(1, 2, 0)
    # Normalize to [0, 255] for saving; dst must be None (or an output
    # array), not -1, and alpha/beta are the target range bounds.
    out_norm = cv2.normalize(out, None, 0, 255, norm_type=cv2.NORM_MINMAX)
    cv2.imwrite('out.png', out_norm.astype(np.uint8))
    

if __name__ == '__main__':
    main()

Urgency

Yes

Platform

Windows

OS Version

11

ONNX Runtime Installation

Released Package

ONNX Runtime Version or Commit ID

1.18.0

ONNX Runtime API

C++

Architecture

X64

Execution Provider

CUDA

Execution Provider Library Version

CUDA 11.6

@github-actions github-actions bot added ep:CUDA issues related to the CUDA execution provider platform:windows issues related to the Windows platform labels Jun 13, 2024
@sophies927 sophies927 removed the platform:windows issues related to the Windows platform label Jun 13, 2024