
How to do multithreaded infer with onnxruntime #21419

Open
XiaBing992 opened this issue Jul 19, 2024 · 2 comments
Labels
stale issues that have not been addressed in a while; categorized by a bot

Comments

@XiaBing992

Describe the issue

When I run multithreaded inference with onnxruntime (Python), I get an error. Each thread has its own independent onnx_session and reads its own model file. What should I do to run inference from multiple threads?

To reproduce

```python
...
def Infer(self, request, context):
    # inference
    log.info('infer...')
    model_path = os.path.join(MODEL_DIR, request.model_name)
    input = np.array(request.input).astype(np.float32)
    ......
    providers = [
        ('CUDAExecutionProvider', {
            'device_id': 0,
        }),
        'CPUExecutionProvider',
    ]
    onnx_session = onnxruntime.InferenceSession(model_path, providers=providers)
    ........

    output = onnx_session.run(output_name, input_feed=input_feed)

    .......

if __name__ == '__main__':
    thread_num = 50
    threads = []
    start_time = time.time()
    for index in range(thread_num):
        threads.append(threading.Thread(target=Infer, args=......))
        threads[index].start()

    for th in threads:
        th.join()

    end_time = time.time()
    log.info('all time: {}'.format(end_time - start_time))
```
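One pattern worth trying instead of building 50 sessions at once: load a single `InferenceSession` and call it from many threads, since `InferenceSession.run()` is documented as thread-safe and concurrent session creation is what typically exhausts GPU memory. A minimal sketch of that shape, where `FakeSession` is a hypothetical stand-in for the real `onnxruntime.InferenceSession`:

```python
from concurrent.futures import ThreadPoolExecutor


class FakeSession:
    """Hypothetical stand-in for onnxruntime.InferenceSession.

    In real code, build this once:
        session = onnxruntime.InferenceSession(model_path, providers=providers)
    """

    def run(self, output_names, input_feed):
        # Real code would execute the model; here we just double the input.
        return [input_feed["input"] * 2]


def infer_many(session, inputs, max_workers=8):
    # One shared session, many threads: each call maps one input through
    # session.run(); run() may be invoked concurrently.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(
            lambda x: session.run(None, {"input": x})[0], inputs))
```

This is a sketch of the shape only; whether it resolves the error depends on what the error actually is (not shown in the issue).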

Urgency

No response

Platform

Linux

OS Version

ubuntu 20.04

ONNX Runtime Installation

Released Package

ONNX Runtime Version or Commit ID

1.18.1

ONNX Runtime API

Python

Architecture

X64

Execution Provider

CUDA

Execution Provider Library Version

12.4

@github-actions github-actions bot added the ep:CUDA issues related to the CUDA execution provider label Jul 19, 2024
@wejoncy
Copy link
Contributor

wejoncy commented Jul 22, 2024

Do you want multiple sessions in your scenario?
Multiprocessing should work for you.
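A minimal sketch of that multiprocessing suggestion, with each worker process building its own session. The dict `session` is a hypothetical stand-in for `onnxruntime.InferenceSession`, and the `"fork"` start method (Linux) is used so worker functions need not be pickled:

```python
import multiprocessing as mp


def worker(model_path, xs, q):
    # Each process creates its own session, so no CUDA state is shared.
    # In real code: session = onnxruntime.InferenceSession(model_path,
    #                                                      providers=providers)
    session = {"model": model_path}  # hypothetical stand-in
    for x in xs:
        # In real code: q.put(session.run(output_names, input_feed))
        q.put(x * 2)


def infer_in_processes(model_path, inputs, workers=2):
    ctx = mp.get_context("fork")  # Linux-only; avoids pickling the target
    q = ctx.Queue()
    chunks = [inputs[i::workers] for i in range(workers)]
    procs = [ctx.Process(target=worker, args=(model_path, c, q))
             for c in chunks]
    for p in procs:
        p.start()
    # Drain the queue before joining, so workers never block on a full queue.
    results = [q.get() for _ in inputs]
    for p in procs:
        p.join()
    return sorted(results)
```

Note that per-process sessions cost one model copy of GPU memory each, so the worker count should be far smaller than the 50 threads in the repro.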

@sophies927 sophies927 removed the ep:CUDA issues related to the CUDA execution provider label Jul 25, 2024
github-actions bot commented

This issue has been automatically marked as stale due to inactivity and will be closed in 30 days if no further activity occurs. If further support is needed, please provide an update and/or more details.

@github-actions github-actions bot added the stale issues that have not been addressed in a while; categorized by a bot label Aug 25, 2024