More accurate time logging for ImageEncoder and fix concurrent image processing corruption #1765
Conversation
@@ -104,23 +98,22 @@ def forward(self, inputs: List[Image]):
        """Model forward."""
        time_start = time.perf_counter()
        outputs = self.model.forward(inputs)
        if isinstance(outputs[0], torch.Tensor):
Do we still need to convert to CPU?
This modification mainly addresses #1759.
Will this slow down inference performance?
Tested with llava-v1.5-7b and profile_restful_api.py (PR 1662).
Moving the tensor to CPU doesn't affect performance, but using asyncio.Task may affect it.
# with thread
number of prompt tokens: 248339
number of completion tokens: 240582
token throughput (completion token): 825.260 token/s
token throughput (prompt + completion token): 1677.128 token/s
RPS (request per second): 3.430 req/s
RPM (request per minute): 205.816 req/min
# with asyncio.Task
number of prompt tokens: 248339
number of completion tokens: 240582
token throughput (completion token): 808.719 token/s
token throughput (prompt + completion token): 1643.513 token/s
RPS (request per second): 3.362 req/s
RPM (request per minute): 201.691 req/min
If GPU utilization is high, you can add
await asyncio.get_event_loop().run_in_executor(None,
                                               self.stream.synchronize)
after forward so other coroutines can use the CPU during the forward pass.
According to my tests, wrapping the whole forward call is better than synchronizing the stream afterward:
outputs = await asyncio.get_event_loop().run_in_executor(
None, self.forward, inputs)
# token throughput (completion token): 826.370 token/s
outputs = self.model.forward(inputs)
if isinstance(outputs[0], torch.Tensor):
stream = torch.cuda.current_stream(outputs[0].device)
await asyncio.get_event_loop().run_in_executor(
None, stream.synchronize)
# token throughput (completion token): 815.130 token/s
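The pattern being compared above can be sketched without a GPU: offload the blocking call (here, `model.forward` plus stream synchronization) to the default thread-pool executor via `run_in_executor`, so other coroutines keep running while the forward executes. In this minimal sketch, `blocking_forward` and `heartbeat` are hypothetical stand-ins, with `time.sleep` substituting for GPU work:

```python
import asyncio
import time


def blocking_forward(inputs):
    # Stand-in for model.forward + stream.synchronize, which block
    # the calling thread until the (GPU) work finishes.
    time.sleep(0.2)
    return [x * 2 for x in inputs]


async def encode(inputs):
    # Offload the blocking call to the default thread pool so the
    # event loop stays free for other coroutines.
    loop = asyncio.get_event_loop()
    return await loop.run_in_executor(None, blocking_forward, inputs)


async def heartbeat(ticks):
    # Keeps running concurrently while the forward is offloaded.
    for _ in range(4):
        await asyncio.sleep(0.05)
        ticks.append(time.perf_counter())


async def main():
    ticks = []
    outputs, _ = await asyncio.gather(encode([1, 2, 3]), heartbeat(ticks))
    return outputs, ticks


outputs, ticks = asyncio.run(main())
print(outputs)  # [2, 4, 6] — heartbeat ticked 4 times during the forward
```

Without `run_in_executor`, the blocking call would stall the event loop and `heartbeat` could not make progress until the forward returned, which matches the throughput difference observed above.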
LGTM
Motivation
#1759