-
Notifications
You must be signed in to change notification settings - Fork 276
Issues: InternLM/lmdeploy
[Benchmark] benchmarks on different cuda architecture with mo...
#815
opened Dec 11, 2023 by
lvhan028
Open
6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[Bug] python -m lmdeploy.serve.proxy.proxy --server_name "xxx" --server_port xxx --strategy "min_expected_latency"
#1851
opened Jun 25, 2024 by
zeroneway
2 tasks
[Bug] Segmentation fault: address not mapped to object at address 0x2058
#1849
opened Jun 25, 2024 by
austingg
2 tasks done
under stream mode, if break generator in advance, it may lead to server stuck [Bug]
#1848
opened Jun 25, 2024 by
shanekong
2 tasks
[Bug] InternLM2MLP.forward() missing 1 required positional argument: 'im_mask'
#1847
opened Jun 25, 2024 by
jiangjingz
2 tasks done
[Bug] lmdeploy - [31mERROR[0m - Truncate max_new_tokens to 221
#1841
opened Jun 24, 2024 by
tairen99
1 of 2 tasks
[Feature] How to support bf16 when inferencing Internvl-chat
#1839
opened Jun 24, 2024 by
Leo-yang-1020
[Bug] Qwen-7B-Chat 量化报错 AttributeError: 'RMSNorm' object has no attribute 'variance_epsilon'
#1830
opened Jun 23, 2024 by
CodexDive
2 tasks
Model name id returned is weird specially when using Docker [Bug]
#1827
opened Jun 21, 2024 by
Hugobox
2 tasks done
[Bug] MiniCPM-llama3-V2_5 启动后使用image url 使用base64 没有回复结果
#1819
opened Jun 21, 2024 by
weiminw
2 tasks
[Bug] lmdeploy部署intermlm2-chat-20b,遇到<|im_end|>不会停止
#1815
opened Jun 20, 2024 by
jeinlee1991
2 tasks done
[Bug] vl pipeline triggle cudaMemcpyAsync ERROR illegal memory access
#1813
opened Jun 20, 2024 by
pupumao
2 tasks done
[Bug] n_token = outputs.num_token . Error: AttributeError: 'tuple' object has no attribute 'num_token'
#1802
opened Jun 19, 2024 by
Liqiandi
2 tasks done
[Feature] Prefill/Decoding disaggregation substantially boosts throughput
#1801
opened Jun 19, 2024 by
serser
Previous Next
ProTip!
Find all open issues with in progress development work with linked:pr.