
Adapted to the Ascend NPU #3933

Closed
wants to merge 1 commit into from

Conversation

Dbassqwer

This commit adapts the current 0.2.10 release to the Huawei Ascend NPU. FastChat has already been adapted upstream in its official repo; this commit covers the following:
1. NPU loading support in the embedding-model code
2. NPU as a selectable device

@dosubot dosubot bot added the size:S This PR changes 10-29 lines, ignoring generated files. label Apr 30, 2024
@@ -512,25 +512,28 @@ def _get_proxies():
def detect_device() -> Literal["cuda", "mps", "cpu"]:
try:
import torch
import mindspore as ms

The import on line 515 would be better placed after line 520: users on cuda or mps would then return directly without ever importing mindspore.


Why use mindspore here instead of torch_npu directly?
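The reviewer's ordering suggestion can be sketched as follows. This is a hypothetical reworking of `detect_device`, not the PR's actual code: CUDA and MPS are probed first so that those users never pay for the mindspore import, and the mindspore check (assuming `mindspore.get_context("device_target")` reports `"Ascend"` on NPU machines) only runs as a fallback.

```python
from typing import Literal


def detect_device() -> Literal["cuda", "mps", "npu", "cpu"]:
    # Probe torch backends first; cuda/mps users return here
    # without ever importing mindspore.
    try:
        import torch

        if torch.cuda.is_available():
            return "cuda"
        if torch.backends.mps.is_available():
            return "mps"
    except Exception:
        pass
    # Only reached when CUDA/MPS are unavailable: try Ascend NPU
    # via mindspore (hypothetical check, per this PR's approach).
    try:
        import mindspore as ms

        if ms.get_context("device_target") == "Ascend":
            return "npu"
    except Exception:
        pass
    return "cpu"
```

On machines with neither framework installed, the `try`/`except` guards fall through and the function degrades gracefully to `"cpu"`.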

@glide-the (Collaborator)

Hi — we are not planning to maintain contributions to 2.0; the main branch already carries the 3.0 code. Thank you for this contribution, and we look forward to your next one.

3.0 loads local models through a third-party platform; for local-model support, please refer to the README.

@glide-the glide-the closed this Jun 25, 2024

4 participants