Add chatglm2 & chatglm3 autotp #5540
base: master
Conversation
@@ -476,7 +481,9 @@ def _replace_module(self, r_module, prev_name='', prev_class_name=''):

     def get_model_num_kv_heads(self, config):
         num_kv_heads = None
-        kv_head_names = ['num_kv_heads', 'num_key_value_heads', 'num_attention_heads', 'n_heads']
+        kv_head_names = [
+            'multi_query_group_num', 'num_kv_heads', 'num_key_value_heads', 'num_attention_heads', 'n_heads'
+        ]
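For readers outside the diff context, here is a minimal standalone sketch of the lookup pattern this hunk extends: probe a list of config attribute names in priority order and return the first one present. This is illustrative, not the exact DeepSpeed implementation (which lives as a method on the replace-module class):

```python
# Illustrative sketch: resolve the KV-head count from a HF-style config
# by trying known attribute names in priority order.
def get_model_num_kv_heads(config):
    num_kv_heads = None
    kv_head_names = [
        'multi_query_group_num',  # ChatGLM2/3 (added by this PR)
        'num_kv_heads', 'num_key_value_heads', 'num_attention_heads', 'n_heads'
    ]
    for name in kv_head_names:
        if hasattr(config, name):
            num_kv_heads = getattr(config, name)
            if num_kv_heads is not None:
                break
    return num_kv_heads
```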
@Yejing-Lai as this list gets longer, it would help if newly added entries had comments noting which model each entry serves, in case this piece of information is needed one day.
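One way to follow this suggestion, with hypothetical per-entry annotations (the model attributions below are my best guesses, not taken from the PR):

```python
kv_head_names = [
    'multi_query_group_num',  # ChatGLM2/3 (this PR)
    'num_kv_heads',           # e.g. Falcon
    'num_key_value_heads',    # e.g. Llama-family configs
    'num_attention_heads',    # generic fallback when KV heads == attention heads
    'n_heads',                # e.g. MPT
]
```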
Hi @loadams, this PR is ready for review. Can it be reviewed? Thanks!
Hi @loadams. The CI error seems like a network issue. Could you rerun the CI? Thanks!
This PR aims to enable chatglm2 & chatglm3 AutoTP. Similar to Phi-3, these models use a chunked MLP layer, so we adjust the weight order with the `shard_mlp_chunk` function. Please kindly review~ Thanks!
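For reviewers unfamiliar with the chunked-MLP layout: ChatGLM2/3 fuse the gate and up projections into a single `dense_h_to_4h` weight, so a naive row split across tensor-parallel ranks would hand some ranks only gate rows and others only up rows. A minimal sketch of the reordering idea follows; the real `shard_mlp_chunk` signature and its placement inside DeepSpeed may differ:

```python
import torch

def shard_fused_mlp_weight(weight: torch.Tensor, tp_size: int, rank: int) -> torch.Tensor:
    """Shard a fused [2 * intermediate, hidden] gate/up weight for one TP rank.

    Split the fusion first, shard each half, then re-fuse the matching
    slices so every rank holds aligned gate and up rows.
    """
    gate, up = weight.chunk(2, dim=0)              # undo the gate/up fusion
    gate_shard = gate.chunk(tp_size, dim=0)[rank]  # this rank's gate rows
    up_shard = up.chunk(tp_size, dim=0)[rank]      # this rank's up rows
    return torch.cat([gate_shard, up_shard], dim=0)
```

With `tp_size=2`, rank 0 ends up with the first half of the gate rows stacked on the first half of the up rows, which keeps the elementwise gate/up product local to each rank.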