P-tuuning的一些问题 #18

Ming-Qin-tech · 2021-05-12T07:09:14Z

1.你好，我想问一下，在P-tunning中，[Mask]在一众[unused]中得位置是怎么确定的？是人工选择的吗？如果不是的话，是根据什么方式确定的？
2.原论文中写的当数据量比较少的时候用的anchor-word，比如预测“英国首都”，在几个[unused]中加一个[capital]效果会比较好，这个[capital]应该加在哪个位置是如何确定的呢？

Riroaki · 2021-05-27T11:22:37Z

1.你好，我想问一下，在P-tunning中，[Mask]在一众[unused]中得位置是怎么确定的？是人工选择的吗？如果不是的话，是根据什么方式确定的？
2.原论文中写的当数据量比较少的时候用的anchor-word，比如预测“英国首都”，在几个[unused]中加一个[capital]效果会比较好，这个[capital]应该加在哪个位置是如何确定的呢？

不是作者哈，试着说一下自己的理解：

这个项目中的prmopt没有使用[unused] token，这里的[Mask]就和manual prmopt的mask位置一致。你看到的可能是苏剑林大佬文章中用了unused token的setting，他的代码在这里：https://github.com/bojone/P-tuning
这个项目中其实prompt中大部分的token都是anchor-word，具体到PT_Fewshot/data_utils/task_pvp.py中你可以看各个task的prompt。比如说Rte任务的prompt格式如下：

P-tuning/PT-Fewshot/data_utils/task_pvps.py

Line 288 in 368ab85

string_list_a = [text_a, 'Question:', text_b, "?", "the", "Answer:", self.mask, "."]

它对应的block_flag_a是：

P-tuning/PT-Fewshot/data_utils/task_pvps.py

Line 290 in 368ab85

block_flag_a = [0, 0, 0, 0, 1, 0, 0, 0]

其中第5个值为1，表示这个词是可以替换成LSTM embedding的，在这个prompt中对应the这个token。
其他的PVP同理，总之它目前的实现中基本除了少部分block_flag==1位置的token以外都是anchor token。
至于为什么这么选择，大概是因为这些token包含的语义信息比较少，替换掉也没事，效果稍微调一下也能上去……

terrifyzhao · 2021-06-30T08:20:59Z

1.你好，我想问一下，在P-tunning中，[Mask]在一众[unused]中得位置是怎么确定的？是人工选择的吗？如果不是的话，是根据什么方式确定的？
2.原论文中写的当数据量比较少的时候用的anchor-word，比如预测“英国首都”，在几个[unused]中加一个[capital]效果会比较好，这个[capital]应该加在哪个位置是如何确定的呢？

不是作者哈，试着说一下自己的理解：

这个项目中的prmopt没有使用[unused] token，这里的[Mask]就和manual prmopt的mask位置一致。你看到的可能是苏剑林大佬文章中用了unused token的setting，他的代码在这里：https://github.com/bojone/P-tuning

这个项目中其实prompt中大部分的token都是anchor-word，具体到PT_Fewshot/data_utils/task_pvp.py中你可以看各个task的prompt。比如说Rte任务的prompt格式如下：

P-tuning/PT-Fewshot/data_utils/task_pvps.py

Line 288 in 368ab85

string_list_a = [text_a, 'Question:', text_b, "?", "the", "Answer:", self.mask, "."]

它对应的block_flag_a是：

P-tuning/PT-Fewshot/data_utils/task_pvps.py

Line 290 in 368ab85

block_flag_a = [0, 0, 0, 0, 1, 0, 0, 0]

其中第5个值为1，表示这个词是可以替换成LSTM embedding的，在这个prompt中对应the这个token。
其他的PVP同理，总之它目前的实现中基本除了少部分block_flag==1位置的token以外都是anchor token。
至于为什么这么选择，大概是因为这些token包含的语义信息比较少，替换掉也没事，效果稍微调一下也能上去……

看了半天没明天，原来是这个意思，谢谢

rookiebird · 2021-10-12T11:26:56Z

@Riroaki 我也看了代码，好像是这个意思，但是他这个block 和论文上说的有出入：

For instance,for RTE task, the token “?” within prompt template “[PRE][prompt tokens][HYP]?[prompt tokens][MASK]” is specially added as an anchor token and affects the performance a lot.

这代码写的，跟论文上的，不一样。。。。不仅prompt 不一样， anchor word 也不一样。。。。难受

sxthunder · 2021-10-31T06:55:16Z

PVP 是啥意思啊

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

P-tuuning的一些问题 #18

P-tuuning的一些问题 #18

Ming-Qin-tech commented May 12, 2021

Riroaki commented May 27, 2021 •

edited

Loading

terrifyzhao commented Jun 30, 2021

rookiebird commented Oct 12, 2021

sxthunder commented Oct 31, 2021

P-tuuning的一些问题 #18

P-tuuning的一些问题 #18

Comments

Ming-Qin-tech commented May 12, 2021

Riroaki commented May 27, 2021 • edited Loading

terrifyzhao commented Jun 30, 2021

rookiebird commented Oct 12, 2021

sxthunder commented Oct 31, 2021

Riroaki commented May 27, 2021 •

edited

Loading