Treat messages with missing variables as text #1137

anunnakian · 2024-05-21T17:46:47Z

Issue

Change

When we declare an assistant as following example :

interface AiService {

    @UserMessage("What is the capital of {{it}}?")
    String answer();
}

We don't throw an illegal exception like before, we treat the user message like text without resolving the {{it}} variable.

Here is a test that explain the treatment clearly :

@Test
void test_user_message_configuration_8() {

    // given
    AiService aiService = AiServices.builder(AiService.class)
            .chatLanguageModel(chatLanguageModel)
            .build();

    // when-then
    assertThat(aiService.answer()).containsIgnoringCase("");
    verify(chatLanguageModel).generate(singletonList(userMessage("What is the capital of {{it}}?")));
}

General checklist

There are no breaking changes
I have added unit and integration tests for my change
I have manually run all the unit and integration tests in the module I have added/changed, and they are all green
I have manually run all the unit and integration tests in the core and main modules, and they are all green

I have added/updated the documentation
I have added an example in the examples repo (only for "big" features)

Checklist for adding new model integration

I have added my new module in the BOM

Checklist for adding new embedding store integration

I have added a {NameOfIntegration}EmbeddingStoreIT that extends from either EmbeddingStoreIT or EmbeddingStoreWithFilteringIT
I have added my new module in the BOM

Checklist for changing existing embedding store integration

I have manually verified that the {NameOfIntegration}EmbeddingStore works correctly with the data persisted using the latest released version of LangChain4j

langchain4j · 2024-05-22T11:32:36Z

Hi @anunnakian thanks a lot!

This seems to be a bit different from what was described in #1125

This is definitely a wrong configuration:

interface AiService {

    @UserMessage("What is the capital of {{it}}?")
    String answer();
}

The case described in the issue is different:

@AiService
public interface AiAssistant {

    TokenStream chat(@MemoryId String chatId, @UserMessage String userMessage);
}

assistant.chat("12345", "Text containing {{it}}");

anunnakian · 2024-05-22T19:40:14Z

@langchain4j if we consider that the issue was fixed, the following tests must be green right ?

@Test
void test_user_message_configuration_8() {

    // given
    AiService aiService = AiServices.builder(AiService.class)
            .chatLanguageModel(chatLanguageModel)
            .chatMemoryProvider(memoryId -> MessageWindowChatMemory.withMaxMessages(10))
            .build();

    // when
    aiService.chat8("12345", "What is the capital of {{it}}?");

    // then
    verify(chatLanguageModel).generate(singletonList(userMessage("What is the capital of {{it}}?")));
}

@Test
void test_user_message_configuration_9() {

        // given
        AiService aiService = AiServices.builder(AiService.class)
                .chatLanguageModel(chatLanguageModel)
                .chatMemoryProvider(memoryId -> MessageWindowChatMemory.withMaxMessages(10))
                .build();

        // when
        aiService.chat8("12345", "What is the capital of {{variable}}?");

        // then
        verify(chatLanguageModel).generate(singletonList(userMessage("What is the capital of {{variable}}?")));
    }

anunnakian · 2024-05-22T20:14:44Z

We should remove this test case, if the missing variable {{name}} must be treated like text, right ?

@Test
void should_fail_when_value_is_missing() {

   // given
   PromptTemplate promptTemplate = PromptTemplate.from("My name is {{name}}.");

   Map<String, Object> variables = emptyMap();

   // when-then
   assertThatThrownBy(() -> promptTemplate.apply(variables))
           .isExactlyInstanceOf(IllegalArgumentException.class)
           .hasMessage("Value for the variable 'name' is missing");
}

langchain4j · 2024-05-22T20:19:43Z

@anunnakian to your first question: yes.
Second: not sure, is there a way to treat it as a text (instead of template) only for that use case?

anunnakian · 2024-05-22T20:27:54Z

@langchain4j If we treat the message as text instead of a template, we will break the case if we have two variable with one missing like the following test :

interface AiService {

    String chat9(@MemoryId String chatId, @UserMessage String userMessage, @V("country") String country);
}

@Test
void test_user_message_configuration_10() {

    // given
    AiService aiService = AiServices.builder(AiService.class)
            .chatLanguageModel(chatLanguageModel)
            .chatMemoryProvider(memoryId -> MessageWindowChatMemory.withMaxMessages(10))
            .build();

    // when
    aiService.chat9("12345", "What is the {{it}} of {{country}}?", "Germany");

    // then
    verify(chatLanguageModel).generate(singletonList(userMessage("What is the {{it}} of Germany?")));
}

This test must be green too at the end, right ?

anunnakian · 2024-05-24T20:42:05Z

@langchain4j I found a solution to make all tests above and the following one green!
my branch is up to date and waiting for your feedback

interface AiService {
    String chat(@MemoryId String chatId, @UserMessage String userMessage);
}

@Test
void test_user_message_configuration_9() {

    // given
    AiService aiService = AiServices.builder(AiService.class)
            .chatLanguageModel(chatLanguageModel)
            .chatMemoryProvider(memoryId -> MessageWindowChatMemory.withMaxMessages(10))
            .build();

    // when
    aiService.chat("12345", "What is the {{it}} of {{var}}?");

    // then
    verify(chatLanguageModel).generate(singletonList(userMessage("What is the {{it}} of {{var}}?")));
}

anunnakian · 2024-05-24T21:32:10Z

langchain4j-core/src/main/java/dev/langchain4j/spi/prompt/PromptTemplateFactory.java

+         * Get all variables extracted from the template.
+         * @return A set of variable names.
+         */
+        default Set<String> getAllVariables() {


A make this method default to avoid breaking changes

ashimoon · 2024-05-29T01:26:17Z

Should we just always treat missing variables, including {{it}} as an illegal state? I would imagine in the vast majority of cases, if the author created a @UserMessage with a prompt variable in it, then the method signature must match that user message, otherwise an exception should be thrown.

If we do indeed want to support What is the capital of {{it}}? where we want the string to be supported literally without any templating, then could we just add a property to @UserMessage to indicate that?

e.g.

@UserMessage(value = "What is the capital of {{it}}?", templated = false)

public @interface UserMessage {
    String value()
    boolean templated() default true;
}

If we need even further grained control, such that only some variables are excluded from templating, that could also be defined in the annotation?

@UserMessage(value = "What is the capital of {{it}} in {{country}}?", excludedVariables = { "it" })

public @interface UserMessage {
    String value()
    String[] excludedVariables() default {};
}

langchain4j · 2024-05-29T11:56:24Z

@ashimoon yes, I think you are right. We should be strict by default (as it is now).

I think I rushed a bit with #1125. It can be solved by escaping {{it}} instead of making changes to the default behaviour of prompt templates.

@anunnakian sorry about that, let's put this PR on hold for now

anunnakian · 2024-05-29T17:17:08Z

Ok! Don't panic everything is under control (just kidding... 😁)

So, I'll create another PR to escape {{it}} variable tonight

Thanks for your feedback @ashimoon @langchain4j 😉

langchain4j · 2024-05-29T18:09:58Z

@anunnakian I meant that the current implementation (without this PR) indeed seems to be the most reasonable. I guess that specific corner case in #1125 is actually ok and the user can escape manually, if there is a need. Whether we should support auto-escaping, it is tricky. I would take a break and return to this problem a bit later.

anunnakian · 2024-06-07T07:14:27Z

@langchain4j got it ;)

Threat prompt with missing 'it' variable as text

186e604

Merge branch 'main' into it_not_found

6087e94

anunnakian changed the title ~~Threat prompt with missing 'it' variable as text~~ Treat prompt with missing 'it' variable as text May 23, 2024

anunnakian added 7 commits May 23, 2024 11:35

Merge branch 'main' into it_not_found

9dda75e

Merge branch 'main' into it_not_found

50424d6

Merge branch 'main' into it_not_found

5e98f87

Merge branch 'main' into it_not_found

025316d

Merge branch 'main' into it_not_found

3148ebb

Threat prompt with missing variable as text

56d9d28

Fix coverage

adfa2d8

Polish

c67482f

anunnakian commented May 24, 2024

View reviewed changes

anunnakian added 2 commits May 24, 2024 23:36

Polish

cdd12f5

Merge branch 'main' into it_not_found

47e73a7

langchain4j added the P1 Highest priority label May 27, 2024

anunnakian changed the title ~~Treat prompt with missing 'it' variable as text~~ Treat messages with missing variables as text May 27, 2024

anunnakian added 5 commits May 27, 2024 11:26

Merge branch 'main' into it_not_found

9d8871b

Merge branch 'main' into it_not_found

2d9ad28

Merge branch 'main' into it_not_found

77b6039

Merge branch 'main' into it_not_found

0de0181

Merge branch 'main' into it_not_found

d908289

Merge branch 'main' into it_not_found

9a32a77

Merge branch 'main' into it_not_found

1378dad

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Treat messages with missing variables as text #1137

Treat messages with missing variables as text #1137

anunnakian commented May 21, 2024 •

edited

Loading

langchain4j commented May 22, 2024

anunnakian commented May 22, 2024 •

edited

Loading

anunnakian commented May 22, 2024 •

edited

Loading

langchain4j commented May 22, 2024

anunnakian commented May 22, 2024 •

edited

Loading

anunnakian commented May 24, 2024 •

edited

Loading

anunnakian May 24, 2024

ashimoon commented May 29, 2024

langchain4j commented May 29, 2024

anunnakian commented May 29, 2024

langchain4j commented May 29, 2024

anunnakian commented Jun 7, 2024

Treat messages with missing variables as text #1137

Are you sure you want to change the base?

Treat messages with missing variables as text #1137

Conversation

anunnakian commented May 21, 2024 • edited Loading

Issue

Change

General checklist

Checklist for adding new model integration

Checklist for adding new embedding store integration

Checklist for changing existing embedding store integration

langchain4j commented May 22, 2024

anunnakian commented May 22, 2024 • edited Loading

anunnakian commented May 22, 2024 • edited Loading

langchain4j commented May 22, 2024

anunnakian commented May 22, 2024 • edited Loading

anunnakian commented May 24, 2024 • edited Loading

anunnakian May 24, 2024

Choose a reason for hiding this comment

ashimoon commented May 29, 2024

langchain4j commented May 29, 2024

anunnakian commented May 29, 2024

langchain4j commented May 29, 2024

anunnakian commented Jun 7, 2024

anunnakian commented May 21, 2024 •

edited

Loading

anunnakian commented May 22, 2024 •

edited

Loading

anunnakian commented May 22, 2024 •

edited

Loading

anunnakian commented May 22, 2024 •

edited

Loading

anunnakian commented May 24, 2024 •

edited

Loading