Improve JSON format prompt for large chunks & Handle ZeroDivisionError #982

Manav916 · 2024-05-21T14:02:45Z

Description

In this PR there are three different changes

Changes

Fixed a Typo in the filter_question_prompt Instruction from Asses to Assess
Added a try-except block for handling ZeroDivisionError for the filter method in the NodeFilter class
Improved the JSON_FORMAT_INSTRUCTIONS for better output generation
It seems like the LLM sometimes loses track of the imposed instruction for the output, especially for long prompts. So although the output generated for a chunk_size of 512 is perfect, the output for a chunk_size of 1024 has an extra newline. Here parsing fails when using PydanticOutputParser even though the llm has generated an output and there is only an extra '\n' as shown in the instances below.

But by tuning the prompt and adding Please output your response in the demanded json format. at the end of the instruction we get output without '\n'. This output can then be parsed and the context can be considered.

This commit introduces error handling for ZeroDivisionError in the filter method of the NodeFilter class. This change ensures that the application gracefully handles cases where division by zero occurs, setting the score to 0 by default.

Modify the JSON_FORMAT_INSTRUCTIONS in output_parser.py to ensure better JSON output handling by LLMs, particularly for larger chunk sizes. This change helps maintain the structure of the output without newlines, which optimizes parsing by PydanticOutputParser and reduces failures due to formatting issues in long prompts.

jjmachan · 2024-05-22T12:17:38Z

@shahules786 could you check if we can merge this in?

src/ragas/testset/filters.py

Co-authored-by: Massimiliano Pronesti <[email protected]>

Manav916 and others added 4 commits May 21, 2024 12:18

fix typo in prompt

124acf1

add try-except block to handle ZeroDivisionError

47d3a1a

This commit introduces error handling for ZeroDivisionError in the filter method of the NodeFilter class. This change ensures that the application gracefully handles cases where division by zero occurs, setting the score to 0 by default.

Merge branch 'explodinggradients:main' into dev

5a589f2

dosubot bot added the size:XS This PR changes 0-9 lines, ignoring generated files. label May 21, 2024

jjmachan requested a review from shahules786 May 22, 2024 12:17

Merge branch 'explodinggradients:main' into dev

d991aa1

mspronesti suggested changes May 24, 2024

View reviewed changes

src/ragas/testset/filters.py Outdated Show resolved Hide resolved

Manav916 and others added 3 commits May 24, 2024 20:35

Update src/ragas/testset/filters.py

8f9cf25

Co-authored-by: Massimiliano Pronesti <[email protected]>

fix(testset): reinitialize docstore for generation with new documents

4d596bb

Merge branch 'dev' of https://github.com/Manav916/ragas into dev

201eefd

dosubot bot added size:S This PR changes 10-29 lines, ignoring generated files. and removed size:XS This PR changes 0-9 lines, ignoring generated files. labels May 31, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve JSON format prompt for large chunks & Handle ZeroDivisionError #982

Improve JSON format prompt for large chunks & Handle ZeroDivisionError #982

Manav916 commented May 21, 2024 •

edited

Loading

jjmachan commented May 22, 2024

Improve JSON format prompt for large chunks & Handle ZeroDivisionError #982

Are you sure you want to change the base?

Improve JSON format prompt for large chunks & Handle ZeroDivisionError #982

Conversation

Manav916 commented May 21, 2024 • edited Loading

Description

Changes

jjmachan commented May 22, 2024

Manav916 commented May 21, 2024 •

edited

Loading