RAGchain

RAGchain is a framework for developing advanced RAG(Retrieval Augmented Generation) workflow powered by LLM (Large Language Model). While existing frameworks like Langchain or LlamaIndex allow you to build simple RAG workflows, they have limitations when it comes to building complex and high-accuracy RAG workflows.

RAGchain is designed to overcome these limitations by providing powerful features for building advanced RAG workflow easily. Also, it is partially compatible with Langchain, allowing you to leverage many of its integrations for vector storage, embeddings, document loaders, and LLM models.

Docs | API Spec | QuickStart

Quick Install

pip install RAGchain

Why RAGchain?

RAGchain offers several powerful features for building high-quality RAG workflows:

OCR Loaders

Simple file loaders may not be sufficient when trying to enhance accuracy or ingest real-world documents. OCR models can scan documents and convert them into text with high accuracy, improving the quality of responses from LLMs.

Reranker

Reranking is a popular method used in many research projects to improve retrieval accuracy in RAG workflows. Unlike LangChain, which doesn't include reranking as a default feature, RAGChain comes with various rerankers.

Great to use multiple retrievers

In real-world scenarios, you may need multiple retrievers depending on your requirements. RAGchain is highly optimized for using multiple retrievers. It divides retrieval and DB. Retrieval saves vector representation of contents, and DB saves contents. We connect both with Linker, so it is really easy to use multiple retrievers and DBs.

pre-made RAG pipelines

We provide pre-made pipelines that let you quickly set up RAG workflow. We are planning to make much complex pipelines, which hard to make but powerful. With pipelines, you can build really powerful RAG system quickly and easily.

Easy benchmarking

It is crucial to benchmark and test your RAG workflows. We have easy benchmarking module for evaluation. Support your own questions and various datasets.

Installation

From pip

simply install at pypi.

pip install RAGchain

From source

First, clone this git repository to your local machine.

git clone https://github.com/Marker-Inc-Korea/RAGchain.git
cd RAGchain

Then, install RAGchain module.

python3 setup.py develop

For using files at root folder and test, run dev requirements.

pip install dev_requirements.txt

Supporting Features

Advanced RAG features

Retrievals

BM25
Vector DB
Hybrid (rrf and cc)
HyDE

OCR Loaders

Rerankers

UPR
TART
BM25
LLM
MonoT5

Web Search

Google Search
Bing Search

Workflows (pipeline)

Basic
Visconde
Rerank
Google Search

Extra utils

Query Decomposition
Evidence Extractor
REDE Search Detector
Semantic Clustering
Cluster Time Compressor

Dataset Evaluators

Contributing

We welcome any contributions. Please feel free to raise issues and submit pull requests.

Acknowledgement

This project is an early version, so it can be unstable. The project is licensed under the Apache 2.0 License.

Name		Name	Last commit message	Last commit date
Latest commit History 956 Commits
RAGchain		RAGchain
docs		docs
tests		tests
.env.template		.env.template
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
dev_requirements.txt		dev_requirements.txt
pytest_template.ini		pytest_template.ini
setup.py		setup.py
tox.ini		tox.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RAGchain

Quick Install

Why RAGchain?

OCR Loaders

Reranker

Great to use multiple retrievers

pre-made RAG pipelines

Easy benchmarking

Installation

From pip

From source

Supporting Features

Advanced RAG features

Retrievals

OCR Loaders

Rerankers

Web Search

Workflows (pipeline)

Extra utils

Dataset Evaluators

Contributing

Acknowledgement

About

Releases 12

Packages

Contributors 5

Languages

License

Marker-Inc-Korea/RAGchain

Folders and files

Latest commit

History

Repository files navigation

RAGchain

Quick Install

Why RAGchain?

OCR Loaders

Reranker

Great to use multiple retrievers

pre-made RAG pipelines

Easy benchmarking

Installation

From pip

From source

Supporting Features

Advanced RAG features

Retrievals

OCR Loaders

Rerankers

Web Search

Workflows (pipeline)

Extra utils

Dataset Evaluators

Contributing

Acknowledgement

About

Resources

License

Stars

Watchers

Forks

Releases 12

Packages 0

Contributors 5

Languages

Packages