This is the repository holding code and data for "FrugalML: How to Use ML Prediction APIs More Accurately and Cheaply".
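FrugalML's core idea is a calibrated cascade over commercial prediction APIs: call a cheap service first and escalate to a stronger, pricier one only when confidence is low. Below is a minimal sketch of that cascade idea, with hypothetical stand-in APIs and a fixed threshold rather than the paper's learned strategy:

```python
# Minimal sketch of the FrugalML-style cascade: query a cheap API first,
# and escalate to a stronger, pricier API only when the base service's
# confidence falls below a threshold. Both APIs and the threshold are
# hypothetical stand-ins, not the paper's actual learned strategy.

def cheap_api(x):
    """Hypothetical low-cost service: returns (label, confidence)."""
    return "cat", 0.62

def expensive_api(x):
    """Hypothetical high-accuracy service: returns (label, confidence)."""
    return "dog", 0.97

def frugal_predict(x, threshold=0.8):
    label, conf = cheap_api(x)      # always pay the cheap price
    if conf >= threshold:           # confident enough: stop here
        return label
    label, _ = expensive_api(x)     # otherwise pay for the strong model
    return label

print(frugal_predict("image.jpg"))  # -> "dog" (base confidence 0.62 < 0.8)
```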
Bringing local LLMs to a Minecraft front-end through commands.
AccIo - Enterprise LLM: Unifying intelligence at your command!
A Python-based WebSocket server for CLI LLaVA inference.
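One plausible shape for such a bridge, sketched with the third-party `websockets` package (assuming a recent version with single-argument handlers); `run_llava` is a hypothetical stand-in for the actual CLI/model call:

```python
# Sketch of a WebSocket front-end for a local inference process.
import asyncio
import websockets

def run_llava(prompt: str) -> str:
    # Placeholder for invoking LLaVA (e.g., via subprocess or a loaded model).
    return f"[LLaVA reply to: {prompt}]"

async def handler(websocket):
    async for prompt in websocket:      # each incoming message is one prompt
        reply = run_llava(prompt)
        await websocket.send(reply)     # send the answer back to the client

async def main():
    async with websockets.serve(handler, "localhost", 8765):
        await asyncio.Future()          # serve forever

if __name__ == "__main__":
    asyncio.run(main())
```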
Mamba for Vision, Perception and Action
A detailed code explanation of Google's Gemini LLM.
HugNLP is a unified and comprehensive NLP library based on HuggingFace Transformers. Hugging for NLP now! 😊 HugNLP will be released to @HugAILab.
Logical verification of probabilistic/language model 'intuitions'.
Inference Llama 2 in one file of pure C
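What that single C file implements is, at its core, a plain autoregressive loop: run the transformer forward, pick a next token from the logits, append, repeat. A toy Python sketch of that loop, with a random stub in place of the real forward pass:

```python
# The heart of llama2.c is a plain autoregressive loop: feed the tokens so
# far through the transformer, pick the next token from the logits, repeat.
import numpy as np

VOCAB_SIZE = 32               # toy vocabulary

def forward(tokens):
    """Hypothetical stand-in for the transformer forward pass:
    returns logits over the vocabulary for the next token."""
    rng = np.random.default_rng(seed=len(tokens))
    return rng.normal(size=VOCAB_SIZE)

def generate(prompt_tokens, steps=10):
    tokens = list(prompt_tokens)
    for _ in range(steps):
        logits = forward(tokens)
        next_token = int(np.argmax(logits))   # greedy decoding
        tokens.append(next_token)
    return tokens

print(generate([1, 5, 7]))
```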
Specify what you want it to build, the AI asks for clarification, and then builds it.
Automating the deployment of the Takeoff Server on AWS for LLMs
Creating a workflow to train T5 language models.
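A minimal sketch of what one training step in such a workflow could look like with Hugging Face Transformers; the single hard-coded example pair is for illustration only:

```python
# One T5 fine-tuning step: tokenize an input/target pair, let the model
# compute the seq2seq loss internally, backpropagate, and update weights.
import torch
from transformers import T5ForConditionalGeneration, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("t5-small")
model = T5ForConditionalGeneration.from_pretrained("t5-small")
optimizer = torch.optim.AdamW(model.parameters(), lr=3e-4)

inputs = tokenizer("translate English to German: Hello", return_tensors="pt")
labels = tokenizer("Hallo", return_tensors="pt").input_ids

model.train()
outputs = model(**inputs, labels=labels)   # loss computed from the labels
outputs.loss.backward()
optimizer.step()
print(float(outputs.loss))
```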
Experimental autonomous AI LLM & RAG IETF reviewer
A custom framework for easy use of LLMs, VLMs, etc. supporting various modes and settings via web-ui
A platform to test multiple LLM models inside a RAG workflow, to choose the best model for embedding and retrieval and the best prompt for the use case.
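The core of such a platform is a retrieval-evaluation harness: plug in a candidate embedding model, retrieve the top-k documents by similarity, and score recall against labeled query-document pairs. A toy sketch, with a hypothetical character-frequency embedder standing in for real models:

```python
# Retrieval harness: given an embedding function, rank documents by cosine
# similarity and score recall@k on labeled (query, gold document) pairs.
import numpy as np

def toy_embed(text: str) -> np.ndarray:
    """Hypothetical embedder: unit-norm character-frequency vector (demo only)."""
    v = np.zeros(128)
    for ch in text.lower():
        v[min(ord(ch), 127)] += 1.0
    return v / (np.linalg.norm(v) + 1e-9)

def recall_at_k(embed, docs, pairs, k=1):
    doc_vecs = np.stack([embed(d) for d in docs])
    hits = 0
    for query, gold_idx in pairs:
        sims = doc_vecs @ embed(query)          # cosine: vectors are unit-norm
        if gold_idx in np.argsort(sims)[::-1][:k]:
            hits += 1
    return hits / len(pairs)

docs = ["the cat sat", "stock prices fell", "paris is in france"]
pairs = [("a cat was sitting", 0), ("where is paris", 2)]
print(recall_at_k(toy_embed, docs, pairs, k=1))
```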
Code and analysis for optimizing dynamic neural networks: this project investigates and implements various optimization techniques to enhance them.
A plug-and-play implementation of "Tree of Thoughts: Deliberate Problem Solving with Large Language Models" that elevates model reasoning by at least 70%.
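Stripped to its skeleton, Tree of Thoughts is a breadth-first deliberate search: expand each partial solution into candidate thoughts, score them, and keep only the best few at each depth. A toy sketch, with hypothetical `propose` and `score` stubs where the real method would call an LLM:

```python
# Tree of Thoughts skeleton: at each depth, expand every surviving partial
# solution into candidate "thoughts", score them, and prune to the best b.

def propose(state):
    """Hypothetical generator: extend a partial solution in a few ways."""
    return [state + [c] for c in ("a", "b", "c")]

def score(state):
    """Hypothetical evaluator: higher is more promising."""
    return -sum(1 for s in state if s == "c")   # toy: penalize 'c' steps

def tree_of_thoughts(depth=3, breadth=2):
    frontier = [[]]                              # start from the empty state
    for _ in range(depth):
        candidates = [s for state in frontier for s in propose(state)]
        candidates.sort(key=score, reverse=True)
        frontier = candidates[:breadth]          # keep only the best b states
    return frontier[0]

print(tree_of_thoughts())
```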
This repository contains a question-answering model exposed as an interface that retrieves answers from a vector database. Embeddings (tokenized vectors) are computed via the OpenAI API and inserted into ChromaDB for retrieval-augmented generation (RAG). An OpenAI API key is required to run this service.
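A sketch of that pipeline, assuming the `openai>=1.0` client and ChromaDB's in-memory client (the embedding model name is an assumption); an `OPENAI_API_KEY` must be set in the environment:

```python
# Embed documents with the OpenAI API, store them in ChromaDB, then embed
# the question and retrieve the closest passage for RAG.
import chromadb
from openai import OpenAI

openai_client = OpenAI()
chroma = chromadb.Client()
collection = chroma.get_or_create_collection("docs")

def embed(texts):
    resp = openai_client.embeddings.create(
        model="text-embedding-3-small", input=texts)
    return [d.embedding for d in resp.data]

docs = ["Paris is the capital of France.", "The 2020 Olympics were in Tokyo."]
collection.add(ids=["d0", "d1"], documents=docs, embeddings=embed(docs))

question = "Where were the 2020 Olympics held?"
result = collection.query(query_embeddings=embed([question]), n_results=1)
print(result["documents"][0][0])   # retrieved passage to ground the answer
```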