
Unlock Local AI Processing in Obsidian (feature request) #302

Open
dicksensei69 opened this issue Aug 3, 2023 · 23 comments

Comments

@dicksensei69

dicksensei69 commented Aug 3, 2023

I'm writing to request a feature that would allow users to easily switch between different AI APIs within obsidian-smart-connections. Specifically, I'm interested in being able to toggle between the OpenAI API and emerging alternatives like Oobabooga's textgen and Llamacpp.

These new services offer exciting capabilities like local embeddings and on-device processing that could enhance the Obsidian experience, especially for users who want to avoid sending personal data to third parties. I've found where the API endpoint is configured in the code, and with some tweaking I may be able to switch between them manually. However, having an official option to select different APIs would provide a much smoother experience.

For those wondering, the API endpoint is currently specified in multiple locations, the first being on line 1043 of main.js:

    url: "https://api.openai.com/v1/embeddings",

line 2666:

    const url = "https://api.openai.com/v1/chat/completions";

line 2719:

    url: "https://api.openai.com/v1/chat/completions",

To manually change the API, these endpoints could be modified to point to a local service like Oobabooga, or to another provider such as Anthropic. However, this involves directly editing the source code, which is cumbersome.

Ideally, there could be a function that defaults to OpenAI, but allows the API URL to be easily configured as a setting. Users could then switch to local IPs or services with just a simple configuration change. Furthermore, if this setting was exposed through the GUI, it would enable seamless API swapping without any code editing required.
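
Something along these lines is what I have in mind (the setting name and helper below are only illustrative, not the plugin's actual code): a single base-URL setting that defaults to OpenAI and from which each endpoint is derived.

    // Hypothetical settings shape: the base URL defaults to OpenAI but can point
    // at any OpenAI-compatible server (e.g. "http://localhost:5000/v1").
    const DEFAULT_SETTINGS = {
      api_base_url: "https://api.openai.com/v1",
      api_key: "",
    };

    // Build an endpoint from the configured base URL instead of hard-coding it.
    function endpoint(settings, path) {
      return `${settings.api_base_url.replace(/\/+$/, "")}/${path}`;
    }

    async function requestEmbedding(settings, input) {
      const resp = await fetch(endpoint(settings, "embeddings"), {
        method: "POST",
        headers: {
          "Content-Type": "application/json",
          Authorization: `Bearer ${settings.api_key}`,
        },
        body: JSON.stringify({ model: "text-embedding-ada-002", input }),
      });
      if (!resp.ok) throw new Error(`Embedding request failed: ${resp.status}`);
      return (await resp.json()).data[0].embedding;
    }

Everything else in the plugin could keep working the same way; only the hard-coded hostnames would go away.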

The open source ecosystem is rapidly evolving, and empowering users to take advantage of these new innovations aligns with Obsidian's ethos of flexibility and customization. Many of us would love to rely on our own local hardware for AI processing rather than being locked into a single provider.

Thank you for your consideration. Obsidian has been invaluable for my workflow, and I'm excited by its potential to integrate some of these cutting-edge AI capabilities in a privacy-preserving way. Enabling easy API switching would be a major step forward. Please let me know if I can provide any other details!

@dicksensei69 changed the title from "Dropdown or space in settings to change the openai url (feature request)" to "Unlock Local AI Processing in Obsidian (feature request)" on Aug 3, 2023
@dragos240

I may make a PR for this. I've gotten it to work on my local instance of text-generation-webui. All that needs to be done to change the URL is to open main.js and replace the OpenAI API base URL with your own. For it to work with text-generation-webui, you'll need to enable the openai extension, which mimics the endpoints of the OpenAI API. One thing I am not entirely sure about is how the embeddings play with it. I'm testing it out now.
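
For anyone trying the same manual edit, it is essentially a one-line change in main.js (the local address below is just an example; use whatever address your text-generation-webui openai extension is actually listening on):

    // In main.js, replace the hard-coded OpenAI endpoint, e.g.:
    // const url = "https://api.openai.com/v1/chat/completions";
    // with your local OpenAI-compatible endpoint (example address only):
    const url = "http://127.0.0.1:5001/v1/chat/completions";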

@dragos240

Nevermind, someone beat me to it

@nomadphase

In case this is not prioritised here, it may be useful to look at the Khoj/Obsidian plugin, which is open source and supports Llama 2.

@brianpetro
Owner

@nomadphase thanks for sharing that project.

I checked it out and it does require a separate desktop application to be installed to use the Obsidian plugin. This is the route I expect will be necessary to utilize local models with Obsidian.

While there hasn't been much publicly to see lately in terms of plugin updates, I have been doing a lot in private that will have big implications for this plugin. For example, allowing the Smart Chat to add to and edit notes is just one long weekend away.

And during my weekday work, I've been chugging away at something that, when it makes its way into Obsidian, will be unlike anything else I've seen publicly as far as AI tools are concerned. To clarify why I bring this up now, I've been focussing on using GPT-3.5 for that project because I want the result to be compatible with local models. Basically, my hypothesis is that, if I can make it work with GPT-3.5, then the same functionality should work with local models very soon.

It's still been tough to find a local model for the embeddings that beats OpenAI's ada embeddings. If anyone comes across anything, please let me know.

And lastly, thanks everyone (@dragos240 @dicksensei69 ) for your interest in Smart Connections and I'm looking forward to making more updates soon!

Now back to it,
Brian 🌴

@ReliablyAwkward

I'd love to be kept updated on this topic. I'm currently experimenting with Docker for Windows to run LocalAI, and the kind of capabilities the owner hinted at above would be a genuine game changer.

@wenlzhang

Here are some local LLM related tools that might be of interest:

@huachuman

What about using g4f?

https://github.com/xtekky/gpt4free
https://github.com/xiangsx/gpt4free-ts

@brianpetro
Owner

@wenlzhang @huachuman thanks for the resources!

I'm still reviewing options and requirements, but I think we're pretty close to having a local embedding model.

The chat models still require an amount of hardware resources that makes me pause, but we can do a lot with embeddings alone. And if we were to still use OpenAI for the chat responses while relying on a local embedding model, that would also significantly reduce a vault's exposure to OpenAI, as only the context used for a specific query would be sent to their servers.
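
As a toy illustration of that split (made-up function names and data shapes, not Smart Connections code), the ranking can happen entirely locally, and only the top-scoring notes ever need to accompany a chat request:

    // Toy example: rank notes against a query vector locally, so only the few
    // best-matching excerpts need to be sent along with a chat request.
    function cosine(a, b) {
      const dot = a.reduce((sum, v, i) => sum + v * b[i], 0);
      const norm = (v) => Math.sqrt(v.reduce((sum, x) => sum + x * x, 0));
      return dot / (norm(a) * norm(b));
    }

    // notes: [{ path, text, vec }], where vec comes from a local embedding model.
    function topMatches(queryVec, notes, k = 3) {
      return [...notes]
        .sort((a, b) => cosine(queryVec, b.vec) - cosine(queryVec, a.vec))
        .slice(0, k);
    }
    // Only the text of these top matches would be sent to the remote chat model.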

🌴

@joelmnz

joelmnz commented Nov 20, 2023

In addition to local LLM support, would you consider an LLM router such as https://withmartian.com/ , which claims faster speeds and reduced costs?

I haven't tried this service yet, but if it would be considered, I'd be happy to investigate further.

@barshag

barshag commented Jan 20, 2024

any updates on how to connect Ollama ?

@brianpetro
Owner

brianpetro commented Jan 20, 2024

@barshag

V2.1 will enable configuring API endpoints for the chat model. I can't say how featureful this option will be compared to what's possible with the OpenAI API, especially since I intend to add significant capabilities via function calling in v2.1 and I'm not up to date on where local models stand in that regard. But the configuration should allow integration with local models for anyone who can set up a model locally and access it via localhost.

I hope that helps!

🌴

@benabhi

benabhi commented Feb 29, 2024

I desperately need this feature ^^ I tried editing the OpenAI URLs in main.js and pointing them to my local LLM running in LM Studio, but it didn't work.

@wwjCMP

wwjCMP commented Mar 1, 2024

LM Studio provides proxy functionality compatible with the OpenAI API.
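
A quick way to sanity-check that proxy before pointing anything at it (LM Studio's local server is commonly reachable on port 1234, but check your own setup):

    // Minimal smoke test of a local OpenAI-compatible server (address is an example).
    (async () => {
      const resp = await fetch("http://localhost:1234/v1/chat/completions", {
        method: "POST",
        headers: { "Content-Type": "application/json" },
        body: JSON.stringify({
          model: "local-model", // typically whatever model the local server has loaded
          messages: [{ role: "user", content: "Say hello" }],
        }),
      });
      const data = await resp.json();
      console.log(data.choices[0].message.content);
    })();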

@brianpetro
Owner

@wwjCMP yes, it does.

I've already connected it in my development version of Smart Connections.

Configurable endpoints/models is just one of the chat features that will be rolling out with v2.1. Still got a few things I'm working on, but it should be rolling out pretty soon as an early access option for supporters.

🌴

@UBy

UBy commented Mar 2, 2024

Support for an OpenRouter connection would be huge, as it gives you access to a great number of models through the same API: https://openrouter.ai/docs#models
Maybe this is a bit off topic, but it's related, as this is just another custom endpoint configuration.

@brianpetro
Owner

@UBy that looks interesting, thanks for the tip.

@Korayem

Korayem commented Mar 5, 2024

Support for an OpenRouter connection would be huge, as it gives you access to a great number of models through the same API: https://openrouter.ai/docs#models Maybe this is a bit off topic, but it's related, as this is just another custom endpoint configuration.

was about to post a Feature Request but did a search first and found your comment @UBy

Glad @brianpetro likes it!

@leethobbit

Thanks for the great plugin - I'd like to add to the requests for local LLM usage. If we're allowed to modify the base_url for the model, can we ensure it will work beyond just localhost? I think a lot of us are hosting our models on servers or gaming desktops at the moment, and I definitely can't run anything locally on my laptop.

Very excited for this! Sending data to a 3rd party like OpenAI is a showstopper for me and most people I know that are dabbling in the LLM space currently.

@brianpetro
Owner

Hey @leethobbit , happy to hear you like Smart Connections 😊

Custom local chat models are already partially available in the v2.1 early release. I say partially because none of the people helping me beta test v2.1 seem to have tried it. I could get it working in my tests, but the local models I was testing with, the only ones I could run on an M2 8GB Mac, returned mostly gibberish.

The current implementation allows custom configuration over localhost, but I've already decided to expose all configuration options for the "custom" models. That would allow using any hostname, as long as the endpoint accepts the OpenAI API format, which is probably what you're looking for to access your gaming machine from your laptop. But this hasn't been a priority, since no one participating in the early release has indicated any interest in, or use of, the local chat models.
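
As a hypothetical illustration (not the actual settings UI), the only real difference from a localhost setup would be the hostname the plugin is told to use:

    // Hypothetical config shape: same OpenAI-compatible endpoint, but served from
    // another machine on the LAN instead of localhost (address is an example only).
    const remoteChatConfig = {
      protocol: "http",
      hostname: "192.168.1.50", // e.g. the gaming desktop running the model server
      port: 5000,
      path: "/v1/chat/completions",
      model: "local-model",
    };
    const url = `${remoteChatConfig.protocol}://${remoteChatConfig.hostname}:${remoteChatConfig.port}${remoteChatConfig.path}`;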

Maybe you can help work out the bugs once v2.1 becomes a general release, which should be relatively soon, as I have some other big updates I'm looking forward to implementing in v2.2.

Thanks for participating in the Smart Connections community!
🌴

@brianpetro
Owner

Update: Thanks to the help of an individual who prefers to remain unnamed, I got the Smart Chat working with Ollama. So far, the new settings for the local chat model look like this in the v2.1 early release:
[Screenshot: Smart Chat local model settings in the v2.1 early release, Apr 1, 2024]
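
For anyone setting this up themselves, the values typically end up looking something like this (a hypothetical sketch, not the exact settings fields; 11434 is just Ollama's usual default port):

    // Hypothetical example of an OpenAI-compatible chat config pointed at Ollama.
    // The model name must match one you have pulled locally (e.g. via `ollama pull llama2`).
    const ollamaChatConfig = {
      protocol: "http",
      hostname: "localhost",
      port: 11434,                   // Ollama's usual default port
      path: "/v1/chat/completions",  // Ollama's OpenAI-compatible endpoint
      model: "llama2",
      api_key: "",                   // not required by a local Ollama server
    };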
🌴

@Allwaysthismoment

Allwaysthismoment commented Apr 1, 2024 via email

@wwjCMP

wwjCMP commented Apr 22, 2024

Update: Thanks to the help of an individual who prefers to remain unnamed, I got the Smart Chat working with Ollama. So far, the new settings for the local chat model look like this in the v2.1 early release: [Screenshot: Smart Chat local model settings] 🌴

Does this also support using an embedding model run through Ollama?

@brianpetro
Owner

@wwjCMP embedding through Ollama is not yet supported. If this is something you're interested in, please make a feature request here https://github.com/brianpetro/obsidian-smart-connections/issues
