Releases: helixml/helix
0.9.19 - fix ollama cleanup bug
What's Changed
Fix a bug that was stopping Ollama servers from being cleaned up on runners. Stale servers were holding on to GPU memory, preventing it from being freed and reallocated, which slowed down model responses. After deploying this change, llama3:70b should be reliably fast on the platform, for example.
- I don't love this approach, but it seems to work in interactive testing. by @lukemarsden in #343
Full Changelog: 0.9.18...0.9.19
0.9.18 - improve llama3:70b performance
What's Changed
Improve llama3:70b performance by shutting down other models more reliably, allowing it to use the full GPU memory
- more reliable approach to shutdown process tree by @lukemarsden in #342
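The PR itself isn't included here, but the standard Unix technique for reliably shutting down a whole process tree is to launch the child in its own process group and signal the group rather than the single parent process. A minimal sketch of that idea (the helper names are illustrative, not Helix's actual code):

```python
import os
import signal
import subprocess
import time


def start_server(cmd):
    # start_new_session=True puts the child in its own process group,
    # so any grandchildren it spawns share that group id.
    return subprocess.Popen(cmd, start_new_session=True)


def stop_server(proc, timeout=5.0):
    # Signal the whole group via killpg, not just the direct child,
    # so worker subprocesses are terminated too.
    pgid = os.getpgid(proc.pid)
    os.killpg(pgid, signal.SIGTERM)
    try:
        proc.wait(timeout=timeout)
    except subprocess.TimeoutExpired:
        # Escalate if the group ignores SIGTERM.
        os.killpg(pgid, signal.SIGKILL)
        proc.wait()


if __name__ == "__main__":
    # The shell spawns a long-sleeping grandchild; killing only the shell
    # would orphan the sleep, but killing the group reaps both.
    p = start_server(["/bin/sh", "-c", "sleep 60 & wait"])
    time.sleep(0.2)
    stop_server(p)
    print("exited:", p.returncode)
```

If only the parent were signalled, the `sleep` grandchild would keep running (and, in the Ollama case, keep holding GPU memory).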
Full Changelog: 0.9.17...0.9.18
0.9.17 - optimize startup
What's Changed
Optimize startup to avoid excess copies when using a bind-mounted cache directory and pre-baked model weights.
- optimize copy by @lukemarsden in #341
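The PR isn't shown here, but the general shape of this kind of optimization is to skip the copy when the destination already holds identical content, for example by comparing size and modification time. A hedged sketch (paths and the helper name are illustrative, not Helix's):

```python
import os
import shutil


def copy_if_changed(src: str, dst: str) -> bool:
    """Copy src to dst only when dst is missing or differs by size/mtime.

    Returns True when a copy actually happened. This mirrors the idea of
    avoiding redundant copies of pre-baked model weights into a
    bind-mounted cache directory on every startup.
    """
    if os.path.exists(dst):
        s, d = os.stat(src), os.stat(dst)
        if s.st_size == d.st_size and int(s.st_mtime) <= int(d.st_mtime):
            return False  # destination looks up to date; skip the copy
    os.makedirs(os.path.dirname(dst) or ".", exist_ok=True)
    shutil.copy2(src, dst)  # copy2 preserves mtime for future comparisons
    return True
```

On the second and subsequent startups the size/mtime check short-circuits, so multi-gigabyte weight files are not re-copied.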
Full Changelog: 0.9.16...0.9.17
0.9.16 - ollama cleanups, discord bot
What's Changed
https://www.youtube.com/watch?v=Fow7iUaKrq4
- Feature/discord bot v0.1 by @rusenask in #339
- kill ollama process group, not just parent process by @lukemarsden in #340
Full Changelog: 0.9.15...0.9.16
0.9.15 - Fix warmups, dev websockets, ollama keepalive
What's Changed
- Stop loading SDXL by default, neatly sidestepping the headache of warmup models over-filling GPU memory. SDXL will still work; it just might take a bit longer on the first request while the weights download.
- Also fix a bug that was stopping llama3:instruct from being loaded as a warmup model.
- Fix a frontend websocket bug in development.
- Make Ollama keep model weights in memory forever, which is what our scheduler is designed for.
- Fix warmups, dev websockets, ollama keepalive by @lukemarsden in #329
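For context (this detail is not spelled out in the release note): Ollama unloads model weights after an idle timeout by default, and its generate/chat API accepts a `keep_alive` field where a negative value asks it to keep the model resident indefinitely. A sketch of a request with that behaviour, using only the standard library (endpoint and model names are examples):

```python
import json
from urllib import request


def build_generate_request(base_url: str, model: str, prompt: str):
    """Build an Ollama /api/generate request that pins the model in memory.

    keep_alive=-1 asks Ollama to keep the weights loaded indefinitely
    instead of unloading them after the default idle timeout.
    """
    body = json.dumps({
        "model": model,
        "prompt": prompt,
        "keep_alive": -1,  # negative => never unload
        "stream": False,
    }).encode()
    return request.Request(
        base_url.rstrip("/") + "/api/generate",
        data=body,
        headers={"Content-Type": "application/json"},
    )


# Sending it requires a running Ollama server, so it is not executed here:
# resp = request.urlopen(
#     build_generate_request("http://localhost:11434", "llama3:70b", "hi"))
```

Keeping weights resident matches a scheduler that assigns models to GPUs itself, rather than letting each Ollama instance decide when to evict.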
Full Changelog: 0.9.14...0.9.15
0.9.14 - Azure OpenAI compatibility, stability improvements
What's Changed
Fixed a nasty bug where inference requests could "fall down a crack" and need to be retried by the user. This could also cause the OpenAI-compatible API to hang.
Added Azure OpenAI compatibility, so users just need to set a couple of environment variables to use Helix instead of OpenAI.
- Azure openai compat by @lukemarsden in #328
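Since Helix exposes an OpenAI-compatible API, pointing existing client code at it is typically just a matter of overriding the base URL and API key. The release note doesn't name the exact variables Helix expects, so the sketch below assumes the OpenAI client's conventional `OPENAI_BASE_URL` / `OPENAI_API_KEY` variables and builds the request with only the standard library:

```python
import json
import os
from urllib import request


def chat_request(messages, model="llama3:70b"):
    """Build an OpenAI-style chat completion request.

    Reads OPENAI_BASE_URL / OPENAI_API_KEY so the same code can target
    OpenAI, Azure OpenAI, or a Helix deployment just by changing env
    vars. (These variable names are the OpenAI client's conventions,
    assumed here, not confirmed Helix configuration.)
    """
    base = os.environ.get("OPENAI_BASE_URL", "https://api.openai.com/v1")
    key = os.environ.get("OPENAI_API_KEY", "")
    body = json.dumps({"model": model, "messages": messages}).encode()
    return request.Request(
        base.rstrip("/") + "/chat/completions",
        data=body,
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {key}",
        },
    )
```

With the base URL pointed at a Helix deployment, unmodified OpenAI client code sends its chat completions there instead.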
Full Changelog: 0.9.13...0.9.14
0.9.13 - fix regression for auto-created API keys in some cases
What's Changed
- reliably create api keys for users by @lukemarsden in #327
Full Changelog: 0.9.12...0.9.13
0.9.12 - app params for API tools, RAG in K8s, fix finetuning in openshift
What's Changed
- feat: add ability to override app query parameters in OpenAI API request by @philwinder in #318
- Local dev guide by @chocobar in #321
- Feature/llamaindex decoupling by @rusenask in #322
- Fix/rag models by @rusenask in #325
- move update & install into same layer by @rusenask in #326
Full Changelog: 0.9.11...0.9.12
0.9.11 - fix finetuning in locked down env
Fix finetuning in a locked-down OpenShift environment.
Full Changelog: 0.9.10...0.9.11
0.9.10 - ollama and cog fixes for openshift
More fixes for OpenShift: make the Ollama cache and cog-sdxl directories writable.
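For background (an assumption based on common OpenShift behaviour, not detail taken from this release): OpenShift's restricted SCC runs containers as an arbitrary UID in the root group, so directories baked into an image must be group-writable to be usable at runtime. A small sketch of applying that fix to a cache directory tree:

```python
import os
import stat


def make_group_writable(path: str) -> None:
    """Grant group read/write/execute throughout a directory tree.

    Under OpenShift's restricted SCC the container runs as a random UID
    with GID 0, so group-writable (rather than owner-writable)
    permissions are what make a pre-created cache directory usable.
    """
    grp = stat.S_IRGRP | stat.S_IWGRP | stat.S_IXGRP
    for root, dirs, files in os.walk(path):
        for name in [*dirs, *files]:
            p = os.path.join(root, name)
            os.chmod(p, os.stat(p).st_mode | grp)
    os.chmod(path, os.stat(path).st_mode | grp)
```

The equivalent image-build fix is a recursive group chmod on the cache directories; the Python version above just makes the permission change explicit and testable.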
Full Changelog: 0.9.9...0.9.10