WIP

norton120 · 2024-06-16T22:20:32Z

this may be a long-running branch since cutting the tests over to use httpx app + FastAPI dependency injection is gonna be a bit of work.

Preamble

The Database(s) that support the application state, agent memory (including vector lookup) and the application itself (user/org management, permissions, settings config etc) interface with the rest of the codebase via a MetadataStore object.

Goals here

The metadatastore stays as a gateway for now, but all the configuration gets conventionalized to each adapter type. Overrides need to happen in the config stack (so 1. envars 2. config file 3. default (lives in the adapter)). Don't start moving to doing ORM'y stuff here yet, keep this focused on config squashing.
Way more test hooks. We want to start seeing unit tests in this PR, the best way to do that is to add override hooks to the existing classes where they are useful and break these down more.

norton120 · 2024-06-16T22:21:07Z

@yoaquim this is the working PR we were talking about

norton120 · 2024-06-17T12:13:26Z

K - thinking through where the complication that prevents us using the orm directly, it's really only the archive. So if we add accessors on the related objects, the adapter can probably obfuscate that complication.
Something like

current_agent = authed_user.agents.get(agent_id)
# here's the magic
# archive_memory is not necessarily a sqlalchemy model
return current_agent.archive_memory.search(search_value)

In this case the adapter interface duck types as an orm - so with the pgvector adapter archive_memory is just a model, in SQLite it is a chroma wrapper.

norton120 · 2024-06-18T23:41:03Z

@cpacker @sarahwooders do you know if the init.sql file at the top level of the repo is for deployment? creating the initial user/password/db for the docker image would just be setting those envars

I'd like to create the test db in the docker db init, ideally, I'd like to not add a second init file and switch them around, so that's why I'm trying to track down what it is used for at the moment

norton120 · 2024-06-19T14:45:04Z

@cpacker @sarahwooders do you know if the init.sql file at the top level of the repo is for deployment? creating the initial user/password/db for the docker image would just be setting those envars

I'd like to create the test db in the docker db init, ideally, I'd like to not add a second init file and switch them around, so that's why I'm trying to track down what it is used for at the moment

For the moment I dumped into that init, overriding it without disturbing it is a bit of work. Can revisit before we start merging.

is working. Next up: - isolate the test_server failing tests - move the settings mock into a conftest fixture - add a test hook for SyncServer so you can do the same thing there. - propigate.

for default persona, human, and preset. Now all derived from settings (which is in turn derived from envars). Still need to square away with the config file hierarchy, so once we resolve the value there is only one definitive source of truth across the rest of the code.

hit by a bus the next person doesn't need to spend a week getting up to speed. This helps clarify the goal in this PR: one config hierarchy assembled once, with one mega hook.

TODO: - mount the test sqlite/chroma somewhere that doesn't clutter up the repo

…ep things clean

…le stripping out extraneous elements. The memory thing needs to be abstracted in a later time, never clear if these are strings or templates or references to a related object

…to clarify https://www.notion.so/Data-Model-Questions-43ef1336483f49c1bf77daddf3f320fa

norton120 · 2024-06-26T17:40:54Z

OK. So the shortest path I can see from here is:

add alembic migrations
move to migration and connection instead of create_all (because that won't work anymore)
overload the metadatastore methods to get parity - this should expose the chroma conflict naturally
solve for chroma/pgvector as an overloaded model in the ORM
get all tests passing, merge in all upstream changes
delete all the dead code. there will be a lot. there already is.

…le entrypoint to be good to go

…to be helpers like palm to do migrations and such

…scheme. the settings.backend object is self-contained, so no more external double-setting

… stub everything over to ORM models.

1. the metadata.py file is being updated to use the ORM 2. conflicting models are being sunset and/or quarantined for this PR 3. CRUD accessors stay in metadatastore but are now managed behind the scenes by the ORM This is going to break a lot of things (which is goodTo get unbroken: 1. update the tests to no longer be aware of the backend configs 2. update the code to same 3. remove all the SQLModel and deprecated backend code 4. document (loom) how the ORM works, how to create migrations, how to traverse the ORM tree etc etc. Strategy here should be to merge this into a long-running branch and start CI against it, then keep pulling main into it until we're ready for a major release (this will be a major). Configs will be extremely thin after this PR. We should be set up to move docker dev to a single stack and docker quickstart to a single image.

sarahwooders self-requested a review June 17, 2024 04:14

norton120 added 21 commits June 19, 2024 16:59

logs red to green

cd9723b

logs reflect debug status

a9b3964

import submodule

22b31bb

using memgpt logger not global logger

47bcf1c

found the bug duplication

13dfe0a

black

15757f9

isort

004fb31

placeholder while thinking

aa396fa

removing dead code to make it easier to refactor

003cc1d

starting in on abstracting the metadatastore adapters

450fa34

most of the initial config override in test_server

b8e0a71

is working. Next up: - isolate the test_server failing tests - move the settings mock into a conftest fixture - add a test hook for SyncServer so you can do the same thing there. - propigate.

abstracted fixture

d0fb886

moving more to fixtures

a63e250

defaults

f1e0c88

conflicting persist

5fc4c62

Started a working README for refactor, so if I get

c363018

hit by a bus the next person doesn't need to spend a week getting up to speed. This helps clarify the goal in this PR: one config hierarchy assembled once, with one mega hook.

ORM abstraction testing pattern set up

f615ae9

almost have the conftest pattern set up

42e3f1d

This is the basic 2 backend pattern.

f34d7ec

TODO: - mount the test sqlite/chroma somewhere that doesn't clutter up the repo

conftest respects relationships

eb50aff

norton120 force-pushed the feature/1437/condense-configs branch from 393f982 to eb50aff Compare June 19, 2024 21:19

norton120 added 3 commits June 19, 2024 17:38

sqlite now stores all the test databases in the .persist folder to ke…

4d1444b

…ep things clean

updating readme

2606f68

more readme

b567c4d

norton120 and others added 5 commits June 21, 2024 15:03

more models, bringing up lots of questions about the data model

8f1fe69

I'm trying to keep this as close to the current model as possible whi…

f1dbf2a

…le stripping out extraneous elements. The memory thing needs to be abstracted in a later time, never clear if these are strings or templates or references to a related object

basic ORM pattern for most objects

8ace1ca

presets model started, lots of questions. pushing to this notion doc …

f367d48

…to clarify https://www.notion.so/Data-Model-Questions-43ef1336483f49c1bf77daddf3f320fa

pretty sure this is the current model

192c80f

Ethan Knox and others added 8 commits June 26, 2024 15:10

alembic-managed migrations

9af8854

migrations now included on startup. we need to add it to every possib…

b651f68

…le entrypoint to be good to go

added jobs model

5b1c878

configs pattern for now. these should be 1st class. also there needs …

b4b68dd

…to be helpers like palm to do migrations and such

finally time to start cutting

fa637b0

pg_uri is now the _only_ db setting. it will always have the correct …

138db65

…scheme. the settings.backend object is self-contained, so no more external double-setting

chewing through all the redundant crud methods in metadata to ideally…

8cceafd

… stub everything over to ORM models.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP - Condense configurations into conventions for Database (Metadatastore) Adapters #1460

WIP - Condense configurations into conventions for Database (Metadatastore) Adapters #1460

norton120 commented Jun 16, 2024

norton120 commented Jun 16, 2024

norton120 commented Jun 17, 2024

norton120 commented Jun 18, 2024

norton120 commented Jun 19, 2024

norton120 commented Jun 26, 2024

WIP - Condense configurations into conventions for Database (Metadatastore) Adapters #1460

Are you sure you want to change the base?

WIP - Condense configurations into conventions for Database (Metadatastore) Adapters #1460

Conversation

norton120 commented Jun 16, 2024

WIP

Preamble

Goals here

norton120 commented Jun 16, 2024

norton120 commented Jun 17, 2024

norton120 commented Jun 18, 2024

norton120 commented Jun 19, 2024

norton120 commented Jun 26, 2024