Strided dataset feature branch #162

Diego-Llanes · 2024-06-11T19:34:23Z

This pull request contains the StridedDataset feature addition. The only file that have additions / changes is src/neuromancer/dataset.py.
All of the class parameters are well documented and have docstrings.

EDIT: There is now also an example documenting the usecase of this new feature.

Contributors:

> > Co-authored-by: Seth Lindberg Briney [email protected] Co-authored-by: Harry Qiang [email protected]

…ure_branch' into StridedDataset_feature_branch

> > Co-authored-by: Seth Lindberg Briney [email protected] Co-authored-by: Harry Qiang [email protected]

Merged develop into local fork > > Co-authored-by: Seth Lindberg Briney [email protected] Co-authored-by: Harry Qiang [email protected]

madelynshapiro · 2024-06-12T16:57:46Z

@Diego-Llanes would you please upload a small example (can be a modified snippet from one of your full notebooks) demonstrating the usage to accompany the PR?

Diego-Llanes · 2024-06-13T18:03:29Z

@Diego-Llanes would you please upload a small example (can be a modified snippet from one of your full notebooks) demonstrating the usage to accompany the PR?

Just added!

madelynshapiro

Excellent notebook!

madelynshapiro · 2024-06-12T16:54:02Z

src/neuromancer/dataset.py

+        batch['name'] = self.name
+        return batch
+
+    def remainder(self, n, d):


Suggested change

def remainder(self, n, d):

@staticmethod

def remainder(n, d):

drgona

The notebook should be modified:

the first image in the notebook is inconsistent with parameters in the text - create a new image illustrating parameters N, L, and stride
include training example of simple system ID model demonstrating the use of the new dataset
include more visualizations of how the data is processed into subsequences - it is not clear what I am looking at in the last plot, add more explanation, always label axes in the plots

RBirmiwal · 2024-06-26T21:50:27Z

strided_dataset_rahul_test.ipynb.zip

I agree with Jan.
In addition have attempted to play around with the strided dataset to ensure it works. I have created a notebook zipped up to this comment. I am still confused at times:

In DictDataset, let's say D:= a dictDataset, i can do D['X'] --> 3D tensor
In StridedDataset S:= StridedDataset, I cannot do S['X'], i have to do S[idx]['X] --> 2D tensors

While the user can figure out how to do proper reshaping and data creation, this is a difference in our API

what if instead we had S['X'] --> list of subsequence tensors for X
Also the tensor X is 2D not 3D......

I attempted to do Neural ODE example with StridedDataset. For L >= nsteps looks like it trains, but the performance is not as good as standard DictDataset.

For L < nsteps, it breaks.

I like this functionality, but we need to ensure it works and performs on-par for all the neuromancer use-cases not just Farama DPC. If there are situations where the StridedDataset will fail (as indicated when L > nsteps), then those need to be handled appropriately.

RBirmiwal · 2024-06-26T21:53:36Z

Also would like an example of a non-trivial update_fn, right now it is

def update_initial_condition(d):
    d['xn_2'] = d["xn"][0:1, :]
    return d

for the case of the neural ode where our keys are "X" and "xn". I don't understand how "xn_2" would play a role/being used. So what are cases where this would be necessary?

Diego-Llanes added 7 commits June 5, 2024 14:21

initial commit

0c30a35

Refactor usability tests.

2c18e26

> > Co-authored-by: Seth Lindberg Briney [email protected] Co-authored-by: Harry Qiang [email protected]

Added StridedDataset

6e18682

> > Co-authored-by: Seth Lindberg Briney [email protected] Co-authored-by: Harry Qiang [email protected]

Merge remote-tracking branch 'refs/remotes/origin/StridedDataset_feat…

63b8095

…ure_branch' into StridedDataset_feature_branch

Added StridedDataset and documentation

7c737fb

> > Co-authored-by: Seth Lindberg Briney [email protected] Co-authored-by: Harry Qiang [email protected]

Minor edit to documentation

1b5eba5

> > Co-authored-by: Seth Lindberg Briney [email protected] Co-authored-by: Harry Qiang [email protected]

Merge branch 'develop' into StridedDataset_feature_branch

0112d3f

Merged develop into local fork > > Co-authored-by: Seth Lindberg Briney [email protected] Co-authored-by: Harry Qiang [email protected]

Diego-Llanes requested review from drgona, madelynshapiro and RBirmiwal June 11, 2024 20:40

Diego-Llanes assigned madelynshapiro and RBirmiwal Jun 11, 2024

Diego-Llanes marked this pull request as ready for review June 11, 2024 21:37

Diego-Llanes added 3 commits June 13, 2024 10:08

Started on the StridedDataset notebook

2783ad0

Strided Dataset Example finished

12bc062

Minor edit to examples/strided_dataset.ipynb

86154b3

Diego-Llanes added the enhancement New feature or request label Jun 13, 2024

madelynshapiro approved these changes Jun 14, 2024

View reviewed changes

drgona requested changes Jun 24, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Strided dataset feature branch #162

Strided dataset feature branch #162

Diego-Llanes commented Jun 11, 2024 •

edited

Loading

madelynshapiro commented Jun 12, 2024

Diego-Llanes commented Jun 13, 2024 •

edited

Loading

madelynshapiro left a comment

madelynshapiro Jun 12, 2024

drgona left a comment

RBirmiwal commented Jun 26, 2024

RBirmiwal commented Jun 26, 2024

Strided dataset feature branch #162

Are you sure you want to change the base?

Strided dataset feature branch #162

Conversation

Diego-Llanes commented Jun 11, 2024 • edited Loading

madelynshapiro commented Jun 12, 2024

Diego-Llanes commented Jun 13, 2024 • edited Loading

madelynshapiro left a comment

Choose a reason for hiding this comment

madelynshapiro Jun 12, 2024

Choose a reason for hiding this comment

drgona left a comment

Choose a reason for hiding this comment

RBirmiwal commented Jun 26, 2024

RBirmiwal commented Jun 26, 2024

Diego-Llanes commented Jun 11, 2024 •

edited

Loading

Diego-Llanes commented Jun 13, 2024 •

edited

Loading