Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Added new CLOS train test split tutorial notebook #1071

Open
wants to merge 34 commits into
base: master
Choose a base branch
from

Commits on Mar 28, 2024

  1. Configuration menu
    Copy the full SHA
    b5c44b7 View commit details
    Browse the repository at this point in the history
  2. Fixed datasets and added sections on checking for near duplicates/non…

    … iid issues and filtered training data based on exact duplicates between training and test sets
    mturk24 committed Mar 28, 2024
    Configuration menu
    Copy the full SHA
    f954aa8 View commit details
    Browse the repository at this point in the history

Commits on Mar 29, 2024

  1. Can ignore commented out code and also some code I pasted in from a p…

    …revious version following the model eval on clean training + test data. Fixed section on using Datalab on training data to clean the data
    mturk24 committed Mar 29, 2024
    Configuration menu
    Copy the full SHA
    6448856 View commit details
    Browse the repository at this point in the history

Commits on Apr 3, 2024

  1. Added fix to test data inspection/cleaning, changed wording, cleaned …

    …up notebook and added more on hyperparameter optimization section. This section still needs to be improved.
    mturk24 committed Apr 3, 2024
    Configuration menu
    Copy the full SHA
    e82a607 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    58e59f8 View commit details
    Browse the repository at this point in the history

Commits on Apr 4, 2024

  1. Configuration menu
    Copy the full SHA
    ad94a5e View commit details
    Browse the repository at this point in the history
  2. Added hidden tests at end of tutorial, changed intro section wording,…

    … and cleaned up some of the code, put data used into s3 bucket
    mturk24 committed Apr 4, 2024
    Configuration menu
    Copy the full SHA
    797b762 View commit details
    Browse the repository at this point in the history
  3. Modified index files to put clos train test tutorial in general sideb…

    …ar before DCAI workflow tutorial, and renamed it to improving_ml_performance, also removed datalab tabular tutorial since this tutorial is replacing that one
    mturk24 committed Apr 4, 2024
    Configuration menu
    Copy the full SHA
    3164e94 View commit details
    Browse the repository at this point in the history
  4. Fixed markdown wording

    mturk24 committed Apr 4, 2024
    Configuration menu
    Copy the full SHA
    ffe9b33 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    47f9976 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    e22ffd8 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    aaaedf2 View commit details
    Browse the repository at this point in the history

Commits on Apr 8, 2024

  1. top part of tutorial

    jwmueller committed Apr 8, 2024
    Configuration menu
    Copy the full SHA
    068b8cc View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    447002b View commit details
    Browse the repository at this point in the history
  3. wording

    jwmueller committed Apr 8, 2024
    Configuration menu
    Copy the full SHA
    3f4108e View commit details
    Browse the repository at this point in the history

Commits on Apr 9, 2024

  1. Configuration menu
    Copy the full SHA
    36ad7d1 View commit details
    Browse the repository at this point in the history
  2. Fixed headings throughout notebook to be more similar to other tutori…

    …als and adjusted intro section as well
    mturk24 committed Apr 9, 2024
    Configuration menu
    Copy the full SHA
    d2419ea View commit details
    Browse the repository at this point in the history

Commits on Apr 16, 2024

  1. Configuration menu
    Copy the full SHA
    a143129 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    9ebe4fc View commit details
    Browse the repository at this point in the history
  3. duplicates section

    jwmueller committed Apr 16, 2024
    Configuration menu
    Copy the full SHA
    f2257b2 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    761318f View commit details
    Browse the repository at this point in the history

Commits on Apr 30, 2024

  1. Updated train and test datasets used, fixed bug with not dropping row…

    …s from training data that are exact duplicat with test set, updated seed usage to be proper, and fixed unit tests accordingly
    mturk24 committed Apr 30, 2024
    Configuration menu
    Copy the full SHA
    83d4209 View commit details
    Browse the repository at this point in the history
  2. Updated docs/source/index.rst to be same as master branch with new tu…

    …torial added between datalab and cleanlearning
    mturk24 committed Apr 30, 2024
    Configuration menu
    Copy the full SHA
    2096c5d View commit details
    Browse the repository at this point in the history

Commits on Jun 20, 2024

  1. Configuration menu
    Copy the full SHA
    05ed54c View commit details
    Browse the repository at this point in the history
  2. link to define preprocess

    jwmueller committed Jun 20, 2024
    Configuration menu
    Copy the full SHA
    4a32bd9 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    cf8408c View commit details
    Browse the repository at this point in the history
  4. test data changes confusion

    jwmueller committed Jun 20, 2024
    Configuration menu
    Copy the full SHA
    8b146f7 View commit details
    Browse the repository at this point in the history
  5. conciseness

    jwmueller committed Jun 20, 2024
    Configuration menu
    Copy the full SHA
    b5927f0 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    fcdd04c View commit details
    Browse the repository at this point in the history
  7. add elias edit

    Co-authored-by: Elías Snorrason <[email protected]>
    jwmueller and elisno committed Jun 20, 2024
    Configuration menu
    Copy the full SHA
    a79485e View commit details
    Browse the repository at this point in the history
  8. reformat wording

    jwmueller committed Jun 20, 2024
    Configuration menu
    Copy the full SHA
    01f753b View commit details
    Browse the repository at this point in the history
  9. Update docs/source/tutorials/improving_ml_performance.ipynb

    Co-authored-by: Jonas Mueller <[email protected]>
    mturk24 and jwmueller committed Jun 20, 2024
    Configuration menu
    Copy the full SHA
    dff9395 View commit details
    Browse the repository at this point in the history
  10. Update docs/source/tutorials/improving_ml_performance.ipynb

    Co-authored-by: Nelson Auner <[email protected]>
    mturk24 and nelsonauner committed Jun 20, 2024
    Configuration menu
    Copy the full SHA
    10563fe View commit details
    Browse the repository at this point in the history

Commits on Jun 21, 2024

  1. Updated tutorial hidden test thresholds, updated a few code blocks th…

    …at were outdated with newest version of cleanlab package, and some wording in markdown
    mturk24 committed Jun 21, 2024
    Configuration menu
    Copy the full SHA
    69295dd View commit details
    Browse the repository at this point in the history