Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Added new CLOS train test split tutorial notebook #1071

Open
wants to merge 34 commits into
base: master
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from 21 commits
Commits
Show all changes
34 commits
Select commit Hold shift + click to select a range
b5c44b7
Added WIP new CLOS train test split tutorial notebook
mturk24 Mar 28, 2024
f954aa8
Fixed datasets and added sections on checking for near duplicates/non…
mturk24 Mar 28, 2024
6448856
Can ignore commented out code and also some code I pasted in from a p…
mturk24 Mar 29, 2024
e82a607
Added fix to test data inspection/cleaning, changed wording, cleaned …
mturk24 Apr 3, 2024
58e59f8
Fixed some logic in hyperparameter section, still WIP
mturk24 Apr 3, 2024
ad94a5e
Fixed hyperparameter optimization section
mturk24 Apr 4, 2024
797b762
Added hidden tests at end of tutorial, changed intro section wording,…
mturk24 Apr 4, 2024
3164e94
Modified index files to put clos train test tutorial in general sideb…
mturk24 Apr 4, 2024
ffe9b33
Fixed markdown wording
mturk24 Apr 4, 2024
47f9976
Fixed index at docs/source level
mturk24 Apr 4, 2024
e22ffd8
Re-added tabular datalab tutorial
mturk24 Apr 4, 2024
aaaedf2
Fixed tabular datalab tutorial in index
mturk24 Apr 4, 2024
068b8cc
top part of tutorial
jwmueller Apr 8, 2024
447002b
dont need sklearn because its already dependency of cleanlab
jwmueller Apr 8, 2024
3f4108e
wording
jwmueller Apr 8, 2024
36ad7d1
Merge branch 'master' into add-train-test-clos-tutorial
mturk24 Apr 9, 2024
d2419ea
Fixed headings throughout notebook to be more similar to other tutori…
mturk24 Apr 9, 2024
a143129
more clarification on the main steps of the tutorial
jwmueller Apr 16, 2024
9ebe4fc
better motivate the merged data checks
jwmueller Apr 16, 2024
f2257b2
duplicates section
jwmueller Apr 16, 2024
761318f
shorten duplicates notebook cells
jwmueller Apr 16, 2024
83d4209
Updated train and test datasets used, fixed bug with not dropping row…
mturk24 Apr 30, 2024
2096c5d
Updated docs/source/index.rst to be same as master branch with new tu…
mturk24 Apr 30, 2024
05ed54c
address nelson validation data concern
jwmueller Jun 20, 2024
4a32bd9
link to define preprocess
jwmueller Jun 20, 2024
cf8408c
confusion about bias in test data
jwmueller Jun 20, 2024
8b146f7
test data changes confusion
jwmueller Jun 20, 2024
b5927f0
conciseness
jwmueller Jun 20, 2024
fcdd04c
caution this tutorial is complex
jwmueller Jun 20, 2024
a79485e
add elias edit
jwmueller Jun 20, 2024
01f753b
reformat wording
jwmueller Jun 20, 2024
dff9395
Update docs/source/tutorials/improving_ml_performance.ipynb
mturk24 Jun 20, 2024
10563fe
Update docs/source/tutorials/improving_ml_performance.ipynb
mturk24 Jun 20, 2024
69295dd
Updated tutorial hidden test thresholds, updated a few code blocks th…
mturk24 Jun 21, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
5 changes: 5 additions & 0 deletions docs/source/index.rst
Original file line number Diff line number Diff line change
Expand Up @@ -154,6 +154,11 @@ Link to Cleanlab Studio docs: `help.cleanlab.ai <https://help.cleanlab.ai/>`_
Datalab Tutorials <tutorials/datalab/index>
jwmueller marked this conversation as resolved.
Show resolved Hide resolved
CleanLearning Tutorials <tutorials/clean_learning/index>
Workflows of Data-Centric AI <tutorials/indepth_overview>
Improving ML Performance <tutorials/improving_ml_performance>
Image Classification <tutorials/image>
Text Classification <tutorials/text>
Tabular Classification <tutorials/tabular>
Audio Classification <tutorials/audio>
Analyze Dataset-level Issues <tutorials/dataset_health>
Outlier Detection <tutorials/outliers>
Improving Consensus Labels for Multiannotator Data <tutorials/multiannotator>
Expand Down
Loading
Loading