
Fix new plotting APIs with Triage's result schemas #713

Open
wants to merge 71 commits into post-postmodeling

Conversation

tweddielin
Contributor

  • Use Triage's latest result schema for the new plotting APIs instead of creating its own abstractions in postmodeling.
  • Remove old unit tests for the old postmodeling.
  • We probably want to merge the latest triage into the post-postmodeling branch? Otherwise, the tests won't pass.

ivanhigueram and others added 30 commits January 31, 2019 12:12
This commit adds a small change into the catwalk component to calculate
feature importances when the model object is a
catwalk.estimators.ScaledLogisticRegression. Now, instead of not
calculating anything, triage will be able to push feature importances
using e to the power of the coefficients.
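The commit's approach can be sketched as follows; `logistic_feature_importances` is a hypothetical name for illustration, not Triage's actual API:

```python
import numpy as np

# Hypothetical sketch (not Triage's actual code): treat e raised to each
# logistic-regression coefficient as that feature's importance, i.e. the
# odds ratio associated with a one-unit increase in the feature.
def logistic_feature_importances(coefficients):
    return np.exp(np.asarray(coefficients, dtype=float))
```

A zero coefficient maps to an importance of 1.0, i.e. no effect on the odds.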
At long last, the experiment runs table. It contains a variety of metadata about the experiment run, such as installed libraries, git hash, and number of matrices and models built/skipped/errored. Similarly, the experiments table is augmented with data that doesn't change from run-to-run (e.g. number of time splits, as-of-times, total grid size)

A variety of methods on the Experiment act as 'entrypoints'. The first entrypoint you hit when running an experiment (e.g. generate_matrices, or train_and_test_models) gets tagged on the experiment_runs row.

- Add Experiment runs table [Resolves #440] [Resolves #403] and
run-invariant columns to Experiments table
- Add tracking module to wrap updates to the experiment_runs table
- Have experiment call tracking module to save initial information and retrieve a run_id to update with more data later, either itself or through components (e.g. MatrixBuilder, ModelTrainer) that do relevant work
- Have experiment save run-invariant information when first computed
Introduce experiment_runs table, beef up experiments table
Add feature_importance metric to SLR [solves #509]
* mostly removing non-ASCII from the license file.
adding explicit lineterminator on csv.writer
* Update black from 18.9b0 to 19.3b0

* Update alembic from 1.0.7 to 1.0.8

* Update sqlalchemy from 1.2.18 to 1.3.1

* Update scikit-learn from 0.20.2 to 0.20.3

* Update pandas from 0.24.1 to 0.24.2

* Update boto3 from 1.9.105 to 1.9.125

* Update sqlparse from 0.2.4 to 0.3.0

* Update csvkit from 1.0.3 to 1.0.4

* Update fakeredis from 1.0.2 to 1.0.3

* Update hypothesis from 4.7.17 to 4.14.2

* Update tox from 3.7.0 to 3.8.4

* Fix SQLAlchemy warnings that are now errors
* Dirty duck (the whole enchilada)

* Improve mkdocs.yml to fit dirty duck markdown version

* Added function to create dirty duck md files to manage.py

* Updated link at menu bar

* Individual md files for dirty duck. Added markdown modules. Modified requirements.txt

* Added some suggested modifications

* Material design
jesteria and others added 20 commits May 7, 2019 10:51
…5+ GiB)

Underlying library ``s3fs`` automatically writes objects to S3 in "chunks"
or "parts" -- *i.e.* via multipart upload -- in line with S3's *minimum*
limit for multipart of 5 MiB.

This should, in general, avoid S3's *maximum* limit per (part) upload of
5 GiB. **However**, ``s3fs`` assumes that no *single* ``write()`` might
exceed the maximum, and as such fails to chunk out such too-large upload
requests prompted by singular writes of 5+ GiB.

This can and should be resolved in ``s3fs``; until then, it can, should
be, and now is resolved here in ``S3Store``.

resolves #530
write 5+ GiB (matrices) to S3Store
* Don't auto-upgrade db for new Experiments [Resolves #695]

To avoid the problem of time-consuming database upgrades happening when
we don't want them, the Experiment now:

1. Checks to see if the results_schema_versions table exists at all. If
it doesn't exist, upgrade. The results schema should be clean in this
case, and new users won't have to run an extra step when they first try
Triage.
2. If it does exist, and the version number doesn't match what the
code's current HEAD revision is, throw an error. The error message is
customized to whether the database revision is a known revision to the
code (easy case, just upgrade if you have time) or not (you probably
upgraded on a different branch and need to go check out that branch to
downgrade).
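The two-branch check above can be sketched as follows (function and argument names are hypothetical, not Triage's actual code):

```python
# Hypothetical sketch of the version check described above.
# db_revision is None when the results_schema_versions table is absent.
def decide_db_action(db_revision, code_head, known_revisions):
    if db_revision is None:
        return "upgrade"  # fresh schema: safe to build automatically
    if db_revision == code_head:
        return "ok"       # database matches the code's HEAD revision
    if db_revision in known_revisions:
        # Easy case: an older revision the code knows how to upgrade.
        raise RuntimeError(
            "Database is at an older known revision; upgrade when you have time."
        )
    # Likely upgraded on a different branch; go back there to downgrade.
    raise RuntimeError(
        "Unknown database revision; check out the branch that created it and downgrade."
    )
```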
* Add more user database management options to CLI [Resolves #697]

In recent weeks/months, more operations on the results schema have
proven to be things that are useful to 'users' (people who use the
'triage' command), not just 'developers' (people who use the 'manage'
command). These include: stamping to a specific revision, downgrading,
upgrading to a specific revision, and even just viewing the revision history.
Here we allow the `triage db` command to interface with
alembic to do these things.

Furthermore, the old 'stamp' logic in `triage db` isn't terribly useful
now that we have been on alembic for a while, and pinning it to
experiment config versions wasn't very useful. Using the standard
alembic revisions for stamping I think makes more sense, but I copied
the dictionary from before into the help text for 'stamp' because it
could still be helpful.

- Modify old `triage db stamp` logic to use standard alembic revisions
- Enable `triage db upgrade` to take a revision (but default to HEAD)
- Add `triage db downgrade` that takes a revision
- Add `triage db history` to show revisions
Adds a bias_audit_config section to triage experiment config that supports:

- Users can specify the protected groups logic using a pre-computed table (from_obj_table) or a query (from_obj_query) that must contain entity_id, date, and the attribute columns needed to generate the groups for the bias audit using aequitas.
- Users must specify knowledge_date_column, entity_id_column and a list of attribute_columns, otherwise we would not be able to create the table without knowing which columns it has.
- The bias_audit_config is optional. If it is set, a protected_groups_table generator, basically a replication of the labels generator, is run.
- The protected groups table created is named protected_groups_{experiment_hash} and is the result of a left join of the cohort table with the from_obj specified by the user.
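A sketch of what the optional section might look like in an experiment config; only the keys named above (from_obj_table/from_obj_query, knowledge_date_column, entity_id_column, attribute_columns) come from this PR's description, and the table and column names are made up:

```yaml
# Hypothetical example of the optional bias_audit_config section.
# Table and column names here are illustrative, not from Triage.
bias_audit_config:
  from_obj_query: |
    SELECT entity_id, event_date AS knowledge_date, race, sex
    FROM demographics
  knowledge_date_column: knowledge_date
  entity_id_column: entity_id
  attribute_columns:
    - race
    - sex
```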
- Add README.md to example/config/, explaining experiment.yaml, audition.yaml, postmodeling_config.yaml and postmodeling_crosstabs.yaml
- Remove feature.yaml and change documentation of feature-testing since cli.py just takes an experiment config.
This pull request changes the string_is_tablesafe validation primitive to allow only lowercase letters, numbers, and underscores in the strings it checks, and adds tests for feature aggregation prefixes and subset names, both of which are used in table names.

As described in #632, uppercase letters in these experiment config values end up getting lowercased on table creation but referenced using their uppercase forms (with quotes) at various places in the code, causing postgres to return a "table does not exist" error.

This PR also removes a redundant/conflicting dev.txt requirement of different versions of black, keeping the newer version.
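Under the stricter rule described above, the check can be sketched roughly like this (the real Triage implementation may differ in its details):

```python
import re

# Rough sketch of the stricter validation described above: only
# lowercase letters, digits, and underscores are accepted, since those
# survive postgres's identifier-lowercasing behavior consistently.
def string_is_tablesafe(value):
    return bool(re.fullmatch(r"[a-z0-9_]+", value))
```

For example, `string_is_tablesafe("MyFeature")` is False: unquoted, postgres would lowercase it at table creation, while quoted references elsewhere would keep the uppercase form and fail to match.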
Incorporates an Aequitas bias audit into Triage. The bias audit is optional and is controlled with experiment configuration. This is run during evaluation time and on each model.

One dirtyduck config (inspections_dt) is updated with a sample bias audit config.

To enable this, some requirements are updated so that Triage and Aequitas can coexist together more peacefully.
* Pin ipython to latest version 7.5.0

* Pin ipython to latest version 7.5.0

* Pin jupyter to latest version 1.0.0

* Pin jupyter to latest version 1.0.0

* Pin sphinx to latest version 2.0.1

* Pin sphinx_rtd_theme to latest version 0.4.3

* Pin coverage to latest version 4.5.3

* Pin flake8 to latest version 3.7.7

* Pin mkdocs to latest version 1.0.4

* Pin tox to latest version 3.9.0

* Pin tox-pyenv to latest version 1.1.0

* Pin nose to latest version 1.3.7

* Pin mock to latest version 2.0.0

* Pin colorama to latest version 0.4.1

* Pin httpie to latest version 1.0.2

* Pin psycopg2-binary to latest version 2.8.2

* Update black from 18.9b0 to 19.3b0

* Pin mkdocs-material to latest version 4.2.0

* Update alembic from 1.0.8 to 1.0.10

* Update sqlalchemy from 1.3.1 to 1.3.3

* Update psycopg2-binary from 2.7.7 to 2.8.2

* Update boto3 from 1.9.125 to 1.9.139

* Update s3fs from 0.2.0 to 0.2.1

* Update ohio from 0.1.2 to 0.4.0

* Update moto from 1.3.7 to 1.3.8

* Update hypothesis from 4.14.2 to 4.18.3

* Update tox from 3.8.4 to 3.9.0
@thcrock
Contributor

thcrock commented Oct 10, 2019

Why did you consider those unit tests old? Those are the tests that make sure that you can run postmodeling as much as possible even if you skipped predictions in your Triage Experiment run. I don't think those tests' usefulness has changed.

@tweddielin
Contributor Author

tweddielin commented Oct 10, 2019

@thcrock The reason I removed them temporarily is that the new postmodeling API is now totally different. There is no ModelEvaluator or ModelGroupEvaluator anymore; @nanounanue re-wrote it with a different pattern. I'm adding all the functions back following the new API design, along with those unit tests. That's also why I didn't merge this into the master branch but into the post-postmodeling branch that @nanounanue originally created.

@codecov-io

codecov-io commented Oct 10, 2019

Codecov Report

❗ No coverage uploaded for pull request base (post-postmodeling@3cda2e4).
The diff coverage is n/a.


@@                 Coverage Diff                  @@
##             post-postmodeling     #713   +/-   ##
====================================================
  Coverage                     ?   84.79%           
====================================================
  Files                        ?       93           
  Lines                        ?     6024           
  Branches                     ?        0           
====================================================
  Hits                         ?     5108           
  Misses                       ?      916           
  Partials                     ?        0


Legend
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 3cda2e4...6c0da70.

@thcrock
Contributor

thcrock commented Oct 10, 2019

Ah cool

Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet