normalize data for dcm #209

Eh2406 · 2018-03-30T15:03:21Z

This is a start on #208 still to do:

make it optional
add test for both behaviors
add l1 l2 regularization
test regularization

But I thout I'd open this to get some feedback.

Eh2406 · 2018-04-05T16:23:12Z

I don't know why CI is failing. Help?

Eh2406 · 2018-04-09T15:31:15Z

AppVeyor is Failed building wheel for pandana I don't think I can fix that.
I am working on the travis things.

coveralls · 2018-04-09T15:57:38Z

Coverage decreased (-0.4%) to 93.959% when pulling d10e194 on SEMCOG:normalize-data into 2497039 on UDST:master.

coveralls · 2018-04-09T15:57:38Z

Coverage increased (+0.06%) to 94.409% when pulling d832f16 on SEMCOG:normalize-data into 79f815a on UDST:master.

janowicz · 2018-04-18T17:13:49Z

Thanks for this @Eh2406! The passing Travis builds are the relevant benchmark here. The appveyor Windows build are failing due to pandana on Windows, which is unrelated to the present PR. We'll fix that, or we may remove pandana as a hard CI dependency of core urbansim (as little in this repo specifically needs pandana).

Eh2406 · 2018-04-20T19:24:19Z

I don't have a lot of experience with pytest. What test would you like to see, and how do you recommend I implement them?

Eh2406 · 2018-05-01T14:06:10Z

Ping. To be clear I am stuck waiting for advice on what tests you would like to see.

janowicz · 2018-05-02T03:54:57Z

Some thoughts (curious what others think too):
test_dcm.py

Create a "normalized_dcm" fixture that takes the existing basic_dcm fixture as a starting point and set values for normalized/l1/l2
Using the test_mnl_dcm function as a template, implement a high-level smoke-test just to see that the model with normalization can be fitted, that probabilities can be calculated from the resulting model, and that 'Normalization Mean' / 'Normalization Std' are in the resulting fit parameters.

test_mnl.py

Can any of the normalization/regularization changes to mnl.py be tested here? Perhaps a test for expected coefficients that are different when non-zero l1/l2 values supplied? And a smoke-test to ensure that mnl.mnl_simulate() runs with normalization_mean / normalization_std?

add a simple_dcm for the test that cannot be parameterized

Eh2406 · 2018-05-03T15:35:42Z

I parameterized basic_dcm to run with normalization and 3 different regularizations. Then looked through all the newly failing tests. This sussed out 2 places where I did not pass the new arguments in the correct order. Oops. Meny of the test can be made to pass with an if on basic_dcm.normalize. For the 3 that could not be done so easily, I added a simple_dcm that matched the old basic_dcm and switched them to use that.

Thoughts?

Eh2406 · 2018-05-24T16:17:12Z

merged and ci is green. Thoughts? What else would you like to see before accepting?

Eh2406 added 2 commits March 30, 2018 10:59

normalize data for dcm

b36288b

make normalization optional

bbe9501

Eh2406 force-pushed the normalize-data branch from b9a826e to bbe9501 Compare April 4, 2018 21:34

Eh2406 added 2 commits April 4, 2018 17:56

fex tests

a5b3838

add regularization

cd9a255

fix tests

afaa635

Eh2406 force-pushed the normalize-data branch from 4e14686 to afaa635 Compare April 9, 2018 15:16

pep8

7786ca7

Eh2406 added 2 commits April 9, 2018 11:36

fix args

5a3d390

more pep8

d10e194

Eh2406 added 2 commits May 2, 2018 16:04

fix args 2

1de3c80

make most test parameterized on normalization

22b1d27

add a simple_dcm for the test that cannot be parameterized

more direct tests of mnl_estimate

eb4ed01

Eh2406 force-pushed the normalize-data branch from a9786a8 to eb4ed01 Compare May 4, 2018 21:54

Merge remote-tracking branch 'origin/master' into normalize-data

d832f16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

normalize data for dcm #209

normalize data for dcm #209

Eh2406 commented Mar 30, 2018 •

edited

Loading

Eh2406 commented Apr 5, 2018 •

edited

Loading

Eh2406 commented Apr 9, 2018

coveralls commented Apr 9, 2018

coveralls commented Apr 9, 2018 •

edited

Loading

janowicz commented Apr 18, 2018 •

edited

Loading

Eh2406 commented Apr 20, 2018

Eh2406 commented May 1, 2018

janowicz commented May 2, 2018 •

edited

Loading

Eh2406 commented May 3, 2018

Eh2406 commented May 24, 2018

normalize data for dcm #209

Are you sure you want to change the base?

normalize data for dcm #209

Conversation

Eh2406 commented Mar 30, 2018 • edited Loading

Eh2406 commented Apr 5, 2018 • edited Loading

Eh2406 commented Apr 9, 2018

coveralls commented Apr 9, 2018

coveralls commented Apr 9, 2018 • edited Loading

janowicz commented Apr 18, 2018 • edited Loading

Eh2406 commented Apr 20, 2018

Eh2406 commented May 1, 2018

janowicz commented May 2, 2018 • edited Loading

Eh2406 commented May 3, 2018

Eh2406 commented May 24, 2018

Eh2406 commented Mar 30, 2018 •

edited

Loading

Eh2406 commented Apr 5, 2018 •

edited

Loading

coveralls commented Apr 9, 2018 •

edited

Loading

janowicz commented Apr 18, 2018 •

edited

Loading

janowicz commented May 2, 2018 •

edited

Loading