
Random train sample in s_test #8

Open
expectopatronum opened this issue Mar 26, 2020 · 4 comments
@expectopatronum (Contributor)

Hi, I was wondering why you take a random training sample in s_test. According to the comment TODO: do x, t really have to be chosen RANDOMLY from the train set? you are not certain about that yourself. Is there some hint in the paper, or why did you implement it like this? Did you have any new insights?
Thanks and best regards
Verena

@andrewsilva9

Towards the latter half of Section 3, in the "Stochastic Estimation" subsection where the authors recap the Agarwal et al. (2016) method, they mention uniformly sampling t training points. I would bet that more specifics are in the Agarwal paper, but I don't remember them at the moment.
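
For reference, the recursion they recap there (my transcription of the formula from Koh & Liang's "Stochastic Estimation" subsection) is

\tilde{H}_0^{-1} v = v, \qquad
\tilde{H}_j^{-1} v = v + \left( I - \nabla_\theta^2 L(z_{s_j}, \hat{\theta}) \right) \tilde{H}_{j-1}^{-1} v,

where z_{s_1}, \dots, z_{s_t} are drawn uniformly at random from the training set and \tilde{H}_t^{-1} v is taken as the final estimate of H^{-1} v. That uniform draw at each step would explain why s_test picks a random training sample per iteration.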

@dedeswim

Moreover, in the reference implementation, as far as I understand, they draw a random sample of size 1. So shouldn't the batch size of z_loader be 1, to behave the same way as the reference implementation?
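
If it helps, a minimal sketch of a loader with that behaviour (train_set is just an illustrative name for the training dataset here, not necessarily this repo's variable) would be:

from torch.utils.data import DataLoader

# batch_size=1 so each iteration yields a single random training
# example, matching the reference implementation's sampling
z_loader = DataLoader(train_set, batch_size=1, shuffle=True)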

@andrewsilva9

I believe so, and in my adaptation of this repo, that is how it works.

Koh and Liang did extend this work to examine batch/group effects (https://arxiv.org/abs/1905.13289), so you may be able to estimate the HVP or influence with a larger batch size. But I'm not sure exactly what that would look like (though I believe they have code available).

@dedeswim

dedeswim commented Jun 1, 2020

I think a good solution, which I adopted and am using, is to create a DataLoader over the same dataset as train_loader, but with a RandomSampler that draws recursion_depth samples. It looks something like this:

from torch.utils.data import DataLoader, RandomSampler

# Sample recursion_depth (here 5000) points with replacement; with the
# default batch_size=1, the loader yields exactly num_samples
# single-example batches.
hessian_loader = DataLoader(
    train_loader.dataset,
    sampler=RandomSampler(
        train_loader.dataset, replacement=True, num_samples=5000
    ),
    num_workers=4,
)

And then, inside s_test, iterate directly over the loader (the hessian_loader above is what gets passed in as z_loader):

for x, y in z_loader:
    # hvp computation as before

without the outer loop or the break that is currently being used.
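
For concreteness, here is a minimal sketch of what that single-loop s_test could look like. The signature and the damp/scale arguments are my assumptions, modelled on the usual LiSSA damping/scaling trick, not necessarily this repo's exact API:

import torch

def s_test_sketch(model, loss_fn, x_test, y_test, hessian_loader,
                  damp=0.01, scale=25.0):
    """Estimate H^{-1} v for v = grad of the test loss (LiSSA-style)."""
    params = [p for p in model.parameters() if p.requires_grad]

    # v: gradient of the test loss w.r.t. the parameters
    test_loss = loss_fn(model(x_test), y_test)
    v = torch.autograd.grad(test_loss, params)
    h_estimate = [vi.clone() for vi in v]

    # Single pass: the RandomSampler already yields recursion_depth
    # batches, so no outer loop and no break are needed.
    for x, y in hessian_loader:
        loss = loss_fn(model(x), y)
        grads = torch.autograd.grad(loss, params, create_graph=True)
        # Hessian-vector product via double backprop
        hv = torch.autograd.grad(grads, params, grad_outputs=h_estimate)
        # LiSSA update: h <- v + (1 - damp) * h - (H h) / scale
        h_estimate = [
            vi + (1 - damp) * hi - hvi / scale
            for vi, hi, hvi in zip(v, h_estimate, hv)
        ]
    return h_estimate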
