
Gpu batch #35

Open · wants to merge 8 commits into master
Conversation

Jeanselme (Contributor)

Allow putting only the current batch on the GPU to limit memory use.
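
A minimal sketch of the idea (not the actual DeepSurvivalMachines training loop; the tensor names and the compute_loss helper are illustrative): keep the full dataset on the CPU and move only the current mini-batch to the model's device inside the loop.

```python
def train_epoch(model, optimizer, x_train, t_train, e_train, batch_size=128):
  # The model lives on the GPU; the full training tensors stay on the CPU.
  device = next(model.parameters()).device
  for start in range(0, x_train.shape[0], batch_size):
    # Move only the current mini-batch to the model's device.
    xb = x_train[start:start + batch_size].to(device)
    tb = t_train[start:start + batch_size].to(device)
    eb = e_train[start:start + batch_size].to(device)

    optimizer.zero_grad()
    loss = compute_loss(model, xb, tb, eb)  # placeholder for the DSM loss
    loss.backward()
    optimizer.step()
```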

@Jeanselme (Contributor Author)

This could benefit from two levels of CUDA support: one fully on the GPU (all train and test data) and one batched?
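
One hedged sketch of what such a two-level option might look like (the cuda argument and its values below are hypothetical, not the library's API): level 1 moves the whole tensor to the GPU up front, level 2 keeps data on the CPU and transfers one batch at a time.

```python
def maybe_move_to_gpu(data, cuda):
  # cuda == 1: the whole tensor fits in GPU memory, move it once up front.
  if cuda == 1:
    return data.to('cuda')
  # cuda == 2: keep the data on the CPU; individual batches are moved
  # to the GPU inside the training loop instead.
  return data
```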

@chiragnagpal (Collaborator)

@Jeanselme I think this still requires the model to be moved to the CPU before predict_survival can be used. Can you also change https://github.com/autonlab/DeepSurvivalMachines/blob/e88c88556bc603ac58ff83abdbe606e1c29c839b/dsm/losses.py#L307 and https://github.com/autonlab/DeepSurvivalMachines/blob/e88c88556bc603ac58ff83abdbe606e1c29c839b/dsm/losses.py#L341 to move to the same device as 'x', so that one can run the predict_risk or predict_survival function without any hiccups?
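
A hedged sketch of the kind of fix being requested (the function and argument names are illustrative, not the repository's code): any tensor created inside the prediction path should be placed on the same device as the input x, so the function works whether the model and data live on the CPU or the GPU.

```python
import torch

def predict_survival_sketch(model, x, t_horizons):
  # Create the horizon tensor on the same device as the input `x`.
  t = torch.tensor(t_horizons, dtype=x.dtype, device=x.device)
  with torch.no_grad():
    out = model(x)
  # Bring results back to the CPU before converting to NumPy.
  return out.cpu().numpy(), t.cpu().numpy()
```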

        loss += float(losses.conditional_loss(self.torch_model,
                                              x_val, t_val, e_val, elbo=False,
-                                             risk=str(r+1)).detach().numpy())
+                                             risk=str(r+1)).detach().cpu().numpy())
@chiragnagpal (Collaborator)

I don't think this needs to be detached and put on the CPU; torch doesn't track this variable.
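
For context, a small generic PyTorch illustration of the point about gradient tracking (not the repository's code): a value computed under torch.no_grad() is not part of the autograd graph, so .detach() is redundant; only a move to the CPU matters before calling .numpy() on a CUDA tensor.

```python
import torch

w = torch.randn(3, requires_grad=True)
with torch.no_grad():
  val = (w * 2).sum()     # no autograd graph is recorded here

print(val.requires_grad)  # False: there is nothing to detach
print(float(val))         # float() / .item() work directly
# A CUDA tensor would still need a CPU copy before .numpy():
# val.cpu().numpy()
```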

One has to be careful of the test set's size as it will not easily fit on gpu (either batch it or put model on cpu)
More elegant way for obtaining value in unit length tensor
@codecov-io commented Feb 9, 2021

Codecov Report

Merging #35 (1c9850a) into master (e88c885) will decrease coverage by 0.42%.
The diff coverage is 41.37%.

Impacted file tree graph

@@            Coverage Diff             @@
##           master      #35      +/-   ##
==========================================
- Coverage   53.18%   52.76%   -0.43%     
==========================================
  Files           7        7              
  Lines         831      851      +20     
==========================================
+ Hits          442      449       +7     
- Misses        389      402      +13     
Impacted Files     Coverage          Δ
dsm/losses.py      33.95% <33.33%>   (ø)
dsm/dsm_api.py     53.12% <35.00%>   (-2.12%) ⬇️
dsm/utilities.py   82.45% <66.66%>   (-1.33%) ⬇️

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update e88c885...1c9850a.

@@ -179,8 +186,7 @@ def train_dsm(model,
                                         elbo=False,
                                         risk=str(r+1))

-    valid_loss = valid_loss.detach().cpu().numpy()
-    costs.append(float(valid_loss))
+    costs.append(valid_loss.item())
@chiragnagpal (Collaborator)

Let's use float().

@Jeanselme (Contributor Author)

.item() automatically moves the value to the CPU if necessary and casts it to a Python float.
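
For reference, a quick illustration of standard PyTorch behavior (not code from this PR): both .item() and float() return a Python scalar from a 0-dim tensor, on CPU or GPU, while .numpy() requires an explicit move to the CPU first.

```python
import torch

loss = torch.tensor(0.1234)                 # behaves the same if it lives on a GPU
print(loss.item())                          # device-agnostic Python float
print(float(loss))                          # also fine for a 0-dim tensor
print(float(loss.detach().cpu().numpy()))   # the longer, explicit route
```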

@chiragnagpal (Collaborator) left a comment

I am not sure why you use .item()?

@@ -74,9 +74,9 @@ def pretrain_dsm(model, t_train, e_train, t_valid, e_valid,
    valid_loss = 0
    for r in range(model.risks):
      valid_loss += unconditional_loss(premodel, t_valid, e_valid, str(r+1))
-    valid_loss = valid_loss.detach().cpu().numpy()
+    valid_loss = valid_loss.item()
@chiragnagpal (Collaborator)

Let's use float.
