Add simclr eval by sanaAyrml · Pull Request #13 · VectorInstitute/GenerativeSSL

sanaAyrml · 2024-02-01T18:05:18Z

PR Type

[Feature ]

Short Description

I added evaluation module in this new PR which adds a classification head to pre-trained backbone. Then it freezes backbone and only train classification head and report accuracy over train and test data.

This change is

afkanpour · 2024-02-12T17:27:00Z

It looks like this PR includes changes unrelated to linear eval (perhaps from multi-gpu support?). Can you please fix that?

sanaAyrml · 2024-02-12T17:49:52Z

It looks like this PR includes changes unrelated to linear eval (perhaps from multi-gpu support?). Can you please fix that?

I have put this pr before merging ICGAN, maybe that is the reason to have extra changes. I will update it today and put it up.

afkanpour · 2024-02-13T20:23:50Z

+            size (int): Image size.
+        """
+        transform_list = [
+            transforms.Resize(size=(size,size)),


For inference I believe SimCLR performs a random center crop before resizing. Can you please have a look at either their paper or the code to verify?

In the code they didn't have any transformation rather than .ToTensor, but I thought this would fail on imagenet as we have various images sizes. However in the paper they mentioned "As preprocessing, all images were resized to 224 pixels along the shorter side using bicubic resampling, after which we took a 224 × 224 center crop." It is bit vague for me, should we first resize on smaller dimension and then centre crop? or should we first crop on smaller dimension and then resize?

afkanpour · 2024-02-16T21:59:40Z

                data_utils.CenterCropLongEdge(),
                transforms.Resize((size, size)),
                transforms.ToTensor(),
-                transforms.Normalize(self.config.norm_mean, self.config.norm_std),


Why did you remove this? Is it causing an error?

afkanpour · 2024-02-26T15:09:54Z

I think this implementation should be available. Do you need additional capability beyond this? https://github.com/VectorInstitute/GenerativeSSL/blob/main/icgan/data_utils/utils.py#L29

afkanpour · 2024-02-26T15:26:40Z

@@ -0,0 +1,51 @@
+#!/bin/bash
+
+#SBATCH --job-name=train_sunrgbd


afkanpour · 2024-02-26T15:29:29Z

I have added changes to run_simCLR.py in my PR that takes checkpoint_dir and creates a directory based on the time of running the training job. So let's revert these changes.

afkanpour · 2024-02-26T17:20:24Z

+from torch import nn
+from torchvision import models
+
+from ..exceptions.exceptions import InvalidBackboneError


Please use absolute paths.

afkanpour · 2024-02-26T18:46:06Z

        self.device_id = kwargs["device_id"]
-        self.writer = SummaryWriter()
+        # Create a directory to save the model checkpoints and logs
+        now = datetime.now()


I have added logic to run_simCLR.py that does something like this. So you have to either remove these parts, or remove my changes before merging.

afkanpour · 2024-02-26T18:47:15Z

 #SBATCH --nodes=1
 #SBATCH --gres=gpu:4
-#SBATCH --ntasks-per-node=4
+#SBATCH --ntasks-per-node=1


Please keep in mind that this uses a single GPU per node.

afkanpour · 2024-02-26T18:47:57Z

-# “srun” executes the script <ntasks-per-node * nodes> times
-srun python run_simCLR.py \
+# srun execute ntasks-per-node * nodes times
+srun pythong run_simCLR.py \


afkanpour · 2024-02-26T18:51:23Z

If the github repo from which we borrowed simclr implementation provides pretrained model checkpoints, can you please download one of them and test this script on it to see if the results match with what's reported by them?

afkanpour

Do we still need this? If not, should we delete it?

Reviewable status: 0 of 19 files reviewed, 30 unresolved discussions (waiting on @sanaAyrml and @vahid0001)

sanaAyrml added 25 commits January 24, 2024 10:54

Distributed rcdm

acb48e2

Add rcdm model

cdec0c9

Add checkpointing

23cdf80

update

ae42737

update

73c580b

update config

25936dc

edit logging

2d9d197

server

422a10d

add checkpointing

60d3dc3

Add eval files

9ae6ee8

add slrm file

642ad8a

update eval file

ea7cc42

update eval

a9cbba1

check eval

90ea717

check

1cba4c8

check state_dicts

6a34a85

check

86dcdf7

edit eval classes

4e683a4

check

ecfab47

update slrm

20795af

Update eval

38a86e6

edit

d85de17

edit slrm

e359965

correct sample slrm

c78abd0

fix multi gpu

a1fa4ea

sanaAyrml requested review from afkanpour and vahid0001 February 7, 2024 18:50

sanaAyrml marked this pull request as draft February 12, 2024 17:50

debug rcdm error

b36c694

afkanpour reviewed Feb 13, 2024

View reviewed changes

sanaAyrml added 7 commits February 13, 2024 12:27

edit

cf9713d

edit

f8e8eb1

delete normalize

5755923

edit

25d250f

delete print

5754ae1

Merge branch 'main' into add_simclr_eval

509530f

update evaluation

0642905

sanaAyrml marked this pull request as ready for review February 20, 2024 18:53

sanaAyrml added 17 commits February 20, 2024 10:54

update formating

245eb54

update logging

33c0c93

Update bash files

31409dd

Update augmentation and saving file

eb9a4b6

update evaluation

288f749

Update bash file

3b43d4e

edit eval

e64fe5c

check loading

ef6a214

debug eval

cef4572

update

883d9c0

check evaluation

c900b9f

Clean the code

314633c

update

ed78e17

try catch the file exist error

a392667

update

6c5cf20

update logging part

313b705

update slrm scripts

67b5a9f

afkanpour reviewed Feb 26, 2024

View reviewed changes

update resnet pretrained

1e2ba81

afkanpour reviewed Apr 12, 2024

View reviewed changes

Conversation

sanaAyrml commented Feb 1, 2024 • edited by afkanpour Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Type

Short Description

Uh oh!

afkanpour commented Feb 12, 2024

Uh oh!

sanaAyrml commented Feb 12, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sanaAyrml Feb 16, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

afkanpour left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

sanaAyrml commented Feb 1, 2024 •

edited by afkanpour

Loading

sanaAyrml Feb 16, 2024 •

edited

Loading