Ensembling over layers #259
base: main
Conversation
elk/metrics/eval.py
Outdated
@@ -41,6 +41,73 @@ def to_dict(self, prefix: str = "") -> dict[str, float]:
        return {**auroc_dict, **cal_acc_dict, **acc_dict, **cal_dict}


def calc_auroc(y_logits, y_true, ensembling, num_classes):
Add type annotations here.
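For example, something along these lines (the tensor shapes and the return type here are my guesses; adjust to whatever the function actually takes):

from torch import Tensor

def calc_auroc(
    y_logits: Tensor,  # assumed shape: (n_examples, n_variants, n_classes)
    y_true: Tensor,  # assumed shape: (n_examples,)
    ensembling: "PromptEnsembling",  # the Enum added in this PR; import path omitted
    num_classes: int,
) -> Tensor:  # or whatever result type the other metric helpers use
    ...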
Tests run forever on my machine. Need to check what is wrong there.
Mainly, just fix the handling of the multi-dataset case:
❯ elk elicit gpt2 imdb amazon_polarity --max_examples 10 300 --debug --num_gpus 1
y_logits_collection.append(y_logits)


# get logits and ground_truth from middle to last layer
middle_index = len(layer_outputs) // 2
In some ways I think we should allow the layers over which we ensemble to be configurable, e.g. sometimes the last layers perform worse.
Yeah, it makes sense to make it configurable. However, I'm curious: how would you decide which layers to pick?
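One possible shape for this, purely as a sketch (LayerEnsembleSpan, its fields, and the defaults are invented here, not existing config options):

from dataclasses import dataclass

@dataclass
class LayerEnsembleSpan:
    """Hypothetical config for which layers get ensembled."""

    layer_start: int | None = None  # None -> default to the middle layer
    layer_end: int | None = None  # None -> include everything up to the last layer

def select_layer_outputs(layer_outputs: list, span: LayerEnsembleSpan) -> list:
    start = span.layer_start if span.layer_start is not None else len(layer_outputs) // 2
    end = span.layer_end if span.layer_end is not None else len(layer_outputs)
    return layer_outputs[start:end]

That keeps the current mid-to-last behaviour as the default while letting someone drop the final layers if they turn out to hurt.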
middle_index = len(layer_outputs) // 2
y_logits_stacked = torch.stack(y_logits_collection[middle_index:])
# layer prompt_ensembling of the stacked logits
y_logits_stacked_mean = torch.mean(y_logits_stacked, dim=0)
It seems like the ensembling is done by taking the mean over layers rather than concatenating. This isn't super clear from the comments/docstrings, and it's hard to tell from reading the code because the shapes aren't commented.
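A couple of shape comments would already make the intent explicit; here is a self-contained toy version of what the code appears to do, with the layout I am assuming:

import torch

# toy stand-in for the per-layer logits collected above, each with the assumed
# shape (n_examples, n_variants, n_classes)
y_logits_collection = [torch.randn(10, 2, 2) for _ in range(12)]
middle_index = len(y_logits_collection) // 2

# stack the middle-to-last layers: (n_layers_used, n_examples, n_variants, n_classes)
y_logits_stacked = torch.stack(y_logits_collection[middle_index:])

# ensemble over layers by averaging the logits along dim=0 (the layer axis),
# which brings the shape back to (n_examples, n_variants, n_classes)
y_logits_stacked_mean = torch.mean(y_logits_stacked, dim=0)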
from enum import Enum


class PromptEnsembling(Enum):
I think it's fine
elk/training/train.py
Outdated
@@ -53,7 +54,7 @@ def apply_to_layer(
        layer: int,
        devices: list[str],
        world_size: int,
-    ) -> dict[str, pd.DataFrame]:
+    ) -> tuple[dict[str, pd.DataFrame], list[dict]]:
Same comment here regarding the return type.
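If the earlier point was about avoiding an anonymous tuple, one lightweight option is a named container (just a sketch; LayerApplyResult and its field names are invented):

from typing import NamedTuple

import pandas as pd

class LayerApplyResult(NamedTuple):
    """Hypothetical named return type for apply_to_layer."""

    df_dict: dict[str, pd.DataFrame]  # per-dataset result tables
    layer_output: list[dict]  # raw outputs kept around for layer ensembling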
elk/run.py
Outdated
    try:
-       for df_dict in tqdm(mapper(func, layers), total=len(layers)):
-           for k, v in df_dict.items():
+       for df_dict, layer_output in tqdm(
This doesn't write all the appropriate lines for:
❯ elk elicit gpt2 imdb amazon_polarity --max_examples 10 300 --debug --num_gpus 1
There should be evaluation results for both imdb and amazon_polarity in layer_ensembling_results.csv.
sorting, remove comment
my fixes for layer ensembling
Force-pushed from f3319c1 to 64e762a: Ensembling from mid to last layer