
Added the ndcg metric [WIP] #2632


Draft: wants to merge 38 commits into master

Conversation

@kamalojasv181 (Contributor) commented Jul 23, 2022

Related #2631

Description: This is the [WIP] implementation of the NDCG metric.

Check list:

  • Create a basic implementation.
  • Handle rating ties.
  • Write and parameterize tests.
  • Add comments about its use.
  • Add the exponential implementation.
  • Add docs.
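
For context, a minimal reference computation of NDCG under the standard linear-gain definition (a hedged sketch; the exact gain/discount variants implemented in this PR, e.g. the exponential form and the tie handling from the checklist, may differ):

import numpy as np

def dcg(relevances):
    # DCG = sum_i rel_i / log2(i + 2), where i is the 0-based rank position
    discounts = np.log2(np.arange(len(relevances)) + 2)
    return (np.asarray(relevances, dtype=float) / discounts).sum()

# true relevances of the items, in the order ranked by the model
ranked = [3, 2, 3, 0, 1]
ideal = sorted(ranked, reverse=True)  # the best possible ordering
ndcg = dcg(ranked) / dcg(ideal)       # NDCG in [0, 1]; here ~0.97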

@github-actions github-actions bot added the module: metrics Metrics module label Jul 23, 2022
@kamalojasv181 kamalojasv181 marked this pull request as draft July 23, 2022 16:45
@vfdev-5 (Collaborator) left a comment:

@kamalojasv181 thanks for the PR. I left a few comments on a few points, but I haven't yet explored the code that computes NDCG.

Can you please write a few tests against scikit-learn as a reference?
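
Something along these lines, for instance (a hedged sketch: the NDCG class name, its import path, and the (y_pred, y_true) update format are assumptions based on the rest of this thread, not the final API):

import pytest
import torch
from sklearn.metrics import ndcg_score

def test_output_vs_sklearn():
    y_true = torch.rand(8, 10)
    y_pred = torch.rand(8, 10)

    ndcg = NDCG()  # class under development in this PR; import path not final
    ndcg.update((y_pred, y_true))
    res = ndcg.compute()

    # scikit-learn expects (y_true, y_score)
    assert pytest.approx(res) == ndcg_score(y_true.numpy(), y_pred.numpy())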

@vfdev-5 (Collaborator) left a comment:

Thanks a lot for the update @kamalojasv181!
I left a few other comments on the implementation and the API.

Let's also start working on docs and tests.

@kamalojasv181 (Contributor, Author)

Thanks for all the feedback. I will revert with a pull request addressing everything!

@vfdev-5 (Collaborator) commented Jul 24, 2022

Thanks for all the feedback. I will revert with a pull request addressing everything!

You can just continue working with this pull request, no need to revert anything.

@vfdev-5 (Collaborator) left a comment:

Thanks for the updates @kamalojasv181!
I have a few other code update suggestions.

@kamalojasv181 (Contributor, Author)

@vfdev-5 there are a bunch of things I did in this commit:

  1. Put the data on the GPU for tests.
  2. Replaced torch.add with +.
  3. Changed the naming and argument order to follow ignite conventions, i.e. (y_pred, y_true, ...).
  4. Added a function to handle ties and wrote corresponding tests (a part of it is un-vectorized; TODO: think of an implementation to vectorize it).
  5. Added a check that log_base is > 0 and not equal to 1 (see the sketch after this list).
  6. Added tests for log_base and exponential.
  7. Separated _ndcg_sample_score and _dcg_sample_score from the NDCG class.
    NOTE: the log_base tests could have been done with the other tests, but I put them separately so that if sklearn changes their code, we know about it (as the tests would fail).

If there is anything else, let me know before I finally add some comments against the class and documentation.
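
For reference, a minimal sketch of the check described in item 5 (the exact message and placement are assumptions, not the PR's verbatim code):

# hypothetical sketch of the log_base validation
if log_base <= 0 or log_base == 1:
    raise ValueError(f"Argument log_base should be positive and not equal to one, but got {log_base}")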

Comment on lines +52 to +54
discounted_gains = torch.tensor(
    [_tie_averaged_dcg(y_p, y_t, discount_cumsum, device) for y_p, y_t in zip(y_pred, y_true)], device=device
)
Collaborator:

So, there is no way to make it vectorized, i.e. without a for-loop?

@kamalojasv181 (Contributor, Author):

I haven't checked yet. For now I have added this implementation; it's a TODO.
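
For reference, a per-sample helper in torch ops that mirrors scikit-learn's _tie_averaged_dcg could look like the sketch below (an assumption, not necessarily this PR's code: tied predictions share the average discount of the rank positions they occupy, which removes the inner Python loop over items, although the outer per-sample loop above remains):

import torch

def _tie_averaged_dcg(y_pred: torch.Tensor, y_true: torch.Tensor, discount_cumsum: torch.Tensor, device) -> torch.Tensor:
    # group equal predictions; inv maps each item to its tie group, ordered by descending score
    _, inv, counts = torch.unique(-y_pred, return_inverse=True, return_counts=True)
    # average true relevance within each tie group (inputs assumed already on `device`)
    ranked = torch.zeros(counts.shape[0], dtype=torch.float64, device=device)
    ranked.scatter_add_(0, inv, y_true.double())
    ranked /= counts
    # last rank position occupied by each group -> total discount assigned to the group
    groups = torch.cumsum(counts, dim=0) - 1
    discount_sums = torch.empty_like(ranked)
    discount_sums[0] = discount_cumsum[groups[0]]
    discount_sums[1:] = torch.diff(discount_cumsum[groups])
    return (ranked * discount_sums).sum()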

@puhuk (Contributor) commented Aug 30, 2022

@vfdev-5 @kamalojasv181

Below is a checklist for tests in a DDP configuration:

  • Generate data with a different random seed per process
    • torch.manual_seed(12 + rank + i)
  • The generated data size per rank should be n_iters * batch_size, so that update() consumes one batch_size chunk per iteration
    • y_true = torch.randint(0, n_classes, size=(n_iters * batch_size,))
  • Feed the data batch-by-batch in update()
    • y_pred[i * batch_size : (i+1) * batch_size]
  • Check that the data generated on each process is gathered for the computation (after running the Engine)
    •    engine.run(data=data, max_epochs=n_epochs)
         y_true = idist.all_gather(y_true)
         y_preds = idist.all_gather(y_preds)
  • Check that the computed metric matches the reference
    • pytest.approx(calculated_value) == reference_value

The whole process should look something like the following (or you can refer to ignite/tests/ignite/metrics/test_accuracy.py):

def _test_distrib_integration_list_of_tensors_or_numbers(device):
    rank = idist.get_rank()
    metric_device = idist.device()
    n_iters, batch_size, n_classes, n_epochs = 80, 16, 10, 1

    # data generation: a different seed per process
    torch.manual_seed(12 + rank)
    y_true = torch.randint(0, n_classes, size=(n_iters * batch_size,)).to(device)
    y_preds = torch.rand(n_iters * batch_size, n_classes).to(device)

    # feed one batch per iteration
    def update(engine, i):
        return (
            y_preds[i * batch_size : (i + 1) * batch_size, :],
            y_true[i * batch_size : (i + 1) * batch_size],
        )

    # initialize the Engine and attach the metric
    engine = Engine(update)
    acc = Accuracy(device=metric_device)
    acc.attach(engine, "acc")

    data = list(range(n_iters))
    engine.run(data=data, max_epochs=n_epochs)

    # gather the data from all processes
    y_true = idist.all_gather(y_true)
    y_preds = idist.all_gather(y_preds)

    res = engine.state.metrics["acc"]

    # calculate the reference value with scikit-learn and compare
    true_res = sklearn.metrics.accuracy_score(y_true.cpu().numpy(), torch.argmax(y_preds, dim=1).cpu().numpy())
    assert pytest.approx(res) == true_res

Comment on lines 163 to 166
return (
    [v for v in y_preds[i * batch_size : (i + 1) * batch_size, ...]],
    [v for v in y_true[i * batch_size : (i + 1) * batch_size]],
)
@vfdev-5 (Collaborator) commented Aug 31, 2022:

@kamalojasv181 Why do you return a tuple of two lists instead of a tuple of two tensors?

@kamalojasv181 (Contributor, Author):

I took inspiration from the code provided by @puhuk. Here each element of the list is a batch; we feed our engine one batch at a time, hence using a list is also OK. To maintain uniformity across the code, I have kept it this way.

Collaborator:

Oh, I see, he provided a wrong link. Yes, in accuracy we also have a test on lists of tensors and numbers, but this is atypical. Here is a typical example:

def _test_distrib_integration_multilabel(device):
    rank = idist.get_rank()

    def _test(n_epochs, metric_device):
        metric_device = torch.device(metric_device)
        n_iters = 80
        batch_size = 16
        n_classes = 10
        torch.manual_seed(12 + rank)
        y_true = torch.randint(0, 2, size=(n_iters * batch_size, n_classes, 8, 10)).to(device)
        y_preds = torch.randint(0, 2, size=(n_iters * batch_size, n_classes, 8, 10)).to(device)

        def update(engine, i):
            return (
                y_preds[i * batch_size : (i + 1) * batch_size, ...],
                y_true[i * batch_size : (i + 1) * batch_size, ...],
            )

        engine = Engine(update)
        acc = Accuracy(is_multilabel=True, device=metric_device)
        acc.attach(engine, "acc")
        data = list(range(n_iters))
        engine.run(data=data, max_epochs=n_epochs)

        y_true = idist.all_gather(y_true)
        y_preds = idist.all_gather(y_preds)

        assert (
            acc._num_correct.device == metric_device
        ), f"{type(acc._num_correct.device)}:{acc._num_correct.device} vs {type(metric_device)}:{metric_device}"

        assert "acc" in engine.state.metrics
        res = engine.state.metrics["acc"]
        if isinstance(res, torch.Tensor):
            res = res.cpu().numpy()

        true_res = accuracy_score(to_numpy_multilabel(y_true), to_numpy_multilabel(y_preds))
        assert pytest.approx(res) == true_res

    metric_devices = ["cpu"]
    if device.type != "xla":
        metric_devices.append(idist.device())
    for metric_device in metric_devices:
        for _ in range(2):
            _test(n_epochs=1, metric_device=metric_device)
            _test(n_epochs=2, metric_device=metric_device)

@vfdev-5 (Collaborator) commented Aug 31, 2022

@sadra-barikbin can you check why tests/ignite/metrics/test_precision.py::test_binary_input[None] is failing systematically?

E       assert array([1.]) == approx([1.0 ± 1.0e-06, 0.0 ± 1.0e-12])
E         Impossible to compare arrays with different shapes.
E         Shapes: (2,) and (1,)

tests/ignite/metrics/test_precision.py:130: AssertionError

@puhuk (Contributor) commented Dec 7, 2022

@kamalojasv181 Hi, do you need any help to finalize this PR? Please feel free to let me and @vfdev-5 know :)

@ili0820 commented Aug 27, 2023

Any updates on this? If it is not finished yet, I'd love to contribute @vfdev-5

@vfdev-5 (Collaborator) commented Aug 27, 2023

Yes, this PR is not finished, unfortunately. @ili0820, if you can help get it landed, that would be great!

@ili0820 commented Aug 30, 2023

After reviewing all the comments, as far as I understand the testing part is the last and only problem that has not been resolved yet: this seems resolved and this is the problem. Is this correct? @vfdev-5

@vfdev-5 (Collaborator) commented Aug 30, 2023

After reviewing all the comments, as far as I understand the testing part is the last and only problem that has not been resolved yet: this seems resolved and this is the problem. Is this correct? @vfdev-5

@ili0820 yes, a few things remain here:

In case you are not familiar with DDP, please check https://pytorch-ignite.ai/tutorials/advanced/01-collective-communication/. As for testing best practices, we would now like to use the distributed fixture, as here:

def test_distrib_integration(distributed, metric_device):

If you have any questions, you can reach out to us on Discord in the #start-contributing channel.

@ili0820 ili0820 mentioned this pull request Aug 31, 2023
@exitflynn exitflynn mentioned this pull request Mar 19, 2025