ENH Add zero division handling to cohen_kappa_score by StefanieSenger · Pull Request #31172 · scikit-learn/scikit-learn

StefanieSenger · 2025-04-10T14:18:16Z

Reference Issues/PRs

What does this implement/fix? Explain your changes.

This PR adds a replace_undefined_by param to cohen_kappa_score to deal with cases of division by zero.
Also adds tests.

github-actions · 2025-04-10T14:19:38Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

_{Generated for commit: ed62103. Link to the linter CI: here}

StefanieSenger · 2025-04-10T15:14:34Z

sklearn/metrics/_classification.py

+        msg = (
+            "`y2` does not contain any label that is also both present in `y1` and in "
+            "`labels`. cohen_kappa_score is undefined and set to the value defined in "
+            "the `replace_undefined_by` param, which defaults to 0.0."
+        )


In my thinking, this warning message also covers cases where both y1 and y2 are empty (as in "test case: empty inputs"). At least it would motivate people to inspect their data and then they might find out if their data is empty.

However, we could also branch here and make a separate warning message for empty inputs.

StefanieSenger · 2025-04-10T16:34:58Z

sklearn/metrics/_classification.py

+            "`y1` and `y2` only have one label in common that is also in `labels`. "
+            "cohen_kappa_score is undefined and set to the value defined in the "
+            "`replace_undefined_by` param, which defaults to 0.0."
+        )


In my thinking, this message also fits the cases when y1 and y2 only deal with one label (as in "test case: both inputs only have one label").

StefanieSenger · 2025-04-12T16:46:16Z

sklearn/metrics/_classification.py

+    mgs_changing_default = (
+        "The default return value of `cohen_kappa_score` in case of a division "
+        "by zero has been deprecated in 1.7 and will be changed to 0.0 in version "
+        "1.9. Set `replace_undefined_by=0.0` to use the new default and to silence "
+        "this Warning."
+    )


I would pick 0.0 as a future default (instead of -1.0 which is the worst score), because it is the least expressive of the scores, representing matching labels by chance.

If users would use cohen_kappa_score as part of their custom metric, that calculates the mean over several cohen_kappa_scores, 0.0 would be a neutral element like the "ignore" option that we have talked about in this comment: #29048 (comment)

virchan

Thank you for the PR @StefanieSenger!

Overall, I think the replace_undefined_by parameter behaves as expected, as discussed in #29048 (comment).

I just have a few minor suggestions—otherwise, LGTM!

sklearn/metrics/_classification.py

virchan · 2025-04-30T05:44:48Z

sklearn/metrics/tests/test_classification.py

+@pytest.mark.parametrize("replace_undefined_by", [0.0, np.nan])
+def test_cohen_kappa_zero_division(replace_undefined_by):


The test function looks good to me overall—my comment is just a minor nitpick:

Suggested change

@pytest.mark.parametrize("replace_undefined_by", [0.0, np.nan])

def test_cohen_kappa_zero_division(replace_undefined_by):

@pytest.mark.parametrize(

"test_case",

[

([], [], None, None, None),

([1] * 5 + [2] * 5, [3] * 10, [1, 2], None, None),

([3] * 10, [3] * 10, None, None, None),

([1] * 5 + [2] * 5, [3] * 10, [1, 2], "linear", None),

],

)

@pytest.mark.parametrize("replace_undefined_by", [0.0, np.nan])

def test_cohen_kappa_zero_division(replace_undefined_by):

This way, we could also simplify the unpacking slightly:

y1, y2, labels, weights, sample_weight = test_case y_1, y2 = np.array(y1), np.array(y2)

and only need one assert:

assert check_equal( cohen_kappa_score( y1, y2, labels=labels, weights=weights, replace_undefined_by=replace_undefined_by, ), replace_undefined_by, )

Totally optional though—happy for you to resolve this if you’d prefer to keep it as is.

That's a good suggestion.

I have made this test leaner, I couldn't do without commenting every single test case though. 🤷 To help the next person looking at this see why these test cases trigger a division by zero.

Co-authored-by: Virgil Chan <virchan.math@gmail.com>

StefanieSenger

Thanks a lot for your review, @virchan!
That was pretty helpful and I have applied all your suggestions.

virchan

LGTM! Thanks @StefanieSenger!

@adrinjalali, would you like to have a look?

adrinjalali

Otherwise LGTM.

sklearn/metrics/_classification.py

Co-authored-by: Adrin Jalali <adrin.jalali@gmail.com>

StefanieSenger · 2025-05-05T10:55:22Z

Thank you for reviewing, @adrinjalali! I have addressed your suggestions. Would you have another look?

adrinjalali · 2025-05-06T07:35:54Z

The only thing now here that I don't find natural is that the default value of the parameter would become 0 after the deprecation cycle. I think the default value should be np.nan to make sure users actually notice it's undefined, and if they want to make it something else, they can.

Having 0 as the default value for undefined seems a bit odd to me.

Not sure what @glemaitre thinks about this.

sklearn/metrics/_classification.py

StefanieSenger · 2025-11-21T11:08:06Z

I'd be happy for reviews, maybe @adrinjalali would like to have a look?

adrinjalali

I'm a bit lost as which comments are relevant from the review history @StefanieSenger , please let me know which discussions are open.

sklearn/metrics/_classification.py

StefanieSenger · 2025-12-15T12:40:01Z

I'm a bit lost as which comments are relevant from the review history @StefanieSenger , please let me know which discussions are open.

Only this one (#31172 (comment)), @adrinjalali.

Edit: I answered to your and @jeremiedbb's suggestions here. Please let me know what you think.

StefanieSenger · 2025-12-19T15:23:59Z

After pulling from main, the tests fail. I will investigate, but on another day. Maybe a newly added test.

StefanieSenger · 2025-12-21T07:10:53Z

After pulling from main, the tests fail. I will investigate, but on another day. Maybe a newly added test.

Yes, it came from #32549, where a new handling for empty inputs was added. I have removed the obsolete test case from here.

sklearn/metrics/_classification.py

jeremiedbb

LGTM. Thanks !

adrinjalali

Minor point to improve the warning message, otherwise LGTM. Feel free to merge if CI is green after fixing the message.

sklearn/metrics/_classification.py

adrinjalali · 2026-01-27T14:20:35Z

sklearn/metrics/_classification.py

+    msg_zero_division = (
+        "`y1`, `y2` and `labels` have only one label in common. "
+        "`cohen_kappa_score` is undefined and set to the value defined by the "
+        "`replace_undefined_by` param, which defaults to `np.nan`."


Co-authored-by: Adrin Jalali <adrin.jalali@gmail.com>

StefanieSenger · 2026-01-28T10:32:04Z

Thanks reviewing, @virchan, @jeremiedbb and @adrinjalali. I'm happy about the result.
I've enabled auto-merge.

ENH Add zero division handling to cohen_kappa_score

5a000a0

github-actions bot added the module:metrics label Apr 10, 2025

StefanieSenger added 2 commits April 10, 2025 16:23

add changelog

02fd573

add warnings raised in case of zero division

2d84ded

StefanieSenger commented Apr 10, 2025

View reviewed changes

refine test comments

4b00d9f

StefanieSenger commented Apr 10, 2025

View reviewed changes

StefanieSenger added this to the 1.7 milestone Apr 10, 2025

StefanieSenger added 4 commits April 10, 2025 23:02

correct version

f58492a

improve docstring of test

245da3e

wording

ede386e

add deprecation cycle for default behaviour if zero division

b93b445

StefanieSenger commented Apr 12, 2025

View reviewed changes

StefanieSenger and others added 3 commits April 19, 2025 07:25

Merge branch 'main' into undefined_cohen_kappa_score

612b800

fix linting

a7f4ba6

Merge branch 'main' into undefined_cohen_kappa_score

375d204

virchan reviewed Apr 30, 2025

View reviewed changes

StefanieSenger and others added 4 commits April 30, 2025 16:06

Apply suggestions from code review

6d8e59b

Co-authored-by: Virgil Chan <virchan.math@gmail.com>

clean up test and correct warning message

973b219

Merge branch 'main' into undefined_cohen_kappa_score

3947409

leaner test

2ee10a3

StefanieSenger commented Apr 30, 2025

View reviewed changes

virchan approved these changes Apr 30, 2025

View reviewed changes

adrinjalali reviewed May 5, 2025

View reviewed changes

sklearn/metrics/_classification.py Outdated Show resolved Hide resolved

sklearn/metrics/_classification.py Outdated Show resolved Hide resolved

StefanieSenger and others added 3 commits May 5, 2025 12:11

Apply suggestions from code review

13af4c8

Co-authored-by: Adrin Jalali <adrin.jalali@gmail.com>

deal with zero division in helper function

703eaae

Merge branch 'main' into undefined_cohen_kappa_score

c194eae

fix np.isclose

4249d18

StefanieSenger commented Nov 17, 2025

View reviewed changes

sklearn/metrics/_classification.py Outdated Show resolved Hide resolved

empty commit to re-trigger CI

ed62103

adrinjalali added this to Labs Dec 10, 2025

adrinjalali moved this to Todo in Labs Dec 10, 2025

StefanieSenger self-assigned this Dec 11, 2025

StefanieSenger moved this from Todo to In progress in Labs Dec 11, 2025

adrinjalali unassigned StefanieSenger Dec 11, 2025

adrinjalali reviewed Dec 15, 2025

View reviewed changes

sklearn/metrics/_classification.py Outdated Show resolved Hide resolved

sklearn/metrics/_classification.py Outdated Show resolved Hide resolved

adrinjalali moved this from In progress to In progress - High Priority in Labs Dec 15, 2025

StefanieSenger and others added 2 commits December 18, 2025 11:06

remove line

725b4f7

Merge branch 'main' into undefined_cohen_kappa_score

169b898

remove obsolete test case

83d42bb

Merge branch 'main' into undefined_cohen_kappa_score

0196de5

jeremiedbb reviewed Jan 12, 2026

View reviewed changes

sklearn/metrics/_classification.py Outdated Show resolved Hide resolved

sklearn/metrics/_classification.py Outdated Show resolved Hide resolved

sklearn/metrics/_classification.py Outdated Show resolved Hide resolved

StefanieSenger and others added 2 commits January 16, 2026 13:55

apply suggestions from code review

13af497

Merge branch 'main' into undefined_cohen_kappa_score

5d1eecd

jeremiedbb approved these changes Jan 17, 2026

View reviewed changes

adrinjalali approved these changes Jan 27, 2026

View reviewed changes

StefanieSenger and others added 2 commits January 28, 2026 11:26

Update sklearn/metrics/_classification.py

efffedf

Co-authored-by: Adrin Jalali <adrin.jalali@gmail.com>

nicer error message

6bf94c4

StefanieSenger enabled auto-merge (squash) January 28, 2026 10:32

StefanieSenger merged commit be7ec61 into scikit-learn:main Jan 28, 2026
37 checks passed

github-project-automation bot moved this from In progress - High Priority to Done in Labs Jan 28, 2026

StefanieSenger deleted the undefined_cohen_kappa_score branch January 28, 2026 11:13

		@pytest.mark.parametrize("replace_undefined_by", [0.0, np.nan])
		def test_cohen_kappa_zero_division(replace_undefined_by):

Uh oh!

Conversation

StefanieSenger commented Apr 10, 2025

Reference Issues/PRs

What does this implement/fix? Explain your changes.

github-actions bot commented Apr 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✔️ Linting Passed

StefanieSenger Apr 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

StefanieSenger Apr 10, 2025

Choose a reason for hiding this comment

StefanieSenger Apr 12, 2025

Choose a reason for hiding this comment

virchan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

virchan Apr 30, 2025

Choose a reason for hiding this comment

StefanieSenger Apr 30, 2025

Choose a reason for hiding this comment

StefanieSenger left a comment

Choose a reason for hiding this comment

virchan left a comment

Choose a reason for hiding this comment

adrinjalali left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

StefanieSenger commented May 5, 2025

adrinjalali commented May 6, 2025

Uh oh!

StefanieSenger commented Nov 21, 2025

adrinjalali left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

StefanieSenger commented Dec 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

StefanieSenger commented Dec 19, 2025

StefanieSenger commented Dec 21, 2025

Uh oh!

Uh oh!

Uh oh!

jeremiedbb left a comment

Choose a reason for hiding this comment

adrinjalali left a comment

Choose a reason for hiding this comment

Uh oh!

adrinjalali Jan 27, 2026

Choose a reason for hiding this comment

StefanieSenger commented Jan 28, 2026

Uh oh!

Labels

6 participants

github-actions bot commented Apr 10, 2025 •

edited

Loading

StefanieSenger Apr 10, 2025 •

edited

Loading

StefanieSenger commented Dec 15, 2025 •

edited

Loading