Conversation

@ivarflakstad (Member):
What does this PR do?

Adds expected results to certain tests that had slightly different results on AMD devices.

Comment on lines 2700 to 2705
cuda = [" Folks, I spend a lot of time right over there, night after night after night, actually. Carefully selecting for you the day's noosiest, most aerodynamic headlines, stress testing, and those topical anti-lock breaks and power steering, painstakingly stitching, leather seating so soft, it would make JD power and her associates blush to create the luxury sedan that is my nightly monologue. But sometimes, you sometimes, folks. I lurched a consciousness in the back of an abandoned school bus and slap myself awake."]
cuda1 = [" Folks, I spend a lot of time right over there, night after night after night, actually. Carefully selecting for you the day's noosiest, most aerodynamic headlines, stress testing, and those topical anti-lock breaks and power steering, painstakingly stitching, leather seating so soft, it would make JD power and her associates blush to create the luxury sedan that is my nightly monologue. But sometimes, you sometimes, folks. I lurched a consciousness in the back of an abandoned school bus and slap myself a wig."]
rocm = [" Folks, I spend a lot of time right over there, night after night after night, actually. Carefully selecting for you the day's noosiest, most aerodynamic headlines, stress testing, and those topical anti-lock breaks and power steering, painstakingly stitching, leather seating, so soft, it would make JD power and her associates blush to create the luxury sedan that is my nightly monologue. But sometimes, you sometimes, folks, I lurched a consciousness in the back of an abandoned school bus and slap myself awake."]
# fmt: on
expected_output = Expectations({("cuda", None): cuda, ("rocm", None): rocm}).get_expectation()
expected_output1 = Expectations({("cuda", None): cuda1, ("rocm", None): rocm}).get_expectation()
Member Author:
This one is interesting. Notice how cuda and cuda1 end with different words.
This did not happen when I ran it on the MI325. If the second result is supposed to end with "wig", then we have to look into it.
@eustlb thoughts? ☺️

Contributor:
It seems like the cuda output uses this generation config while cuda1 uses a different generation config, so they are expected to produce different results.

So I think with rocm you should generate another output with the same config as cuda1, to have something like rocm1.
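The suggested structure could look like the following sketch. The transcript strings are shortened placeholders (not real model outputs), the `rocm1` entry is explicitly a value still to be regenerated on an MI325, and the `get_expectation` helper is a plain stand-in for the device lookup done by `transformers.testing_utils.Expectations`, so the sketch runs standalone:

```python
# Hypothetical sketch: pair each generation config with its own per-device
# expectation, mirroring the existing cuda/cuda1 split with a rocm/rocm1 split.
# Transcript strings are shortened placeholders, not actual Whisper outputs.
cuda = ["...slap myself awake."]    # first generation config, CUDA
cuda1 = ["...slap myself a wig."]   # second generation config, CUDA
rocm = ["...slap myself awake."]    # first generation config, ROCm
rocm1 = ["<regenerate with the second config on MI325>"]  # placeholder

# Stand-in for Expectations(...).get_expectation(): select by device type.
def get_expectation(per_device, device_type):
    return per_device[device_type]

# On a ROCm runner, each expected output comes from its own config's entry.
expected_output = get_expectation({"cuda": cuda, "rocm": rocm}, "rocm")
expected_output1 = get_expectation({"cuda": cuda1, "rocm": rocm1}, "rocm")
```

This keeps the invariant the test is meant to check: each generation config has its own independently recorded expectation per device.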

Collaborator:
I think they could produce different results because the generation configs are not the same, yet it's possible for the two results to be identical. Even on cuda, the cuda and cuda1 generations are very close, so it's not a stretch to think that on rocm the output tokens are the same for both generations.

Member Author:
Right, but the purpose of the test is to verify that the generation config has the expected effect.
So we either have a rocm bug or need to adjust the config.

@ivarflakstad (Member Author):
run-slow: whisper

@github-actions (Contributor):

This comment contains run-slow, running the specified jobs:

models: ['models/whisper']
quantizations: [] ...

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@remi-or (Collaborator) left a comment:
LGTM just add precise versions for ROCm please!

cuda1 = [" Folks, I spend a lot of time right over there, night after night after night, actually. Carefully selecting for you the day's noosiest, most aerodynamic headlines, stress testing, and those topical anti-lock breaks and power steering, painstakingly stitching, leather seating so soft, it would make JD power and her associates blush to create the luxury sedan that is my nightly monologue. But sometimes, you sometimes, folks. I lurched a consciousness in the back of an abandoned school bus and slap myself a wig."]
rocm = [" Folks, I spend a lot of time right over there, night after night after night, actually. Carefully selecting for you the day's noosiest, most aerodynamic headlines, stress testing, and those topical anti-lock breaks and power steering, painstakingly stitching, leather seating, so soft, it would make JD power and her associates blush to create the luxury sedan that is my nightly monologue. But sometimes, you sometimes, folks, I lurched a consciousness in the back of an abandoned school bus and slap myself awake."]
# fmt: on
expected_output = Expectations({("cuda", None): cuda, ("rocm", None): rocm}).get_expectation()
Collaborator:
Specify version for rocm
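One way the requested per-version ROCm expectation could work is sketched below. This assumes the lookup prefers an exact `(device_type, version)` key and falls back to `(device_type, None)`; the `(9, 4)` version key and the toy `get_expectation` function are illustrative stand-ins, not the actual `transformers.testing_utils.Expectations` implementation:

```python
# Minimal stand-in for a version-aware expectation lookup: an exact
# (device_type, version) key wins over the generic (device_type, None) key.
def get_expectation(expectations, device_type, version=None):
    if (device_type, version) in expectations:
        return expectations[(device_type, version)]
    return expectations.get((device_type, None))

expectations = {
    ("cuda", None): "generic cuda transcript",
    ("rocm", None): "generic rocm transcript",
    ("rocm", (9, 4)): "MI325-specific transcript",  # hypothetical version key
}

print(get_expectation(expectations, "rocm", (9, 4)))  # MI325-specific transcript
print(get_expectation(expectations, "rocm"))          # generic rocm transcript
```

Pinning a version this way lets the generic `("rocm", None)` entry keep working on other AMD devices while the MI325 result is recorded separately.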

@ahadnagy (Contributor) left a comment:

Thanks Ivar! I think it's fine not to specify the exact platform versions until we need to differentiate.

@ivarflakstad (Member Author):

run-slow: whisper

@ivarflakstad ivarflakstad requested a review from remi-or August 27, 2025 11:15
@github-actions (Contributor):

This comment contains run-slow, running the specified jobs:

models: ['models/whisper']
quantizations: [] ...

@remi-or (Collaborator) left a comment:

Approved with one very minor nit.

@github-actions (Contributor):

[For maintainers] Suggested jobs to run (before merge)

run-slow: whisper

@ivarflakstad ivarflakstad enabled auto-merge (squash) August 27, 2025 16:11
@ivarflakstad ivarflakstad merged commit 3c343c6 into main Aug 27, 2025
19 checks passed
@ivarflakstad ivarflakstad deleted the amd-whisper-expectations branch August 27, 2025 16:19
6 participants