early stop when all sequence reach EOS #57

je1lee · 2024-04-09T05:49:06Z

With model.generate() it takes too long even sequence generation have done earlier with EOS token. Because now, it generates til it reached to output_len

fix the generate method to stop when every sequence has generated EOS token

je1lee · 2024-04-16T03:57:37Z

@pengchongjin any idea for this?

pengchongjin · 2024-05-29T16:08:18Z

Thanks for the change. Could you please paste a few example outputs before and after this change?

Also please make sure to test both run.py and run_xla.py. Thanks!

je1lee · 2024-06-03T06:03:03Z

@pengchongjin
test done with both scripts

BEFORE

model generates token regardless of eos token, so time spent in generation increases quadratically as output_len increases

AFTER

model stop generate when model samples out eos token time spent in generation remain still as output_len increases

je1lee and others added 4 commits April 9, 2024 05:43

fix: early stop when all sequence reach EOS

55a1c73

style: tab in line

488e5f2

fix: (xla) early stop when all sequence reach EOS

815a0c9

Merge branch 'google:main' into fix/earlystop

e1d6092

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

early stop when all sequence reach EOS #57

early stop when all sequence reach EOS #57

Uh oh!

je1lee commented Apr 9, 2024 •

edited

Loading

je1lee commented Apr 16, 2024

pengchongjin commented May 29, 2024

je1lee commented Jun 3, 2024 •

edited

Loading

Labels

2 participants

early stop when all sequence reach EOS #57

Are you sure you want to change the base?

early stop when all sequence reach EOS #57

Uh oh!

Conversation

je1lee commented Apr 9, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

je1lee commented Apr 16, 2024

pengchongjin commented May 29, 2024

je1lee commented Jun 3, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Labels

2 participants

je1lee commented Apr 9, 2024 •

edited

Loading

je1lee commented Jun 3, 2024 •

edited

Loading