Fix computation of cache-line size on Power #6515

keithc-ca · 2022-05-12T19:22:34Z

We need to inform the compiler that our uses of dcbz clobber memory, otherwise the optimizer can reasonably assume, e.g. in omrcpu_startup(), that buf[0] == 255 at the beginning of the for loop.

pshipton · 2022-05-12T19:43:50Z

port/unix/omrcpu.c

-		:"r"((void *) &buf[512]));
+	memset(buf, 255, sizeof(buf));
+
+	__asm__ __volatile__(


I suspect the __volatile__ isn't needed. The port library copy doesn't use it.

Ah nm, I see you just added it here. I guess it doesn't hurt.

It was used in both places in j9memclr.cpp and won't hurt. In fact, it may save us from a future compiler optimization.

pshipton · 2022-05-13T03:06:08Z

jenkins build all

pshipton · 2022-05-16T14:15:02Z

@jdmpapin @dsouzai is there a committer that can look at this?

jdmpapin · 2022-05-16T15:42:59Z

The changes LGTM, but can the cache line size detection be factored out into a single place?

keithc-ca · 2022-05-16T15:59:10Z

The changes LGTM, but can the cache line size detection be factored out into a single place?

I'll look into whether there's a clear path to that.

keithc-ca · 2022-05-16T18:25:49Z

Updated as suggested, reusing getCacheLineSize() in omrcpu_startup() and using uint32_t as the storage type for the cache line size everywhere.

jdmpapin · 2022-05-16T18:42:03Z

The description of the functional fix (i.e. the memory clobber) seems to have gone missing from the commit message. It's probably more important now than it was before, since there are more changes

The compiler needs to be told that our uses of dcbz clobber memory, otherwise the optimizer can reasonably assume in getCacheLineSize(), that each element of buf still has the value (255) assigned by memset(). * reuse getCacheLineSize() in omrcpu_startup() * use uint32_t consistently for cache line size Signed-off-by: Keith W. Campbell <keithc@ca.ibm.com>

keithc-ca · 2022-05-16T19:20:59Z

That description was never in the commit message itself (only in the description here); it's now also in the commit message.

jdmpapin · 2022-05-16T21:16:44Z

Jenkins build all

This allows the compiler to do the widening operation once, rather than repeatedly in loops, fixing a performance regression introduced in eclipse-omr#6515. Signed-off-by: Keith W. Campbell <keithc@ca.ibm.com>

keithc-ca requested review from charliegracie, rwy7 and youngar as code owners May 12, 2022 19:22

keithc-ca mentioned this pull request May 12, 2022

Getting cache line size doesn't work on plinux with gcc 10 eclipse-openj9/openj9#15056

Closed

github-actions bot added the comp:port label May 12, 2022

keithc-ca marked this pull request as draft May 12, 2022 19:28

keithc-ca force-pushed the ppc_cache_line_size branch from d538b0a to df47457 Compare May 12, 2022 19:40

pshipton reviewed May 12, 2022

View reviewed changes

pshipton approved these changes May 12, 2022

View reviewed changes

keithc-ca marked this pull request as ready for review May 13, 2022 01:57

keithc-ca requested a review from mstoodle as a code owner May 13, 2022 01:57

AdamBrousseau mentioned this pull request May 13, 2022

Change jdk17 linux compilers to gcc 10.3 eclipse-openj9/openj9#14799

Closed

babsingh mentioned this pull request May 13, 2022

OSX PR builds: OMR Socket Test Failures #6516

Open

keithc-ca force-pushed the ppc_cache_line_size branch from df47457 to 18814e6 Compare May 16, 2022 18:24

keithc-ca force-pushed the ppc_cache_line_size branch from 18814e6 to 8b3bf5b Compare May 16, 2022 19:20

jdmpapin approved these changes May 16, 2022

View reviewed changes

jdmpapin merged commit c67d754 into eclipse-omr:master May 16, 2022

keithc-ca deleted the ppc_cache_line_size branch May 17, 2022 12:17

pshipton mentioned this pull request May 19, 2022

Remove exception for avoiding use of gcc10 with openj9 adoptium/temurin-build#2833

Merged

keithc-ca mentioned this pull request Jun 23, 2022

Use a wider type for local variables holding the cache line size #6587

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix computation of cache-line size on Power #6515

Fix computation of cache-line size on Power #6515

Uh oh!

keithc-ca commented May 12, 2022 •

edited

Loading

Uh oh!

pshipton May 12, 2022

Uh oh!

pshipton May 12, 2022

Uh oh!

keithc-ca May 12, 2022

Uh oh!

pshipton commented May 13, 2022

Uh oh!

pshipton commented May 16, 2022 •

edited

Loading

Uh oh!

jdmpapin commented May 16, 2022

Uh oh!

keithc-ca commented May 16, 2022

Uh oh!

keithc-ca commented May 16, 2022

Uh oh!

jdmpapin commented May 16, 2022

Uh oh!

keithc-ca commented May 16, 2022

Uh oh!

jdmpapin commented May 16, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Fix computation of cache-line size on Power #6515

Fix computation of cache-line size on Power #6515

Uh oh!

Conversation

keithc-ca commented May 12, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pshipton May 12, 2022

Choose a reason for hiding this comment

Uh oh!

pshipton May 12, 2022

Choose a reason for hiding this comment

Uh oh!

keithc-ca May 12, 2022

Choose a reason for hiding this comment

Uh oh!

pshipton commented May 13, 2022

Uh oh!

pshipton commented May 16, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jdmpapin commented May 16, 2022

Uh oh!

keithc-ca commented May 16, 2022

Uh oh!

keithc-ca commented May 16, 2022

Uh oh!

jdmpapin commented May 16, 2022

Uh oh!

keithc-ca commented May 16, 2022

Uh oh!

jdmpapin commented May 16, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

keithc-ca commented May 12, 2022 •

edited

Loading

pshipton commented May 16, 2022 •

edited

Loading