CUDA: Error handling variables not added to the `@llvm.used` list #9526

#9267 fixed dropping of kernels by `pynvjitlink` by adding kernels to the `@llvm.used` list.

We also add global variables for representing an error handling state:

https://github.com/numba/numba/blob/03f2722e624d46223e3c95bc3910905b72d9a24d/numba/cuda/target.py#L219-L224

The variables seem to get optimized away when LTO is used with `pynvjitlink`. I suspect they should also be added to the `@llvm.used` list to prevent this. From the perspective of device code they are only ever written to, so they look unneeded; only the host reads their values, after kernel execution.

	def define_error_gv(postfix):
	    name = wrapfn.name + postfix
	    gv = cgutils.add_global_variable(wrapper_module, ir.IntType(32),
	                                     name)
	    gv.initializer = ir.Constant(gv.type.pointee, None)
	    return gv
