🔬This is a nightly-only experimental API. (
stdsimd
#48556)Available on
target_arch="nvptx64"
only.Expand description
NVPTX intrinsics (experimental)
These intrinsics form the foundation of the CUDA programming model.
The reference is the CUDA C Programming Guide. Relevant is also the LLVM NVPTX Backend documentation.
Functions
- __assert_fail⚠ExperimentalSyscall to be used whenever the assert expression produces a
false
value. - _block_dim_x⚠Experimentalx-th thread-block dimension.
- _block_dim_y⚠Experimentaly-th thread-block dimension.
- _block_dim_z⚠Experimentalz-th thread-block dimension.
- _block_idx_x⚠Experimentalx-th thread-block index.
- _block_idx_y⚠Experimentaly-th thread-block index.
- _block_idx_z⚠Experimentalz-th thread-block index.
- _grid_dim_x⚠Experimentalx-th block-grid dimension.
- _grid_dim_y⚠Experimentaly-th block-grid dimension.
- _grid_dim_z⚠Experimentalz-th block-grid dimension.
- _syncthreads⚠ExperimentalSynchronizes all threads in the block.
- _thread_idx_x⚠Experimentalx-th thread index.
- _thread_idx_y⚠Experimentaly-th thread index.
- _thread_idx_z⚠Experimentalz-th thread index.
- free⚠ExperimentalFree previously dynamically allocated memory.
- malloc⚠ExperimentalAllocate memory dynamically from a fixed-size heap in global memory.
- trap⚠ExperimentalGenerates the trap instruction
TRAP
- vprintf⚠ExperimentalPrint formatted output from a kernel to a host-side output stream.