A numeric designator of the computational capabilities of a CUDA device.
More...
#include <device_properties.hpp>
A numeric designator of the computational capabilities of a CUDA device.
- Note
- The CUDA programming guide has tables (<a href="https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#features-and-technical-specifications-feature-support-per-compute-capability"this one and <a href="https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#features-and-technical-specifications-technical-specifications-per-compute-capability"this one) listing the specific features and capabilities for different CC values.
◆ as_combined_number()
constexpr unsigned cuda::device::compute_capability_t::as_combined_number |
( |
| ) |
const |
|
noexcept |
Produces a single-number representation of the compute capability.
In certain contexts (e.g. compiler command-line parameters), compute capabilities are specified by a single number, e.g. 75 for major 7, minor 5. This perform one direction of the conversion; see also {from_combined_number}.
◆ from_combined_number()
static constexpr compute_capability_t cuda::device::compute_capability_t::from_combined_number |
( |
unsigned |
combined | ) |
|
|
staticnoexcept |
Converts a single-number representation of a compute capability into a proper structured instance of this class.
In certain contexts (e.g. compiler command-line parameters), compute capabilities are specified by a single number, e.g. 75 for major 7, minor 5. This perform one direction of the conversion; see also {as_combined_number}.
◆ is_valid()
constexpr bool cuda::device::compute_capability_t::is_valid |
( |
| ) |
const |
|
noexcept |
- Returns
- true if there actually are any GPUs listed with this combination of major and minor compute capability numbers
◆ max_in_flight_threads_per_processor()
unsigned cuda::device::compute_capability_t::max_in_flight_threads_per_processor |
( |
| ) |
const |
- Note
- : Based on _ConvertSMVer2Cores() in the CUDA samples helper code
◆ max_shared_memory_per_block()
- Note
- On some architectures, the shared memory / L1 balance is configurable, so that this may not be the current, actual maximum a specific kernel can use at a specific point.
◆ minor_
unsigned cuda::device::compute_capability_t::minor_ |
The minor designator, indicating mostly numeric choices of capabilities (e.g.
how many SMs, homw much memory, whether the numbers of functional units will be skewed more towards double-precision or integer operations etc.)
The documentation for this struct was generated from the following file: