Thin C++-flavored wrappers for the CUDA Runtime API
Public Member Functions | Static Public Member Functions | Public Attributes | List of all members
cuda::device::compute_capability_t Struct Reference

A numeric designator of the computational capabilities of a CUDA device. More...

#include <device_properties.hpp>

Collaboration diagram for cuda::device::compute_capability_t:
Collaboration graph

Public Member Functions

constexpr unsigned major () const
unsigned constexpr minor () const
constexpr unsigned as_combined_number () const noexcept
constexpr bool is_valid () const noexcept
unsigned max_warp_schedulings_per_processor_cycle () const
unsigned max_resident_warps_per_processor () const
unsigned max_in_flight_threads_per_processor () const
memory::shared::size_t max_shared_memory_per_block () const

Static Public Member Functions

static constexpr compute_capability_t from_combined_number (unsigned combined) noexcept

Public Attributes

compute_architecture_t architecture
unsigned minor_

Detailed Description

A numeric designator of the computational capabilities of a CUDA device.

Wikipedia has a <a href=""table listing the specific features and capabilities for different CC values.

Member Function Documentation

◆ max_shared_memory_per_block()

memory::shared::size_t cuda::device::compute_capability_t::max_shared_memory_per_block ( ) const
On some architectures, the shared memory / L1 balance is configurable, so you might not get the maxima here without making this configuration setting

The documentation for this struct was generated from the following file: