Static Public Member Functions
static std::vector< double >	unit_weights (const TeamLocality_t &tloc)
	Shared memory bandwidth capacities of every unit factored by the mean memory bandwidth capacity of all units in the team. More...

Detailed Description

Definition at line 63 of file LoadBalancePattern.h.

Member Function Documentation

◆ unit_weights()

static std::vector<double> dash::BytesPerCycleMeasure::unit_weights ( const TeamLocality_t & tloc )

inlinestatic

Shared memory bandwidth capacities of every unit factored by the mean memory bandwidth capacity of all units in the team.

Consequently, a vector of 1's is returned if all units have identical memory bandwidth.

The memory bandwidth balancing weight for a unit is relative to the bytes/cycle measure of its affine core and considers the lower bound ("maximum of minimal") throughput between the unit to any other unit in the host system's shared memory domain.

This is mostly relevant for accelerators that have no direct access to the host system's shared memory. For example, Intel MIC accelerators are connected to the host with a 6.2 GB/s PCIE bus and a single MIC core operates at 1.1 Ghz with 4 hardware threads. The resulting measure (bytes/cycle) is calculated as:

Mpk = 6.2 GB/s Cpk = 1.1 Ghz * 4 = 4.4 G cycles/s BpC = Mpk / Cpk = 5.63 bytes/cycle

The principal idea is that any data used in operations on the MIC target must be moved over the slow PCIE interconnect first. The offload overhead therefore reduces the amount of data assigned to a MIC accelerator, despite its superior ops/s performance.

Definition at line 98 of file LoadBalancePattern.h.

   {
     std::vector<double> unit_mem_perc;
 
 #if 0
     // TODO: Calculate and assign neutral weights for units located at
     //       cores with unknown memory bandwidth.
 
     std::vector<size_t> unit_mem_capacities;
     size_t total_mem_capacity = 0;
 
     // Calculate average memory bandwidth first:
     for (auto u : tloc.units()) {
       auto & unit_loc     = tloc.unit_locality(u);
       size_t unit_mem_cap = std::max<int>(0, unit_loc.max_shmem_mbps());
       if (unit_mem_cap > 0) {
         total_mem_capacity += unit_mem_cap;
       }
       unit_mem_capacities.push_back(unit_mem_cap);
     }
     if (total_mem_capacity == 0) {
       total_mem_capacity = tloc.units().size();
     }
     DASH_LOG_TRACE_VAR("LoadBalancePattern.init_mem_bandwidth_weights",
                        total_mem_capacity);
     DASH_LOG_TRACE_VAR("LoadBalancePattern.init_mem_bandwidth_weights",
                        unit_mem_capacities);
 
     double avg_mem_capacity = static_cast<double>(total_mem_capacity) /
                               tloc.units().size();
 
     // Use average value for units with unknown memory bandwidth:
     for (auto membw = unit_mem_capacities.begin();
          membw != unit_mem_capacities.end(); ++membw) {
       if (*membw <= 0) {
         *membw = avg_mem_capacity;
       }
     }
 #endif
 
     std::vector<double> unit_bytes_per_cycle;
     double total_bytes_per_cycle = 0;
 
     // Calculating bytes/cycle per core for every unit:
     for (auto u : tloc.global_units()) {
       auto   unit_loc     = tloc.unit_locality(u);
       double unit_mem_bw  = std::max<int>(0, unit_loc.max_shmem_mbps());
       double unit_core_fq = unit_loc.num_threads() *
                             unit_loc.cpu_mhz();
       double unit_bps     = unit_mem_bw / unit_core_fq;
       unit_bytes_per_cycle.push_back(unit_bps);
       total_bytes_per_cycle += unit_bps;
     }
 
     double avg_bytes_per_cycle =
       static_cast<double>(total_bytes_per_cycle) / tloc.global_units().size();
 
     unit_mem_perc.reserve(unit_bytes_per_cycle.size());
     for (auto unit_bps : unit_bytes_per_cycle) {
       unit_mem_perc.push_back(unit_bps / avg_bytes_per_cycle);
     }
     return unit_mem_perc;
   }

The documentation for this class was generated from the following file:

/tmp/tmporruphar/dash/include/dash/pattern/LoadBalancePattern.h

Static Public Member Functions

Detailed Description

Member Function Documentation

◆ unit_weights()