我一直在使用NVIDIA分析器(nvprof),但有两个指标我不太明白:
inst_inter_thread_communication
Number of inter-thread communication instructions executed by non-predicated threads
inst_misc
Number of miscellaneous instructions executed by non-predicated threads
我想知道哪些指令属于线程间通信指令,哪些属于其他杂项指令。
参考: http://docs.nvidia.com/cuda/profiler-users-guide/#metrics-reference