Skip to content
Started 5 mo 20 days ago
Took 6 hr 0 min
Aborted

Build #2 (Dec 1, 2023, 11:52:59 AM)

Changes
  1. implementation and tests (details / githubweb)
  2. implementation and tests (details / githubweb)
  3. #5635: Add parallel_scan overload with return value for ThreadVectorRange (details / githubweb)
  4. #5635: Add test for parallel_scan with return value for ThreadVectorRange (details / githubweb)
  5. address reviews [skip ci] (details / githubweb)
  6. formatting (details / githubweb)
  7. fix lambda capture (details / githubweb)
  8. remove unnecessary file, fix constraints (details / githubweb)
  9. remove file (details / githubweb)
  10. fix (details / githubweb)
  11. fix lambda capture and constraints (details / githubweb)
  12. keep only subset (details / githubweb)
  13. revert files (details / githubweb)
  14. remove file (details / githubweb)
  15. fix syntax (details / githubweb)
  16. format (details / githubweb)
  17. Add support for HIP Graph (details / githubweb)
  18. Do not use HIP Graph with ROCm 5.2 (details / githubweb)
  19. Fix reviewer's comments (details / githubweb)
  20. Adding is_scoped_enum & to_underlying (#6356) (details / githubweb)
  21. SIMD: add float simd support (#6177) (details / githubweb)
  22. team-level std algos: part 4 (#6208) (details / githubweb)
  23. Added a gen ctor for float (#6397) (details / githubweb)
  24. team-level std algos: part 5 (#6209) (details / githubweb)
  25. Deprecate Cuda(cudaStream_t, bool) (details / githubweb)
  26. Fixup checked interger operations death test (details / githubweb)
  27. Deprecate HIP(hipStream_t, bool) (details / githubweb)
  28. Let Kokkos_ENABLE_IMPL_CUDA_MALLOC_ASYNC be ON by default (details / githubweb)
  29. Print whether KOKKOS_ENABLE_IMPL_CUDA_MALLOC_ASYNC is defined (details / githubweb)
  30. Introduce disable_malloc_async Cuda option with generated makefiles (details / githubweb)
  31. Preserve one build that disables Kokkos_ENABLE_IMPL_CUDA_MALLOC_ASYNC (details / githubweb)
  32. Use archive extraction time for timestamps (details / githubweb)
  33. Disable performance benchmarks in AppVeyor CI (details / githubweb)
  34. team-level std algos: part 6 (#6210) (details / githubweb)
  35. address comments (details / githubweb)
  36. OpenMP backend refactor files. (#6403) (details / githubweb)
  37. Drop (unused) `Cuda::cuda_internal_maximum_shared_words` (details / githubweb)
  38. Drop check that the host backend is initialized before the Cuda/HIP/SYCL one (details / githubweb)
  39. Drop unused HIPInternal::m_maxSharedWords data member (details / githubweb)
  40. Drop unused HIPInternal::m_hipArch static data member (details / githubweb)
  41. !initialized() should be a precondition for calling {Cuda,HIP,SYCL}Internal::initialize (details / githubweb)
  42. Drop pre-Kepler logic in Cuda::impl_initialize (details / githubweb)
  43. use single (details / githubweb)
  44. address comments (details / githubweb)
  45. formatting (details / githubweb)
  46. Team-level std algos: part 7 (#6211) (details / githubweb)
  47. formatting (details / githubweb)
  48. Enable death tests for fedora rawhide (details / githubweb)
  49. core/src: Add half math functions to private header (#6124) (details / githubweb)
  50. Drop check whether device supports unified addressing (details / githubweb)
  51. fix single as per Christian's suggestion (details / githubweb)
  52. Only warn once (at initialization) when forcing allocation in unified memory (details / githubweb)
  53. check-copyright improvements (#6399) (details / githubweb)
  54. Use execution space instance argument to get device properties in block size deduction (details / githubweb)
  55. Address reviewer' comments (details / githubweb)
  56. Fix to avoid #186-D pointless comparison warning. (details / githubweb)
  57. add comment (details / githubweb)
  58. improve tests to address review (details / githubweb)
  59. Fix guard for isnan test for bhalf_t (details / githubweb)
  60. avoid potential race condition HIP (details / githubweb)
  61. [SYCL][Reduction] Group counter should use at least memory_order::acq_rel (details / githubweb)
  62. Same for scan (details / githubweb)
  63. Initialize m_num_scratch_locks for Cuda parallel_for TeamPolicy (details / githubweb)
  64. improve tests with intra-team result check (details / githubweb)
  65. Fixes for Kokkos::Array (#6372) (details / githubweb)
  66. try fix for unique, previous impl to remove later (details / githubweb)
  67. #5635: Add parallel_scan changes for CUDA and TeamThreadRange (details / githubweb)
  68. remove old impl (details / githubweb)
  69. #5635: Enable TeamThreadRange test for CUDA (details / githubweb)
  70. Clean up benchmarks/gups (details / githubweb)
  71. benchmark/gups: use CMake (details / githubweb)
  72. OpenMPTarget: Disable check for SIMD compiler macros (details / githubweb)
  73. #5635: Add parallel_scan with value for CUDA and ThreadVectorRange (details / githubweb)
  74. add missing assert (details / githubweb)
  75. #5635: Add test for parallel_scan with return value for ThreadVectorRange (details / githubweb)
  76. add intra team check for missing test (details / githubweb)
  77. fix intel compile error (details / githubweb)
  78. fix unreachable for intel (details / githubweb)
  79. re-enable unit tests for sort and random via makefile (#6422) (details / githubweb)
  80. OpenMPTarget init-join fix (#6444) (details / githubweb)
  81. Fix Cuda parallel_scan ThreadVectorRange range (details / githubweb)
  82. Assign final sum in Cuda parallel_scan ThreadVectorRange (details / githubweb)
  83. Fix compiling code using Kokkos::printf for OpenMPTarget on Intel GPUs (#6443) (details / githubweb)
  84. std_algos: improving min, max, minmax (#6421) (details / githubweb)
  85. team-level stdalgos: improve tests, check intra-team result matching (part 2/7) (#6426) (details / githubweb)
  86. Skip bessel function tests known to fail on Intel GPUs (#6434) (details / githubweb)
  87. team-level stdalgos: improve tests, check intra-team result matching (part 6/7) (#6436) (details / githubweb)
  88. Fix race condition in functor_vec_scan_ret_val test (details / githubweb)
  89. Fix parallel_scan_with_reducers test (details / githubweb)
  90. team-level stdalgos: improve tests, check intra-team result matching (part 3/7) (#6425) (details / githubweb)
  91. improve tests (#6432) (details / githubweb)
  92. improve tests (#6437) (details / githubweb)
  93. Move final assignment to correct scope (details / githubweb)
  94. fix casting warning in Random test (details / githubweb)
  95. Workaround for ROCm 5.6+ failing to compile with AVX2 SIMD support (#6449) (details / githubweb)
  96. HIP: Restrict AVX2 workaround to ROCm 5.6 and 5.7 (details / githubweb)
  97. fixes build error for TeamReduce and TeamTranformReduced tests for specific GCC (#6459) (details / githubweb)
  98. improve tests to check intra-team result (#6431) (details / githubweb)
  99. SIMD: Math functions should be in namespace Kokkos (details / githubweb)
  100. SYCL: Disable another bessel function test for Intel GPUs (details / githubweb)
  101. team-level std algos: part 10 (#6256) (details / githubweb)
  102. team-level std algos: part 11 (#6258) (details / githubweb)
  103. #5635: HIP: Add Overloads for parallel_scan with return value for TeamThreadRange (#6302) (details / githubweb)
  104. #5635: Add parallel_scan overloads with value for Threads (details / githubweb)
  105. #5635: Move some tests for parallel_scan to TestTeamScan (details / githubweb)
  106. use shortcut (details / githubweb)
  107. Skip testing for non-power-of-two team sizes (details / githubweb)
  108. Fix formatting (details / githubweb)
  109. Add parallel_scan overloads with value for HIP backend (details / githubweb)
  110. Use std::is_same_v (details / githubweb)
  111. #5635: Move some tests for parallel_scan to TestTeamScan (details / githubweb)
  112. #5635: SYCL: Add parallel_scan overload with return value (details / githubweb)
  113. Use std::is_same_v (details / githubweb)
  114. OpenMP: Fix TeamThreadRange parallel_scan with return value for team_size > 1 (details / githubweb)
  115. Add compatible copy assignment operator to DualView (details / githubweb)
  116. Fix minimum version for Google benchmark (details / githubweb)
  117. Add test of copy constructor/assignment operator for DualView. (details / githubweb)
  118. Compute concurrency on HIP using Kokkos hardcoded m_maxWavesPerCU (details / githubweb)
  119. Modify test so that source and destination view are of different type (details / githubweb)
  120. Use call operator instead of run_me function (details / githubweb)
  121. team-level std algos: part 12 (#6350) (details / githubweb)
  122. core/src: Add half single and double mixed compare (LT,GT,LE,GE) (#6407) (details / githubweb)
  123. Moving abort and assert into their own public headers (#6445) (details / githubweb)
  124. Add test for parallel_scan with return value for ThreadVectorRange (details / githubweb)
  125. Add parallel_scan overloads with value for Threads (details / githubweb)
  126. Allow detecting SIMD types based on compiler macros  (#6188) (details / githubweb)
  127. Add KOKKOS_ARCH_ARM_NEON (details / githubweb)
  128. Fix implementation for cyl_bessel_i0 (details / githubweb)
  129. cleaning: remove iostream from headers where possible (IWYU) (details / githubweb)
  130. Fix compiling SIMD unit tests on NVIDIA (details / githubweb)
  131. guards to ensure DBL_EPSILON return for POWER8,9 (details / githubweb)
  132. Fix NVCC warnings (#6483) (details / githubweb)
  133. team-level std algos: part 13 (#6351) (details / githubweb)
  134. Also fix annotations for generator constructor for AVX512 and NEON (details / githubweb)
  135. #5635: SYCL: Add parallel_scan overload with value for ThreadVectorRange (details / githubweb)
  136. Fix atomic operations bug for Min and Max (#6435) (details / githubweb)
  137. Fix example/build_cmake_installed_different_compiler (details / githubweb)
  138. fix impl (details / githubweb)
  139. Update core/src/HIP/Kokkos_HIP_KernelLaunch.hpp (details / githubweb)
  140. Split Kokkos_Threads_Parallel files (details / githubweb)
  141. HPX: Implement TeamThread and ThreadVector parallel_scan with return value (details / githubweb)
  142. Serial: Allow for distinct execution space instances (#6441) (details / githubweb)
  143. add overload for TeamThreadRange (details / githubweb)
  144. address review comment (details / githubweb)
  145. Update to HIP TeamPolicy Block number heuristic (#6284) (details / githubweb)
  146. SIMD: Split math functions from SIMD_Common.hpp (#6487) (details / githubweb)
  147. Allow NVHPC as device compiler only with Kokkos_ENABLE_IMPL_NVHPC_AS_DEVICE_COMPILER=ON (details / githubweb)
  148. ad threadvector (details / githubweb)
  149. fix order (details / githubweb)
  150. remove guards (details / githubweb)
  151. UnorderedMap(space instance): proposal for #6067 (details / githubweb)
  152. Rename Kokkos_ThreadsExec to align with the other backends (details / githubweb)
  153. Promote Kokkos_Printf.hpp to public include (details / githubweb)
  154. simd: add floor, ceil, round, trunc operations (#6393) (details / githubweb)
  155. Update CI in OpenMPTarget to use llvm-17  (#6472) (details / githubweb)
  156. Rename Kokkos_ThreadsTeam.hpp to Kokkos_Threads_Team.hpp (details / githubweb)
  157. Kokkos_SIMD_Scalar.hpp: remove extra ';' (details / githubweb)
  158. add guards (details / githubweb)
  159. avoid auto (details / githubweb)
  160. [ci skip] Update Kokkos version to 4.2.99 (details / githubweb)
  161. [deprecated code 3] remove all default device init tests (details / githubweb)
  162. [deprecated code 3] remove InitArguments (details / githubweb)
  163. [deprecated code 3] remove KOKKOS_ACTIVE_EXECUTION_MEMORY_SPACE_* macros (details / githubweb)
  164. [deprecated code 3] remove using declaration in Kokkos::Experimental:: for clamp, min, max, and minmax (details / githubweb)
  165. [deprecated code 3] remove using declaration in Kokkos::Experimental:: for all math functions (details / githubweb)
  166. [deprecated code 3] remove using declaration in Kokkos::Experimental:: for all math constants (details / githubweb)
  167. [deprecated code 3] remove {OpenMP,HPX}::partition_master (details / githubweb)
  168. OpenMP backend cleanup following removal of deprecated code 3 (details / githubweb)
  169. Get rid of FIXME_OPENMP (details / githubweb)
  170. [deprecated code 3] remove MasterLock (details / githubweb)
  171. [deprecated code 3] remove ENABLE_DEPRECATED_CODE_3 option (details / githubweb)
  172. fixup! [deprecated code 3] remove using declaration in Kokkos::Experimental:: for all math functions (details / githubweb)
  173. Do not append " - blocks" to the bitset label (details / githubweb)
  174. with_updated_label -> append_to_label (details / githubweb)
  175. SYCL: Use SYCL_EXT_ONEAPI_DEVICE_GLOBAL to detect support for device global variables (details / githubweb)
  176. Fixup in README (github -> GitHub) (details / githubweb)
  177. Threads: replace enum with constexpr int and enum class (#6514) (details / githubweb)
  178. Added unit tests for reduction ops and few intel svml intrinsics (details / githubweb)
  179. Remove sleep and wake functions (details / githubweb)
  180. Prefer defaulted default constructor for Bitset (#6524) (details / githubweb)
  181. SYCL: Use host-pinned memory to copy reduction/scan result (#6500) (details / githubweb)
  182. Remove unused variables (details / githubweb)
  183. Remove Sentinel struct from Threads (details / githubweb)
  184. Small cleanup of ThreadsInternal::initialize (details / githubweb)
  185. Remove useless forward declaration (details / githubweb)
  186. Remove spawn function (details / githubweb)
  187. Add comments (details / githubweb)
  188. Fix indentation (details / githubweb)
  189. Fix typo in macro guard (details / githubweb)
  190. Reduce number of View constructor instantiations (details / githubweb)
  191. Bump HPX version used in CI to 1.9.0 (details / githubweb)
  192. Split files in HIP backend (details / githubweb)
  193. Trim some fat in `CudaInternal` (towards multiple GPUs support) (#6544) (details / githubweb)
  194. Only define STDALGO_TEAM_SOURCES_* once (details / githubweb)
  195. Rollback changes to view constructors to reduce the number of instantiations (#6564) (details / githubweb)
  196. Threads remove unused variables and functions (#6566) (details / githubweb)
  197. Remove unused Sandia testing files (#6568) (details / githubweb)
  198. fallback implementation cleanup (details / githubweb)
  199. Remove empty quotation marks for static_assert (details / githubweb)
  200. [ci skip] Drop unused <impl/Kokkos_Memory_Fence.hpp> header (details / githubweb)
  201. Drop Clang+CUDA workaround (details / githubweb)
  202. OpenMPTarget: CI compiler upgrade. (#6545) (details / githubweb)
  203. Add crtrott's launch_latency benchmark (#6379) (details / githubweb)
  204. Simplify fence functions in the Threads backend (#6571) (details / githubweb)
  205. simd: temporarily skip device math ops unit test for OpenMPTarget build (#6574) (details / githubweb)
  206. OpenMPTarget: Guard scratch memory usage in ParallelReduce (details / githubweb)
  207. SYCL: Implement DESUL_ATOMICS_ENABLE_SYCL_SEPARABLE_COMPILATION path (#6534) (details / githubweb)
  208. Desul atomics: Trade SYCL-specific compile definition for a macro defintion in the configuration header (details / githubweb)
  209. Replace Marsaglia polar method with Box-muller to generate a normally distributed random number (#6556) (details / githubweb)
  210. OpenMP: No memset in viewfill (#6573) (details / githubweb)
  211. Revert "Desul atomics: Trade SYCL-specific compile definition for a macro defintion in the configuration header" (details / githubweb)
  212. OpenACC: add atomics support (#6446) (details / githubweb)
  213. Fix infinity, quiet_NaN, signaling_Nan, isfinite, isnan, isinf for half_t and bhalf_t (#6543) (details / githubweb)
  214. kokkos(unique): fix allocation of temporary view to enfore using the provided space instance (details / githubweb)
  215. Use binary wrapper for consistency in definition of half types numeric traits (#6590) (details / githubweb)
  216. try fix (details / githubweb)
  217. Fix TestNumericTriats.hpp for SYCL with bfloat16 support (details / githubweb)
  218. Fix generated Makefile when using gnu_generate_makefile.sh and make >= 4.3 (details / githubweb)
  219. Threads: add missing broadcast to TeamThreadRange parallel_scan (#6601) (details / githubweb)
  220. Added missing operator* to NEON simd (details / githubweb)
  221. [ci skip] Update changelog on develop for 4.2.00 (#6592) (details / githubweb)
  222. Remove KOKKOS_IMPL_DO_NOT_USE_PRINTF  (#6593) (details / githubweb)
  223. try fix (details / githubweb)
  224. avoid pyt package (details / githubweb)
  225. try (details / githubweb)
  226. fix for macos (details / githubweb)
  227. remove comments (details / githubweb)
  228. use reference (details / githubweb)
  229. add branching (details / githubweb)
  230. [ci skip] fix formatting (details / githubweb)
  231. GitHub Workflows: Use Ubuntu 22.04 instead of Fedora for Intel compiler testing (details / githubweb)
  232. nvcc(wrapper): adding missing `--generate-line-info` arg (details / githubweb)
  233. Add clang-format check to GitHub workflows (#6612) (details / githubweb)
  234. graph(HIP): adding inline keyword to fix #6623 (details / githubweb)
  235. Add jenkins multibranch pipeline options (details / githubweb)
  236. kokkos(profiling): do not finalize in any backend (details / githubweb)
  237. Replace ubuntu:18.04 with ubuntu:20.04 as base image for clang-format (details / githubweb)
  238. Disabling OpenACC in the CI because it emits too many warnings (details / githubweb)
  239. unorderedmap: modernize traits (details / githubweb)
  240. nvcc wrapper: remove troubling flag to fix 6628 (#6629) (details / githubweb)
Changes
  1. implementation and tests (details / githubweb)
  2. implementation and tests (details / githubweb)
  3. #5635: Add parallel_scan overload with return value for ThreadVectorRange (details / githubweb)
  4. #5635: Add test for parallel_scan with return value for ThreadVectorRange (details / githubweb)
  5. address reviews [skip ci] (details / githubweb)
  6. formatting (details / githubweb)
  7. fix lambda capture (details / githubweb)
  8. remove unnecessary file, fix constraints (details / githubweb)
  9. remove file (details / githubweb)
  10. fix (details / githubweb)
  11. fix lambda capture and constraints (details / githubweb)
  12. keep only subset (details / githubweb)
  13. revert files (details / githubweb)
  14. remove file (details / githubweb)
  15. fix syntax (details / githubweb)
  16. format (details / githubweb)
  17. Add support for HIP Graph (details / githubweb)
  18. Do not use HIP Graph with ROCm 5.2 (details / githubweb)
  19. Fix reviewer's comments (details / githubweb)
  20. Adding is_scoped_enum & to_underlying (#6356) (details / githubweb)
  21. SIMD: add float simd support (#6177) (details / githubweb)
  22. team-level std algos: part 4 (#6208) (details / githubweb)
  23. Added a gen ctor for float (#6397) (details / githubweb)
  24. team-level std algos: part 5 (#6209) (details / githubweb)
  25. Deprecate Cuda(cudaStream_t, bool) (details / githubweb)
  26. Fixup checked interger operations death test (details / githubweb)
  27. Deprecate HIP(hipStream_t, bool) (details / githubweb)
  28. Let Kokkos_ENABLE_IMPL_CUDA_MALLOC_ASYNC be ON by default (details / githubweb)
  29. Print whether KOKKOS_ENABLE_IMPL_CUDA_MALLOC_ASYNC is defined (details / githubweb)
  30. Introduce disable_malloc_async Cuda option with generated makefiles (details / githubweb)
  31. Preserve one build that disables Kokkos_ENABLE_IMPL_CUDA_MALLOC_ASYNC (details / githubweb)
  32. Use archive extraction time for timestamps (details / githubweb)
  33. Disable performance benchmarks in AppVeyor CI (details / githubweb)
  34. team-level std algos: part 6 (#6210) (details / githubweb)
  35. address comments (details / githubweb)
  36. OpenMP backend refactor files. (#6403) (details / githubweb)
  37. Drop (unused) `Cuda::cuda_internal_maximum_shared_words` (details / githubweb)
  38. Drop check that the host backend is initialized before the Cuda/HIP/SYCL one (details / githubweb)
  39. Drop unused HIPInternal::m_maxSharedWords data member (details / githubweb)
  40. Drop unused HIPInternal::m_hipArch static data member (details / githubweb)
  41. !initialized() should be a precondition for calling {Cuda,HIP,SYCL}Internal::initialize (details / githubweb)
  42. Drop pre-Kepler logic in Cuda::impl_initialize (details / githubweb)
  43. use single (details / githubweb)
  44. address comments (details / githubweb)
  45. formatting (details / githubweb)
  46. Team-level std algos: part 7 (#6211) (details / githubweb)
  47. formatting (details / githubweb)
  48. Enable death tests for fedora rawhide (details / githubweb)
  49. core/src: Add half math functions to private header (#6124) (details / githubweb)
  50. Drop check whether device supports unified addressing (details / githubweb)
  51. fix single as per Christian's suggestion (details / githubweb)
  52. Only warn once (at initialization) when forcing allocation in unified memory (details / githubweb)
  53. check-copyright improvements (#6399) (details / githubweb)
  54. Use execution space instance argument to get device properties in block size deduction (details / githubweb)
  55. Address reviewer' comments (details / githubweb)
  56. Fix to avoid #186-D pointless comparison warning. (details / githubweb)
  57. add comment (details / githubweb)
  58. improve tests to address review (details / githubweb)
  59. Fix guard for isnan test for bhalf_t (details / githubweb)
  60. avoid potential race condition HIP (details / githubweb)
  61. [SYCL][Reduction] Group counter should use at least memory_order::acq_rel (details / githubweb)
  62. Same for scan (details / githubweb)
  63. Initialize m_num_scratch_locks for Cuda parallel_for TeamPolicy (details / githubweb)
  64. improve tests with intra-team result check (details / githubweb)
  65. Fixes for Kokkos::Array (#6372) (details / githubweb)
  66. try fix for unique, previous impl to remove later (details / githubweb)
  67. #5635: Add parallel_scan changes for CUDA and TeamThreadRange (details / githubweb)
  68. remove old impl (details / githubweb)
  69. #5635: Enable TeamThreadRange test for CUDA (details / githubweb)
  70. Clean up benchmarks/gups (details / githubweb)
  71. benchmark/gups: use CMake (details / githubweb)
  72. OpenMPTarget: Disable check for SIMD compiler macros (details / githubweb)
  73. #5635: Add parallel_scan with value for CUDA and ThreadVectorRange (details / githubweb)
  74. add missing assert (details / githubweb)
  75. #5635: Add test for parallel_scan with return value for ThreadVectorRange (details / githubweb)
  76. add intra team check for missing test (details / githubweb)
  77. fix intel compile error (details / githubweb)
  78. fix unreachable for intel (details / githubweb)
  79. re-enable unit tests for sort and random via makefile (#6422) (details / githubweb)
  80. OpenMPTarget init-join fix (#6444) (details / githubweb)
  81. Fix Cuda parallel_scan ThreadVectorRange range (details / githubweb)
  82. Assign final sum in Cuda parallel_scan ThreadVectorRange (details / githubweb)
  83. Fix compiling code using Kokkos::printf for OpenMPTarget on Intel GPUs (#6443) (details / githubweb)
  84. std_algos: improving min, max, minmax (#6421) (details / githubweb)
  85. team-level stdalgos: improve tests, check intra-team result matching (part 2/7) (#6426) (details / githubweb)
  86. Skip bessel function tests known to fail on Intel GPUs (#6434) (details / githubweb)
  87. team-level stdalgos: improve tests, check intra-team result matching (part 6/7) (#6436) (details / githubweb)
  88. Fix race condition in functor_vec_scan_ret_val test (details / githubweb)
  89. Fix parallel_scan_with_reducers test (details / githubweb)
  90. team-level stdalgos: improve tests, check intra-team result matching (part 3/7) (#6425) (details / githubweb)
  91. improve tests (#6432) (details / githubweb)
  92. improve tests (#6437) (details / githubweb)
  93. Move final assignment to correct scope (details / githubweb)
  94. fix casting warning in Random test (details / githubweb)
  95. Workaround for ROCm 5.6+ failing to compile with AVX2 SIMD support (#6449) (details / githubweb)
  96. HIP: Restrict AVX2 workaround to ROCm 5.6 and 5.7 (details / githubweb)
  97. fixes build error for TeamReduce and TeamTranformReduced tests for specific GCC (#6459) (details / githubweb)
  98. improve tests to check intra-team result (#6431) (details / githubweb)
  99. SIMD: Math functions should be in namespace Kokkos (details / githubweb)
  100. SYCL: Disable another bessel function test for Intel GPUs (details / githubweb)
  101. team-level std algos: part 10 (#6256) (details / githubweb)
  102. team-level std algos: part 11 (#6258) (details / githubweb)
  103. #5635: HIP: Add Overloads for parallel_scan with return value for TeamThreadRange (#6302) (details / githubweb)
  104. #5635: Add parallel_scan overloads with value for Threads (details / githubweb)
  105. #5635: Move some tests for parallel_scan to TestTeamScan (details / githubweb)
  106. use shortcut (details / githubweb)
  107. Skip testing for non-power-of-two team sizes (details / githubweb)
  108. Fix formatting (details / githubweb)
  109. Add parallel_scan overloads with value for HIP backend (details / githubweb)
  110. Use std::is_same_v (details / githubweb)
  111. #5635: Move some tests for parallel_scan to TestTeamScan (details / githubweb)
  112. #5635: SYCL: Add parallel_scan overload with return value (details / githubweb)
  113. Use std::is_same_v (details / githubweb)
  114. OpenMP: Fix TeamThreadRange parallel_scan with return value for team_size > 1 (details / githubweb)
  115. Add compatible copy assignment operator to DualView (details / githubweb)
  116. Fix minimum version for Google benchmark (details / githubweb)
  117. Add test of copy constructor/assignment operator for DualView. (details / githubweb)
  118. Compute concurrency on HIP using Kokkos hardcoded m_maxWavesPerCU (details / githubweb)
  119. Modify test so that source and destination view are of different type (details / githubweb)
  120. Use call operator instead of run_me function (details / githubweb)
  121. team-level std algos: part 12 (#6350) (details / githubweb)
  122. core/src: Add half single and double mixed compare (LT,GT,LE,GE) (#6407) (details / githubweb)
  123. Moving abort and assert into their own public headers (#6445) (details / githubweb)
  124. Add test for parallel_scan with return value for ThreadVectorRange (details / githubweb)
  125. Add parallel_scan overloads with value for Threads (details / githubweb)
  126. Allow detecting SIMD types based on compiler macros  (#6188) (details / githubweb)
  127. Add KOKKOS_ARCH_ARM_NEON (details / githubweb)
  128. Fix implementation for cyl_bessel_i0 (details / githubweb)
  129. cleaning: remove iostream from headers where possible (IWYU) (details / githubweb)
  130. Fix compiling SIMD unit tests on NVIDIA (details / githubweb)
  131. guards to ensure DBL_EPSILON return for POWER8,9 (details / githubweb)
  132. Fix NVCC warnings (#6483) (details / githubweb)
  133. team-level std algos: part 13 (#6351) (details / githubweb)
  134. Also fix annotations for generator constructor for AVX512 and NEON (details / githubweb)
  135. #5635: SYCL: Add parallel_scan overload with value for ThreadVectorRange (details / githubweb)
  136. Fix atomic operations bug for Min and Max (#6435) (details / githubweb)
  137. Fix example/build_cmake_installed_different_compiler (details / githubweb)
  138. fix impl (details / githubweb)
  139. Update core/src/HIP/Kokkos_HIP_KernelLaunch.hpp (details / githubweb)
  140. Split Kokkos_Threads_Parallel files (details / githubweb)
  141. HPX: Implement TeamThread and ThreadVector parallel_scan with return value (details / githubweb)
  142. Serial: Allow for distinct execution space instances (#6441) (details / githubweb)
  143. add overload for TeamThreadRange (details / githubweb)
  144. address review comment (details / githubweb)
  145. Update to HIP TeamPolicy Block number heuristic (#6284) (details / githubweb)
  146. SIMD: Split math functions from SIMD_Common.hpp (#6487) (details / githubweb)
  147. Allow NVHPC as device compiler only with Kokkos_ENABLE_IMPL_NVHPC_AS_DEVICE_COMPILER=ON (details / githubweb)
  148. ad threadvector (details / githubweb)
  149. fix order (details / githubweb)
  150. remove guards (details / githubweb)
  151. UnorderedMap(space instance): proposal for #6067 (details / githubweb)
  152. Rename Kokkos_ThreadsExec to align with the other backends (details / githubweb)
  153. Promote Kokkos_Printf.hpp to public include (details / githubweb)
  154. simd: add floor, ceil, round, trunc operations (#6393) (details / githubweb)
  155. Update CI in OpenMPTarget to use llvm-17  (#6472) (details / githubweb)
  156. Rename Kokkos_ThreadsTeam.hpp to Kokkos_Threads_Team.hpp (details / githubweb)
  157. Kokkos_SIMD_Scalar.hpp: remove extra ';' (details / githubweb)
  158. add guards (details / githubweb)
  159. avoid auto (details / githubweb)
  160. [ci skip] Update Kokkos version to 4.2.99 (details / githubweb)
  161. [deprecated code 3] remove all default device init tests (details / githubweb)
  162. [deprecated code 3] remove InitArguments (details / githubweb)
  163. [deprecated code 3] remove KOKKOS_ACTIVE_EXECUTION_MEMORY_SPACE_* macros (details / githubweb)
  164. [deprecated code 3] remove using declaration in Kokkos::Experimental:: for clamp, min, max, and minmax (details / githubweb)
  165. [deprecated code 3] remove using declaration in Kokkos::Experimental:: for all math functions (details / githubweb)
  166. [deprecated code 3] remove using declaration in Kokkos::Experimental:: for all math constants (details / githubweb)
  167. [deprecated code 3] remove {OpenMP,HPX}::partition_master (details / githubweb)
  168. OpenMP backend cleanup following removal of deprecated code 3 (details / githubweb)
  169. Get rid of FIXME_OPENMP (details / githubweb)
  170. [deprecated code 3] remove MasterLock (details / githubweb)
  171. [deprecated code 3] remove ENABLE_DEPRECATED_CODE_3 option (details / githubweb)
  172. fixup! [deprecated code 3] remove using declaration in Kokkos::Experimental:: for all math functions (details / githubweb)
  173. Do not append " - blocks" to the bitset label (details / githubweb)
  174. with_updated_label -> append_to_label (details / githubweb)
  175. SYCL: Use SYCL_EXT_ONEAPI_DEVICE_GLOBAL to detect support for device global variables (details / githubweb)
  176. Fixup in README (github -> GitHub) (details / githubweb)
  177. Threads: replace enum with constexpr int and enum class (#6514) (details / githubweb)
  178. Added unit tests for reduction ops and few intel svml intrinsics (details / githubweb)
  179. Remove sleep and wake functions (details / githubweb)
  180. Prefer defaulted default constructor for Bitset (#6524) (details / githubweb)
  181. SYCL: Use host-pinned memory to copy reduction/scan result (#6500) (details / githubweb)
  182. Remove unused variables (details / githubweb)
  183. Remove Sentinel struct from Threads (details / githubweb)
  184. Small cleanup of ThreadsInternal::initialize (details / githubweb)
  185. Remove useless forward declaration (details / githubweb)
  186. Remove spawn function (details / githubweb)
  187. Add comments (details / githubweb)
  188. Fix indentation (details / githubweb)
  189. Fix typo in macro guard (details / githubweb)
  190. Reduce number of View constructor instantiations (details / githubweb)
  191. Bump HPX version used in CI to 1.9.0 (details / githubweb)
  192. Split files in HIP backend (details / githubweb)
  193. Trim some fat in `CudaInternal` (towards multiple GPUs support) (#6544) (details / githubweb)
  194. Only define STDALGO_TEAM_SOURCES_* once (details / githubweb)
  195. Rollback changes to view constructors to reduce the number of instantiations (#6564) (details / githubweb)
  196. Threads remove unused variables and functions (#6566) (details / githubweb)
  197. Remove unused Sandia testing files (#6568) (details / githubweb)
  198. fallback implementation cleanup (details / githubweb)
  199. Remove empty quotation marks for static_assert (details / githubweb)
  200. [ci skip] Drop unused <impl/Kokkos_Memory_Fence.hpp> header (details / githubweb)
  201. Drop Clang+CUDA workaround (details / githubweb)
  202. OpenMPTarget: CI compiler upgrade. (#6545) (details / githubweb)
  203. Add crtrott's launch_latency benchmark (#6379) (details / githubweb)
  204. Simplify fence functions in the Threads backend (#6571) (details / githubweb)
  205. simd: temporarily skip device math ops unit test for OpenMPTarget build (#6574) (details / githubweb)
  206. OpenMPTarget: Guard scratch memory usage in ParallelReduce (details / githubweb)
  207. SYCL: Implement DESUL_ATOMICS_ENABLE_SYCL_SEPARABLE_COMPILATION path (#6534) (details / githubweb)
  208. Desul atomics: Trade SYCL-specific compile definition for a macro defintion in the configuration header (details / githubweb)
  209. Replace Marsaglia polar method with Box-muller to generate a normally distributed random number (#6556) (details / githubweb)
  210. OpenMP: No memset in viewfill (#6573) (details / githubweb)
  211. Revert "Desul atomics: Trade SYCL-specific compile definition for a macro defintion in the configuration header" (details / githubweb)
  212. OpenACC: add atomics support (#6446) (details / githubweb)
  213. Fix infinity, quiet_NaN, signaling_Nan, isfinite, isnan, isinf for half_t and bhalf_t (#6543) (details / githubweb)
  214. kokkos(unique): fix allocation of temporary view to enfore using the provided space instance (details / githubweb)
  215. Use binary wrapper for consistency in definition of half types numeric traits (#6590) (details / githubweb)
  216. try fix (details / githubweb)
  217. Fix TestNumericTriats.hpp for SYCL with bfloat16 support (details / githubweb)
  218. Fix generated Makefile when using gnu_generate_makefile.sh and make >= 4.3 (details / githubweb)
  219. Threads: add missing broadcast to TeamThreadRange parallel_scan (#6601) (details / githubweb)
  220. Added missing operator* to NEON simd (details / githubweb)
  221. [ci skip] Update changelog on develop for 4.2.00 (#6592) (details / githubweb)
  222. Remove KOKKOS_IMPL_DO_NOT_USE_PRINTF  (#6593) (details / githubweb)
  223. try fix (details / githubweb)
  224. avoid pyt package (details / githubweb)
  225. try (details / githubweb)
  226. fix for macos (details / githubweb)
  227. remove comments (details / githubweb)
  228. use reference (details / githubweb)
  229. add branching (details / githubweb)
  230. [ci skip] fix formatting (details / githubweb)
  231. GitHub Workflows: Use Ubuntu 22.04 instead of Fedora for Intel compiler testing (details / githubweb)
  232. nvcc(wrapper): adding missing `--generate-line-info` arg (details / githubweb)
  233. Add clang-format check to GitHub workflows (#6612) (details / githubweb)
  234. graph(HIP): adding inline keyword to fix #6623 (details / githubweb)
  235. Add jenkins multibranch pipeline options (details / githubweb)
  236. kokkos(profiling): do not finalize in any backend (details / githubweb)
  237. Replace ubuntu:18.04 with ubuntu:20.04 as base image for clang-format (details / githubweb)
  238. Disabling OpenACC in the CI because it emits too many warnings (details / githubweb)
  239. unorderedmap: modernize traits (details / githubweb)
  240. nvcc wrapper: remove troubling flag to fix 6628 (#6629) (details / githubweb)

Branch indexing

This run spent:

  • 16 hr waiting;
  • 6 hr 0 min build duration;
  • 6 hr 0 min total from scheduled to completion.
Revision: ec485624db9cfc2bf82f7e00f288ae9db610cd79
Repository: https://github.com/kokkos/kokkos.git
  • PR-6117
Revision: 0771bec93b17c2e2134203b96f979e676e3da258
Repository: https://github.com/kokkos/kokkos.git
  • PR-6117

Timeout has been exceeded