Started 3 mo 11 days ago
Took 14 min
Failed

Build #6 (Jan 29, 2024, 2:07:56 PM)

Changes
  1. Introduce constructor for multi-GPU support. (details / githubweb)
  2. Fix typo (details / githubweb)
  3. Add test (details / githubweb)
  4. Fix typo. (details / githubweb)
  5. Explicitly check for valid device id (details / githubweb)
  6. Set the device id in cuda_kernel_arch (details / githubweb)
  7. Check for default device (details / githubweb)
  8. Check that device associated with stream matches requested device (details / githubweb)
  9. Remove extra constructor (details / githubweb)
  10. Remove logical memory spaces (details / githubweb)
  11. Address reviewer comments (details / githubweb)
  12. m_cudaDev isn't static anymore (details / githubweb)
  13. Set the device id explicitly for CUDA API calls in impl_initialize (details / githubweb)
  14. Add warp sync for Cuda parallel reduce (details / githubweb)
  15. update comment to include final() mention (details / githubweb)
  16. This PR fixes the too-much-OpenACC-warning issue, mentioned in PR #6639. (details / githubweb)
  17. add missing header fix #6644 (details / githubweb)
  18. Fixup test math functions ulp should double -> int (details / githubweb)
  19. Drop DualView converting copy assignment operator (details / githubweb)
  20. Don't use rocm-docker for clang-format (details / githubweb)
  21. Disable HIP CI (details / githubweb)
  22. Remove deprecation warning for AllocationMechanism for gcc <11.0 (details / githubweb)
  23. OpenMPTarget: clang extensions for dynamic shared memory. (#6380) (details / githubweb)
  24. Fix builtin_unreachable use for MSVC/CUDA (details / githubweb)
  25. Fix missing include on msvc/cuda (details / githubweb)
  26. Avoid lambdas in constexpr branch for msvc/cuda (details / githubweb)
  27. Sidestep lacking CTAD support msvc/cuda (details / githubweb)
  28. Fix formatting (details / githubweb)
  29. Move header for Damien because he is right (details / githubweb)
  30. Unit test for issue 3371 (negative vector length should not yield a negative max_team_size) (#6076) (details / githubweb)
  31. Add CMakeLists.txt for stream benchmark (details / githubweb)
  32. Do not negate the dependent true traits helper (details / githubweb)
  33. Drop guards to accommodate external code defining KOKKOS_ASSERT (details / githubweb)
  34. Use omp_get_max_active_levels() when supported (details / githubweb)
  35. Add missing gfx940 (details / githubweb)
  36. Add Impl::always_false type-dependent false trait (details / githubweb)
  37. Per review prefer always_false<Arg>::value to is_void_v<Arg> (details / githubweb)
  38. Improve "no copy mechanism" exception message (details / githubweb)
  39. Add a unit test for new deep_copy exception msg (details / githubweb)
  40. Add missing include sstream (details / githubweb)
  41. src->source, dst->destination (details / githubweb)
  42. Workaround for ROCm 6.0 failing to compile with AVX2 SIMD support (details / githubweb)
  43. SYCL: Force inlining of Kokkos::printf (#6650) (details / githubweb)
  44. Improve handling of printf in OMPT on Intel GPUs (details / githubweb)
  45. OpenMP: Use `omp_get_nested` for older gcc versions (#6685) (details / githubweb)
  46. Disable more Bessel tests for SYCL on Intel GPUs (details / githubweb)
  47. fill_random without execution space instance should fence (details / githubweb)
  48. Drop unnecessary guarding for a tool library being loaded in ProfilingSection (details / githubweb)
  49. Drop unnecessary header include in Kokkos_Profiling_ProfileSection.hpp (details / githubweb)
  50. #5333: CUDA: Use scratch space appropriate to small reduction elements in Team reductions (#5334) (details / githubweb)
  51. Cuda: Allocate using the correct device (#6392) (details / githubweb)
  52. Let `Profiling::ProfilingSection(std::string)` constructor be explicit and nodiscard (#6690) (details / githubweb)
  53. Cosmetic changes to ProfilingSection (details / githubweb)
  54. GitHub CI: Test with AddressSanitizer (#6676) (details / githubweb)
  55. Kokkos::Array deduction guide (#6373) (details / githubweb)
  56. Add CI for MSVC+Cuda (#6661) (details / githubweb)
  57. SYCL: Address deprecations after oneAPI 2023.2.0 (#6577) (details / githubweb)
  58. Fixup cast tolerance to double before printing (details / githubweb)
  59. Try linking against CUDA libraries even with KOKKOS_ENABLE_COMPILE_AS_CMAKE_LANGUAGE (details / githubweb)
  60. test_array_ctad: disable test for intel versions < 2021 (details / githubweb)
  61. Enable `{transform_}exclusive_scan` in place (#6667) (details / githubweb)
  62. Add `ATOMICS_BYPASS` configuration option to disable atomics (#6692) (details / githubweb)
  63. Check matching static extents in View constructor (#5190) (details / githubweb)
  64. Remove Kokkos::[b]half_t volatile overloads (#6579) (details / githubweb)
  65. add tests (details / githubweb)
  66. Provide `kokkos_swap` as part of Core and deprecate `Experimental::swap` in Algorithms (#6697) (details / githubweb)
  67. Provide new public headers `<Kokkos_Clamp.hpp>` and `<Kokkos_MinMax.hpp>` (#6687) (details / githubweb)
  68. Fix TeamThreadMDRange parallel_reduce (#6511) (details / githubweb)
  69. add tests for in-place `inclusive_scan` (#6682) (details / githubweb)
  70. Drop pointless Kokkos::Impl::CudaExec forward declaration (details / githubweb)
  71. Don't use the compiler launcher script if the compile language is CUDA. (#6704) (details / githubweb)
  72. Deprecate `{Cuda,HIP}::detect_device_count()` and `Cuda::[detect_]device_arch()` (#6710) (details / githubweb)
  73. Get rid of CudaInternal::cuda_get_error_{name,string}_wrapper (details / githubweb)
  74. No need to jump through so many hoops to print the error message (details / githubweb)
  75. HIP: Forgot to delete matching brace closing the namespace (details / githubweb)
  76. Make initialize and finalize of the Cuda/HIP singleton less special (#6714) (details / githubweb)
  77. Kokkos_HIP.cpp: include Kokkos_Core.hpp to resolve errors (details / githubweb)
  78. Add bound checks in RangePolicy and MDRangePolicy (#6617) (details / githubweb)
  79. Temporary fix to reenable HIP CI (details / githubweb)
  80. Let the smart pointer manage the CUDA/HIP stream (#6721) (details / githubweb)
  81. Fix Docker env variables (details / githubweb)
  82. Ensure view_allocation_error does not silently ignore that no exception was thrown (details / githubweb)
  83. Add RawMemoryAllocationFailure::AllocationMechanism::OpenACCMalloc enumerator (details / githubweb)
  84. [OpenACC] throw if acc_malloc returned nullptr (details / githubweb)
  85. Fixup using declaration (details / githubweb)
  86. Disable openacc.view_allocation_error test (details / githubweb)
  87. Guard `[MD]RangePolicy` precondition check for deprecated code 4 (#6726) (details / githubweb)
  88. Add C++26 standard to CMake Setup (details / githubweb)
  89. Add support for C++26 in generated makefiles (details / githubweb)
  90. Add KOKKOS_ENABLE_CXX26 to the configuration metadata (details / githubweb)
  91. Reenable HIP testing (details / githubweb)
  92. Disabling failing HIP test in the CI (details / githubweb)
  93. Use team_size_max to fix "Team size too large" error in reducer test (#6725) (details / githubweb)
  94. Don't touch my records! (refactor Cuda/HIP/SYCL/Threads to not directly mess with `SharedAllocationRecord`) (#6732) (details / githubweb)
  95. Untangle SharedAllocationRecord spaghetti code (details / githubweb)
  96. Trilinos: Don't let Kokkos set CMAKE_CXX_FLAGS (details / githubweb)
  97. Fix TestThreadVectorMDRangeParallelReduce (#6734) (details / githubweb)
  98. Cuda multi-GPU support: Allow execution space instance constructor to run (#6706) (details / githubweb)
  99. add more warp sync for cuda reductions (details / githubweb)
  100. Drop support for deprecated command-line arguments and environment variables (#6744) (details / githubweb)
  101. Avoid unnecessary zero-memset of the scratch flags in SYCL (#6739) (details / githubweb)
  102. std(remove-if): fixing tmp view alloc + avoid evaluating the predicate twice during final pass (details / githubweb)

Branch indexing

This run spent:

  • 13 min waiting;
  • 14 min build duration;
  • 14 min total from scheduled to completion.
Revision: d0fedabc39271a3424edbf450cde9d2b3584a97a
Repository: https://github.com/kokkos/kokkos.git
  • PR-6641