Skip to content
Failed

Changes

Summary

  1. Introduce constructor for multi-GPU support. (details)
  2. Fix typo (details)
  3. Add test (details)
  4. Fix typo. (details)
  5. Explicitly check for valid device id (details)
  6. Set the device id in cuda_kernel_arch (details)
  7. Check for default device (details)
  8. Check that device associated with stream matches requested device (details)
  9. Remove extra constructor (details)
  10. Remove logical memory spaces (details)
  11. Address reviewer comments (details)
  12. m_cudaDev isn't static anymore (details)
  13. Set the device id explicitly for CUDA API calls in impl_initialize (details)
  14. Add warp sync for Cuda parallel reduce (details)
  15. update comment to include final() mention (details)
  16. This PR fixes the too-much-OpenACC-warning issue, mentioned in PR #6639. (details)
  17. add missing header fix #6644 (details)
  18. Fixup test math functions ulp should double -> int (details)
  19. Drop DualView converting copy assignment operator (details)
  20. Don't use rocm-docker for clang-format (details)
  21. Diable HIP CI (details)
  22. Remove deprecation warning for AllocationMechanism for gcc <11.0 (details)
  23. OpenMPTarget: clang extensions for dynamic shared memory. (#6380) (details)
  24. Fix builtin_unreachable use for MSVC/CUDA (details)
  25. Fix missing include on msvc/cuda (details)
  26. Avoid lambdas in constexpr branch for msvc/cuda (details)
  27. Sidestep lacking CTAD support msvc/cuda (details)
  28. Fix formatting (details)
  29. Move header for Damien because he is right (details)
  30. Unit test for issue 3371 (negative vector length should not yield a negative max_team_size) (#6076) (details)
  31. Add CMakeLists.txt for stream benchmark (details)
  32. Do not negate the dependent true traits helper (details)
  33. Drop guards to accommodate external code defining KOKKOS_ASSERT (details)
  34. Use omp_get_max_active_levels() when supported (details)
  35. Add missing gfx940 (details)
  36. Add Impl::always_false type-dendent false trait (details)
  37. Per review prefer always_false<Arg>::value to is_void_v<Arg> (details)
  38. Improve "no copy mechanism" exception message (details)
  39. Add a unit test for new deep_copy exception msg (details)
  40. Add missing include sstream (details)
  41. src->source, dst->destination (details)
  42. Workaround for ROCm 6.0 failing to compile with AVX2 SIMD support (details)
  43. SYCL: Force inlining of Kokkos::printf (#6650) (details)
  44. Improve handling of printf in OMPT on Intel GPUs (details)
  45. OpenMP: Use `omp_get_nested` for older gcc versions (#6685) (details)
  46. Disable more Bessel tests for SYCL on INtel GPUs (details)
  47. fill_random without exceution space instance should fence (details)
  48. Drop unnecessary guarding for a tool library being loaded in ProfilingSection (details)
  49. Drop unnecessary header include in Kokkos_Profiling_ProfileSection.hpp (details)
  50. #5333: CUDA: Use scratch space appropriate to small reduction elements in Team reductions (#5334) (details)
  51. Cuda: Allocate using the correct device (#6392) (details)
  52. Let `Profiling::ProfilingSection(std::string)` constructor be explicit and nodiscard (#6690) (details)
  53. Cosmetic changes to ProfilingSection (details)
  54. GitHub CI: Test with AddressSanitizer (#6676) (details)
  55. Kokkos::Array deduction guide (#6373) (details)
  56. Add CI for MSVC+Cuda (#6661) (details)
  57. SYCL: Address deprecations after oneAPI 2023.2.0 (#6577) (details)
  58. Fixup cast tolerance to double before printing (details)
  59. Try linking against CUDA libararies even with KOKKOS_ENABLE_COMPILE_AS_CMAKE_LANGUAGE (details)
  60. test_array_ctad: disable test for intel versions < 2021 (details)
  61. Enable `{transform_}exclusive_scan` in place (#6667) (details)
  62. Add `ATOMICS_BYPASS` configuration option to disable atomics (#6692) (details)
  63. Check matching static extents in View constructor (#5190) (details)
  64. Remove Kokkos::[b]half_t volatile overloads (#6579) (details)
  65. add tests (details)
  66. Provide `kokkos_swap` as part of Core and deprecate `Experimental::swap` in Algorithms (#6697) (details)
  67. Provide new public headers `<Kokkos_Clamp.hpp>` and `<Kokkos_MinMax.hpp>` (#6687) (details)
  68. Fix TeamThreadMDRange parallel_reduce (#6511) (details)
  69. add tests for in-place `inclusive_scan` (#6682) (details)
  70. Drop pointless Kokkos::Impl::CudaExec forward declaration (details)
  71. Don't use the compiler launcher script if the compile language is CUDA. (#6704) (details)
  72. Deprecate `{Cuda,HIP}::detect_device_count()` and `Cuda::[detect_]device_arch()` (#6710) (details)
  73. Get rid of CudaInternal::cuda_get_error_{name,string}_wrapper (details)
  74. No need to jump through so many hoops to print the error message (details)
  75. HIP: Forgot to delete matching brace closing the namespace (details)
  76. Make initialize and finalize of the Cuda/HIP singleton less special (#6714) (details)
  77. Kokkos_HIP.cpp: include Kokkos_Core.hpp to resolve errors (details)
  78. Add bound checks in RangePolicy and MDRangePolicy (#6617) (details)
  79. Temporary fix to reenable HIP CI (details)
  80. Let the smart pointer manage the CUDA/HIP stream (#6721) (details)
  81. Fix Docker env variables (details)
  82. Ensure view_allocation_error does not silently ignore that no exception was thrown (details)
  83. Add RawMemoryAllocationFailure::AllocationMechanism::OpenACCMalloc enumerator (details)
  84. [OpenACC] throw if acc_malloc returned nullptr (details)
  85. Fixup using declaration (details)
  86. Disable openacc.view_allocation_error test (details)
  87. Guard `[MD]RangePolicy` precondition check for deprecated code 4 (#6726) (details)
  88. Add C++26 standard to CMake Setup (details)
  89. Add support for C++26 in generated makefiles (details)
  90. Add KOKKOS_ENABLE_CXX26 to the configuration metadata (details)
  91. Reenable HIP testing (details)
  92. Disabling failing HIP test in the CI (details)
  93. Use team_size_max to fix "Team size too large" error in reducer test (#6725) (details)
  94. Don't touch my records! (refactor Cuda/HIP/SYCL/Threads to not directly mess with `SharedAllocationRecord`) (#6732) (details)
  95. Untangle SharedAllocationRecord spaghetti code (details)
  96. Trilinos: Don't let Kokkos set CMAKE_CXX_FLAGS (details)
  97. Fix TestThreadVectorMDRangeParallelReduce (#6734) (details)
  98. Cuda multi-GPU support: Allow execution space instance constructor to run (#6706) (details)
  99. add more warp sync for cuda reductions (details)
  100. Drop support for deprecated command-line arguments and environment variables (#6744) (details)
  101. Avoid unnecessary zero-memset of the scratch flags in SYCL (#6739) (details)
  102. std(remove-if): fixing tmp view alloc + avoid evaluating twice the predicate during final pass (details)
Commit b8482317e26c3ae0eb421eb4b5b95fe0d16a0743 by Daniel Arndt
Introduce constructor for multi-GPU support.
The file was modified core/src/Cuda/Kokkos_Cuda_Instance.hpp (diff)
The file was modified core/src/Cuda/Kokkos_Cuda.hpp (diff)
The file was modified core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff)
Commit 64a9b3d854521b7b6dabf7fd9e162c959045b2e3 by noreply
Fix typo

Co-authored-by: Dong Hun Lee <59181952+ldh4@users.noreply.github.com>
The file was modified core/src/Cuda/Kokkos_Cuda_Instance.hpp (diff)
The file was modified core/unit_test/CMakeLists.txt (diff)
The file was addedcore/unit_test/cuda/TestCuda_InterOp_StreamsMultiGPU.cpp
Commit 1683786110806f22fb14b2c9e065ee8350cec8ac by noreply
Fix typo.

Co-authored-by: Dong Hun Lee <59181952+ldh4@users.noreply.github.com>
The file was modified core/unit_test/cuda/TestCuda_InterOp_StreamsMultiGPU.cpp (diff)
Commit fa1aaa712a36394c5c1f1508771cbf13160353b7 by Daniel Arndt
Explicitly check for valid device id
The file was modified core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff)
Commit 41253bd55d7fad8c72f5f3756792705b66bdc954 by Daniel Arndt
Set the device id in cuda_kernel_arch
The file was modified core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff)
Commit f6977cf431cf245341e6dd2f744c79c5a62a315e by Daniel Arndt
Check for default device
The file was modified core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff)
Commit e156d5859e36fbc3c4f84575a0c8734689c1b3eb by Daniel Arndt
Check that device associated with stream matches requested device
The file was modified core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff)
Commit 1fcce6936bf667339350381c8b265b5d82d810da by Daniel Arndt
Remove extra constructor
The file was modified core/unit_test/CMakeLists.txt (diff)
The file was removedcore/unit_test/cuda/TestCuda_InterOp_StreamsMultiGPU.cpp
The file was modified core/src/Cuda/Kokkos_Cuda.hpp (diff)
The file was modified core/src/Cuda/Kokkos_Cuda_Instance.hpp (diff)
The file was modified core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff)
Commit 6d95b621e110df87c488cb93a2f647d1a592af35 by Damien L-G
Remove logical memory spaces
The file was modified core/src/HIP/Kokkos_HIP_Space.hpp (diff)
The file was modified core/src/Kokkos_Core.hpp (diff)
The file was modified core/src/Kokkos_Core_fwd.hpp (diff)
The file was removedcore/src/Kokkos_LogicalSpaces.hpp
The file was removedcore/unit_test/tools/TestLogicalSpaces.hpp
The file was modified core/src/Cuda/Kokkos_CudaSpace.hpp (diff)
The file was modified core/src/Kokkos_HBWSpace.hpp (diff)
The file was modified core/src/SYCL/Kokkos_SYCL_Space.hpp (diff)
The file was modified core/unit_test/CMakeLists.txt (diff)
The file was modified core/src/Kokkos_HostSpace.hpp (diff)
Commit a07c7a2b6c9999020859a7de86004e447790d0c4 by Daniel Arndt
Address reviewer comments
The file was modified core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff)
Commit 403c34f304077eeb92dc8fd8cb2fd24ae6c35688 by Daniel Arndt
m_cudaDev isn't static anymore
The file was modified core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff)
Commit d4a517f82d73a780d03ab06ece627b47d7aad43f by Daniel Arndt
Set the device id explicitly for CUDA API calls in impl_initialize
The file was modified core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff)
Commit 4d4a343e50bf4d5998efe20f97e8670c93bdf2dd by tccleve
Add warp sync for Cuda parallel reduce

compute-sanitizer --tool=racecheck discovered a potential racecondition for Cuda parallel reductions (using range policy) where data was being updated on a single thread inside a warp, but the warp was not being synchronized before being read.
The file was modified core/src/Cuda/Kokkos_Cuda_Parallel_Range.hpp (diff)
Commit b00c1e06856af8dc2dede9057968a6ebb96d9960 by tccleve
update comment to include final() mention
The file was modified core/src/Cuda/Kokkos_Cuda_Parallel_Range.hpp (diff)
Commit 685620918f93420d8e6ec0cd9b03c72dfd2e5a6e by lees2
This PR fixes the too-much-OpenACC-warning issue, mentioned in PR #6639.
This PR also re-enables the OpenACC CI test.
The file was modified tpls/desul/include/desul/atomics/Lock_Based_Fetch_Op_OpenACC.hpp (diff)
The file was modified tpls/desul/include/desul/atomics/Fetch_Op_OpenACC.hpp (diff)
The file was modified .jenkins (diff)
The file was modified tpls/desul/include/desul/atomics/Compare_Exchange_OpenACC.hpp (diff)
Commit 07cdd7000fc57f818a4cb063524e9bc7cfc6bd72 by fnrizzi
add missing header fix #6644
The file was modified algorithms/unit_tests/TestStdAlgorithmsExclusiveScan.cpp (diff)
The file was modified algorithms/unit_tests/TestStdAlgorithmsTransformInclusiveScan.cpp (diff)
The file was modified algorithms/unit_tests/TestStdAlgorithmsInclusiveScan.cpp (diff)
The file was modified algorithms/unit_tests/TestStdAlgorithmsTransformExclusiveScan.cpp (diff)
Commit 71729af719b2eb20b512efad05ae609e74e1dd7a by Damien L-G
Fixup test math functions ulp should double -> int
The file was modified core/unit_test/TestMathematicalFunctions.hpp (diff)
Commit b9b63dfd89fb670d9e18309ad25ef74c6bdd9400 by Damien L-G
Drop DualView converting copy assignment operator

It is unnecessary, we already have a converting constructor and a
regular copy assignment operator.
The file was modified containers/src/Kokkos_DualView.hpp (diff)
Commit 9fd95ebcb1d2f590d376e42ca993068138c0c829 by Daniel Arndt
Don't use rocm-docker for clang-format
The file was modified .jenkins (diff)
The file was modified .jenkins (diff)
Commit eadc210bf4ae3d06c4863e4c7001abf5732d3b4f by Daniel Arndt
Remove deprecation warning for AllocationMechanism for gcc <11.0
The file was modified core/src/Kokkos_HostSpace.hpp (diff)
Commit 843fca336a8bc056b6650310f6c5152f06e3b7e7 by noreply
OpenMPTarget: clang extensions for dynamic shared memory. (#6380)

* OpenMPTarget: clang extensions for dynamic shared memory.

* OpenMPTarget: Rebase and use macros for pragma.

* OpenMPTarget: Fix bug. Add Macros file.

* OpenMPTarget: Fix resize_scratch.

* OpenMPTarget: restore map clause.

* OpenMPTarget: Fix comment.

* OpenMPTarget: change L0 values in resize_scratch.

* OpenMPTarget: Fix comment and spaces.

---------

Co-authored-by: Rahulkumar Gayatri <rgayatri@lbl.gov>
The file was modified core/src/OpenMPTarget/Kokkos_OpenMPTarget_Parallel.hpp (diff)
The file was modified core/src/OpenMPTarget/Kokkos_OpenMPTarget_ParallelFor_Team.hpp (diff)
The file was modified core/src/OpenMPTarget/Kokkos_OpenMPTarget_Exec.cpp (diff)
The file was addedcore/src/OpenMPTarget/Kokkos_OpenMPTarget_Macros.hpp
The file was modified core/src/OpenMPTarget/Kokkos_OpenMPTarget_Parallel_Common.hpp (diff)
Commit fb0380b91b79b5744ea2b12e28a5663c59aeacc4 by crtrott
Fix builtin_unreachable use for MSVC/CUDA

Also split math functions test differently to avoid need for bigobj
The file was modified core/unit_test/TestMathematicalFunctions.hpp (diff)
The file was modified algorithms/unit_tests/TestStdAlgorithmsCommon.hpp (diff)
Commit 458910fbf37ba7fae78e54a2e5b78522cb1b2ccb by crtrott
Fix missing include on msvc/cuda
The file was modified core/src/Serial/Kokkos_Serial.hpp (diff)
Commit 7dcf1deba623ce533f905beb9f37bc26e2009144 by crtrott
Avoid lambdas in constexpr branch for msvc/cuda
The file was modified simd/unit_tests/include/TestSIMD_GeneratorCtors.hpp (diff)
Commit 249f8b4fb42679d37ee4cca9d0c3012739050ac2 by crtrott
Sidestep lacking CTAD support msvc/cuda
The file was modified algorithms/unit_tests/TestStdAlgorithmsTeamRemoveIf.cpp (diff)
The file was modified algorithms/unit_tests/TestStdAlgorithmsTeamReplaceIf.cpp (diff)
The file was modified algorithms/unit_tests/TestStdAlgorithmsTeamPartitionCopy.cpp (diff)
The file was modified algorithms/unit_tests/TestStdAlgorithmsTeamCopyIf.cpp (diff)
The file was modified algorithms/unit_tests/TestStdAlgorithmsTeamIsPartitioned.cpp (diff)
The file was modified algorithms/unit_tests/TestStdAlgorithmsTeamReplaceCopyIf.cpp (diff)
The file was modified algorithms/unit_tests/TestStdAlgorithmsTeamPartitionPoint.cpp (diff)
The file was modified algorithms/unit_tests/TestStdAlgorithmsTeamRemoveCopyIf.cpp (diff)
The file was modified algorithms/unit_tests/TestStdAlgorithmsCommon.hpp (diff)
The file was modified simd/unit_tests/include/TestSIMD_GeneratorCtors.hpp (diff)
The file was modified core/unit_test/TestMathematicalFunctions.hpp (diff)
Commit e524ec7777bb2f6f06391bbea0bc007f624ee599 by crtrott
Move header for Damien because he is right
The file was modified core/src/Serial/Kokkos_Serial.hpp (diff)
Commit ed08974c76ab00a28897c31b876dbd56d283d86d by noreply
Unit test for issue 3371 (negative vector length should not yield a negative max_team_size) (#6076)

* #3371: Added TeamPolicy construction test

* Check recommended team size

* Move size_max and size_recommended tests to TeamPolicy constructor test

* Define tag type outside of test body

* Add missing overload

* tests: remove spurious check

* Remove redundant qualifier

Co-authored-by: Damien L-G <dalg24+github@gmail.com>

* Fix formatting

---------

Co-authored-by: Cezary Skrzyński <cezary.skrzynski@ng-analytics.com>
Co-authored-by: Cezary Skrzyński <czars1988@gmail.com>
Co-authored-by: Damien L-G <dalg24+github@gmail.com>
The file was modified core/unit_test/TestTeamPolicyConstructors.hpp (diff)
Commit 379d5db1a67a1bad6d7d9c8e25a6933655fc8c5e by cwpears
Add CMakeLists.txt for stream benchmark
The file was addedbenchmarks/stream/CMakeLists.txt
The file was modified benchmarks/CMakeLists.txt (diff)
Commit 76ea3a3a95e4619b80e3adf7edda6db05d4ed32d by Damien L-G
Do not negate the dependent true traits helper
The file was modified core/src/OpenACC/Kokkos_OpenACC_ParallelReduce_Team.hpp (diff)
The file was modified core/src/OpenACC/Kokkos_OpenACC_ParallelReduce_Range.hpp (diff)
The file was modified core/src/OpenACC/Kokkos_OpenACC_Team.hpp (diff)
The file was modified core/src/OpenACC/Kokkos_OpenACC_ParallelReduce_MDRange.hpp (diff)
Commit ae71e400213ae256c352e3e8c46cdc9db6d089bf by Damien L-G
Drop guards to accommodate external code defining KOKKOS_ASSERT
The file was modified core/src/Kokkos_Assert.hpp (diff)
Commit a996c12a0315957458df283b5ea797b3e10573ee by Daniel Arndt
Use omp_get_max_active_levels() when supported
The file was modified core/src/OpenMP/Kokkos_OpenMP_Instance.hpp (diff)
The file was modified generate_makefile.bash (diff)
The file was modified Makefile.kokkos (diff)
The file was modified cmake/KokkosCore_config.h.in (diff)
The file was modified core/src/HIP/Kokkos_HIP_Instance.hpp (diff)
The file was modified cmake/kokkos_arch.cmake (diff)
Commit 33db3046a173b5e96d1dff62c8555a8da5cea9bb by Damien L-G
Add Impl::always_false type-dendent false trait
The file was modified core/src/impl/Kokkos_Utilities.hpp (diff)
Commit e4a7cfc78f0564f2aefb8a98cab3144baffd15b6 by Damien L-G
Per review prefer always_false<Arg>::value to is_void_v<Arg>
The file was modified core/src/OpenACC/Kokkos_OpenACC_ParallelReduce_Team.hpp (diff)
The file was modified core/src/OpenACC/Kokkos_OpenACC_ParallelReduce_MDRange.hpp (diff)
The file was modified core/src/OpenACC/Kokkos_OpenACC_Team.hpp (diff)
The file was modified core/src/OpenACC/Kokkos_OpenACC_ParallelReduce_Range.hpp (diff)
Commit 316ceac58eedf97fec5ee9f2fb87018a83fb97ec by bmkelle
Improve "no copy mechanism" exception message
The file was modified core/src/Kokkos_CopyViews.hpp (diff)
Commit 838f8938e9aaf1ac78027575743bb95b442470a9 by bmkelle
Add a unit test for new deep_copy exception msg
The file was modified core/unit_test/TestViewCopy_a.hpp (diff)
Commit 72bc7ed42f0f658ebc3b158520feba086111fc3b by bmkelle
Add missing include sstream
The file was modified core/src/Kokkos_CopyViews.hpp (diff)
Commit 154a57df8c53bff88dc4806967cb8a49d8ed1c40 by bmkelle
src->source, dst->destination

and reformat a bit
The file was modified core/src/Kokkos_CopyViews.hpp (diff)
Commit 5df22b87b046290e84c1c6a2d4d3bcbbc47d1e5c by Bruno Turcksin
Workaround for ROCm 6.0 failing to compile with AVX2 SIMD support
The file was modified simd/src/Kokkos_SIMD_AVX2.hpp (diff)
Commit 52e44d6cfe35dcfc55c3d91ddff70cc63f8d514c by noreply
SYCL: Force inlining of Kokkos::printf (#6650)

* SYCL: Force inlining of Kokkos::printf

* Always force inline Kokkos::printf
The file was modified core/src/Kokkos_Printf.hpp (diff)
Commit 79164a43adc5da6e3eee12f3cfc66e3e1402ed3f by Daniel Arndt
Improve handling of printf in OMPT on Intel GPUs
The file was modified core/src/Kokkos_Printf.hpp (diff)
Commit cbbe09b93db49e081ab4d400975aca8c007c4223 by noreply
OpenMP: Use `omp_get_nested` for older gcc versions (#6685)

* OpenMP: fix for issue 6670.

* OpenMP: Update gcc version when using max_active_level.

* OpenMP: Edit execute_in_serial.

* Apply suggestions from code review

Co-authored-by: Daniel Arndt <arndtd@ornl.gov>

* clang-format

---------

Co-authored-by: Rahulkumar Gayatri <rgayatri@lbl.gov>
Co-authored-by: Damien L-G <dalg24+github@gmail.com>
Co-authored-by: Daniel Arndt <arndtd@ornl.gov>
The file was modified core/src/OpenMP/Kokkos_OpenMP_Instance.hpp (diff)
Commit 8de16ea357136aeaa0f06a27d24820d3d2f37235 by Daniel Arndt
Disable more Bessel tests for SYCL on INtel GPUs
The file was modified core/unit_test/TestMathematicalSpecialFunctions.hpp (diff)
Commit 391daefd51480fcf674d4c891e9296c1eb8e0e55 by Daniel Arndt
fill_random without exceution space instance should fence
The file was modified algorithms/src/Kokkos_Random.hpp (diff)
Commit 5aa0ceee4b0132ec411f7fc1e11b4b9a1039cde3 by Damien L-G
Drop unnecessary guarding for a tool library being loaded in ProfilingSection
The file was modified core/src/Kokkos_Profiling_ProfileSection.hpp (diff)
Commit 73c75075536f905818560f0bc415348b462fe38f by Damien L-G
Drop unnecessary header include in Kokkos_Profiling_ProfileSection.hpp
The file was modified core/src/Kokkos_Profiling_ProfileSection.hpp (diff)
Commit 02b46c09c75697da5a1e4bfed5429a56c1bfb7ef by noreply
#5333: CUDA: Use scratch space appropriate to small reduction elements in Team reductions (#5334)

* #5333: Add maybe failing test case?

* Revise test to try to target the right code

* NOMERGE #5333: Partial implementation of fix

* #5333: Update tests

* #5333: Fix comparision types in reducers test

* #5333: Re-enable tests

* #5333: Add test for Scalar with size of 1

* Fix formatting

* Remove unrelated changes

* Extract TeamPolicy tests into a separate method

* Disable new tests (OpenACC, OpenMPTarget, Serial, SYCL, point_t)

* Use smaller number of teams for bhalf_t

* Use pointer_type for m_scratch_space parameter

* Skip failing tests (SYCL)

* Fix tests for OpenMPTarget

* Fix num_teams_done variable in SYCL TeamPolicy reduction

* Restore guards for array_reduce<float, 7>

* Also make array_reduce<float, 7> work for SYCL

* tests: change point_t default constructor

---------

Co-authored-by: Arkadiusz Szczepkowicz <arek.szczepkowicz@ng-analytics.com>
Co-authored-by: Cezary Skrzyński <cezary.skrzynski@ng-analytics.com>
Co-authored-by: Daniel Arndt <arndtd@ornl.gov>
The file was modified core/src/SYCL/Kokkos_SYCL_Team.hpp (diff)
The file was modified core/src/SYCL/Kokkos_SYCL_ParallelReduce_Team.hpp (diff)
The file was modified core/unit_test/TestReducers.hpp (diff)
The file was modified core/unit_test/TestReducers_d.hpp (diff)
The file was modified core/src/Cuda/Kokkos_Cuda_Parallel_Team.hpp (diff)
The file was modified core/unit_test/TestNonTrivialScalarTypes.hpp (diff)
The file was modified core/src/Cuda/Kokkos_Cuda_ReduceScan.hpp (diff)
Commit 4078a0d8a022ab1ad2291554b2820916fa3bc361 by noreply
Cuda: Allocate using the correct device (#6392)

* Cuda: Allocate using the correct device

* Avoid warning about uninitialized variable

* exec_space_provided -> stream_sync_only

* Fix up ASYNC support

* Only check for errors in synchronization if allocation was successful

* Rename arguments and make constructor private

---------

Co-authored-by: Daniel Arndt <arndtd@ornl.com>
The file was modified core/src/Cuda/Kokkos_Cuda_Instance.hpp (diff)
The file was modified core/src/Cuda/Kokkos_CudaSpace.hpp (diff)
The file was modified core/src/Cuda/Kokkos_CudaSpace.cpp (diff)
The file was modified core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff)
Commit f485cfa53cb2d25cab98e7a1d41a116027ec2363 by noreply
Let `Profiling::ProfilingSection(std::string)` constructor be explicit and nodiscard (#6690)

* Add nodiscard attribute to Profiling::ProfilingSection

* Make Profiling::ProfilingSection constructor explicit
The file was modified core/src/Kokkos_Profiling_ProfileSection.hpp (diff)
Commit 4eae6a99f1123f31496504f60cb6a4735559052c by Damien L-G
Cosmetic changes to ProfilingSection
The file was modified core/src/Kokkos_Profiling_ProfileSection.hpp (diff)
Commit 654a51f60cef8900868f55a93661cee80f451cd9 by noreply
GitHub CI: Test with AddressSanitizer (#6676)

* GitHub CI: Test with AddressSanitizer

* Disable view_allocation_error when running with AddressSanitizer

* Try  __attribute__((no_sanitize(address)))

* Revert "Try  __attribute__((no_sanitize(address)))"

This reverts commit 335f7f26ae96034ceae221e5a80bd93b4769ed1d.
The file was modified .github/workflows/continuous-integration-workflow.yml (diff)
The file was modified core/unit_test/TestViewAPI_d.hpp (diff)
Commit efc0c365c0b61c63a47712656788d2d30ab16d4b by noreply
Kokkos::Array deduction guide (#6373)

* Added a deduction guide for Kokkos::Array

* Added unit test for Kokkos::Array deduction guide

* Decorated is_equal with KOKKOS_FUNCTION
The file was modified core/src/Kokkos_Array.hpp (diff)
The file was modified core/unit_test/TestArray.cpp (diff)
Commit 06de563f9958c58a6f292d742d8f658d67600d7c by noreply
Add CI for MSVC+Cuda (#6661)

The file was added.github/workflows/windows.yml
Commit 9f5e38e976f1c9003d5895803e5d5ce03015dbbb by noreply
SYCL: Address deprecations after oneAPI 2023.2.0 (#6577)

* Address deprecations in oneAPI 2023.2.0

* Define KOKKOS_IMPL_SYCL_GET_MULTI_PTR
The file was modified core/src/SYCL/Kokkos_SYCL_ParallelFor_Team.hpp (diff)
The file was modified core/src/setup/Kokkos_Setup_SYCL.hpp (diff)
The file was modified core/src/SYCL/Kokkos_SYCL_ParallelReduce_Team.hpp (diff)
Commit cbf1c644c287a9a1f3c4220b7dbe196b7aba0b44 by Damien L-G
Fixup cast tolerance to double before printing
The file was modified core/unit_test/TestMathematicalFunctions.hpp (diff)
Commit 3358970c2fe7c36e6de79912edd2a65f5a46b53f by Daniel Arndt
Try linking against CUDA libararies even with KOKKOS_ENABLE_COMPILE_AS_CMAKE_LANGUAGE
The file was modified core/src/CMakeLists.txt (diff)
The file was modified cmake/kokkos_tpls.cmake (diff)
Commit 716bef2a4c8025076b2893e9faaf4330b716041d by ndellin
test_array_ctad: disable test for intel versions < 2021

Address issue #6702
The file was modified core/unit_test/TestArray.cpp (diff)
Commit 3523bc3e789110445320af86e1feea6862b31b68 by noreply
Enable `{transform_}exclusive_scan` in place (#6667)

* Enable exclusive_scan in place

* add test for inplace operation

* improve tests

* formatting

* fix test for transform_*

* simplify args

* revise tests

---------

Co-authored-by: Francesco Rizzi <fnrizzi@sandia.gov>
The file was modified algorithms/src/std_algorithms/impl/Kokkos_FunctorsForExclusiveScan.hpp (diff)
The file was modified algorithms/unit_tests/TestStdAlgorithmsTransformExclusiveScan.cpp (diff)
The file was modified algorithms/unit_tests/TestStdAlgorithmsTeamTransformExclusiveScan.cpp (diff)
The file was modified algorithms/unit_tests/TestStdAlgorithmsTeamExclusiveScan.cpp (diff)
The file was modified algorithms/unit_tests/TestStdAlgorithmsExclusiveScan.cpp (diff)
Commit 27286c32d7e0c157d1a4d7f00f9441979ed8eabe by noreply
Add `ATOMICS_BYPASS` configuration option to disable atomics (#6692)

* Add NAME_TBD_UNSAFE_ATOMICS configuration option to disable atomics

* Rename configuration option and macro NAME_TBD_UNSAFE_ATOMICS -> ATOMICS_BYPASS

* Improve option description and configuration time error message

Co-Authored-By: Christian Trott <crtrott@sandia.gov>
Co-Authored-By: Daniel Arndt <arndtd@ornl.gov>

* Mention Kokkos_ENABLE_ATOMICS_BYPASS option in error msssage

* Simplify sanity check for disabling atomics in <Kokkos_Macros.hpp>

Co-Authored-By: Daniel Arndt <arndtd@ornl.gov>

---------

Co-authored-by: Christian Trott <crtrott@sandia.gov>
Co-authored-by: Daniel Arndt <arndtd@ornl.gov>
The file was modified cmake/kokkos_arch.cmake (diff)
The file was modified core/src/Kokkos_Atomics_Desul_Volatile_Wrapper.hpp (diff)
The file was modified cmake/KokkosCore_config.h.in (diff)
The file was modified cmake/kokkos_enable_options.cmake (diff)
The file was modified core/src/Kokkos_Atomics_Desul_Wrapper.hpp (diff)
The file was modified core/src/Kokkos_Macros.hpp (diff)
The file was modified core/src/Serial/Kokkos_Serial.cpp (diff)
Commit 0e4a158a7585f2a93b93d4121788016fe0dba4c9 by noreply
Check matching static extents in View constructor (#5190)

* Also check static extents

* Use separate bools for the two checks and improve error messages

* Improve error messages

* Refactor test

* Add another test

* Extend existing tests

* Use make_index_sequence

* Remove LIVE and DIE

* Try removing guards for OpenMPTarget

* Use different style in view_construction_with_wrong_static_extents

* Remove template template parameter

* Also check unmanaged views

* Fix DynRankView

* Add FIXME

* Pass label by const char *

* fix terminating define

* Deduce rank and dynamic_rank from View

* Only call runtime_check_rank when KOKKOS_ENABLE_DEBUG_BOUNDS_CHECK is defined

* Guard with if constexpr

---------

Co-authored-by: Francesco Rizzi <fnrizzi@sandia.gov>
The file was modified core/unit_test/TestViewAPI.hpp (diff)
The file was modified core/src/Kokkos_View.hpp (diff)
The file was modified core/unit_test/TestViewCtorDimMatch.hpp (diff)
The file was modified containers/src/Kokkos_DynRankView.hpp (diff)
The file was modified core/src/impl/Kokkos_ViewArray.hpp (diff)
Commit 96d530a24034467db26070081b62a0a23c60530e by noreply
Remove Kokkos::[b]half_t volatile overloads (#6579)

* Deprecate Kokkos::[b]half_t volatile overloads

* Remove volatile [b]half_t altogether
The file was modified core/unit_test/TestHalfOperators.hpp (diff)
The file was modified core/src/impl/Kokkos_Half_FloatingPointWrapper.hpp (diff)
The file was modified algorithms/unit_tests/TestStdAlgorithmsTransformInclusiveScan.cpp (diff)
The file was modified algorithms/unit_tests/TestStdAlgorithmsTeamTransformInclusiveScan.cpp (diff)
Commit 0ba8c40fc56c54bc0fd5808a23cf44e48749c187 by noreply
Provide `kokkos_swap` as part of Core and deprecate `Experimental::swap` in Algorithms (#6697)

* Move Kokkos_Swap.hpp header to core

* Include only what we use (in <Kokkos_Swap.hpp>)

* Rename header guards in <Kokkos_Swap.hpp> (because I can)

* Drop useless comment

* Fix noexcept specification of Kokkos::swap

* Constexprify Kokkos::swap

* Fixup type requirements in Kokkos::swap

* Drop the inline (because I can)

* fixup! Move Kokkos_Swap.hpp header to core

* Promote swap to namespace Kokkos::

* Move and update Kokkos::swap unit test to core

* Add support for swapping arrays

* Fixup size_t

* Provide Kokkos::swap overload for a few classes

* Rename Kokkos::swap -> Kokkos::kokkos_swap

* For backward compatibility provide deprecated Experimental::swap
The file was modified algorithms/src/std_algorithms/impl/Kokkos_Reverse.hpp (diff)
The file was modified algorithms/src/Kokkos_StdAlgorithms.hpp (diff)
The file was modified core/src/Kokkos_Pair.hpp (diff)
The file was modified algorithms/src/std_algorithms/impl/Kokkos_SwapRanges.hpp (diff)
The file was modified core/src/Kokkos_Core.hpp (diff)
The file was addedcore/src/Kokkos_Swap.hpp
The file was modified core/unit_test/CMakeLists.txt (diff)
The file was addedcore/unit_test/TestSwap.hpp
The file was modified algorithms/unit_tests/TestStdAlgorithmsModOps.cpp (diff)
The file was removedalgorithms/src/std_algorithms/Kokkos_Swap.hpp
The file was modified core/src/Kokkos_Array.hpp (diff)
The file was modified algorithms/src/sorting/impl/Kokkos_NestedSortImpl.hpp (diff)
The file was modified algorithms/src/std_algorithms/Kokkos_IterSwap.hpp (diff)
Commit 89ba3fbae8ab2ae2dbfd30eef05c3bac387630a5 by noreply
Provide new public headers `<Kokkos_Clamp.hpp>` and `<Kokkos_MinMax.hpp>` (#6687)

* Promote min, max, clamp to public

* Drop unnecessary <Kokkos_MinMaxClamp.hpo> header includes

* Split Kokkos_MinMaxClamp.hpp into Kokkos_MinMax.hpp and Kokkos_Clamp.hpp

---------

Co-authored-by: Damien L-G <dalg24@gmail.com>
The file was modified core/src/Kokkos_View.hpp (diff)
The file was modified core/src/Cuda/Kokkos_Cuda_Parallel_Range.hpp (diff)
The file was addedcore/src/Kokkos_Clamp.hpp
The file was removedcore/src/Kokkos_MinMaxClamp.hpp
The file was modified core/src/HIP/Kokkos_HIP_TeamPolicyInternal.hpp (diff)
The file was modified core/src/Kokkos_Core.hpp (diff)
The file was modified core/src/Cuda/Kokkos_Cuda_Parallel_MDRange.hpp (diff)
The file was modified core/src/Cuda/Kokkos_Cuda_Parallel_Team.hpp (diff)
The file was addedcore/src/Kokkos_MinMax.hpp
Commit ee5cbfc25c81b9efc5b2a4abd82d9a9d5d3d07bd by noreply
Fix TeamThreadMDRange parallel_reduce (#6511)

* Fix TeamThreadRangeMD parallel_reduce

* Use vector_reduce for Cuda, HIP, and SYCL

* Initialize reduction variable

* Test with maximal vector length

* TeamHandle::execution_space->typename TeamHandle::execution_space

* Fix OpenMPTarget

* Restore tests

* reducer is unused

* Workarounds for OpenMPTarget

* Reduce

* Fix OpenMPTarget tests

* Restore tests

* Strengthen test case for OpenMPTarget

* Add some static_asserts
The file was modified core/src/OpenMPTarget/Kokkos_OpenMPTarget_Parallel.hpp (diff)
The file was modified core/src/Kokkos_ExecPolicy.hpp (diff)
The file was modified core/unit_test/TestTeamMDRange.hpp (diff)
The file was modified algorithms/unit_tests/TestStdAlgorithmsCommon.hpp (diff)
Commit 36da6cca7fb4b4b7082f0ec91b4e388578787e8f by noreply
add tests for in-place `inclusive_scan` (#6682)

* add test for inclusive scan in place

* refine tests

* remove comment

* revert transform since it is moved to separate pr
The file was modified algorithms/unit_tests/TestStdAlgorithmsInclusiveScan.cpp (diff)
The file was modified algorithms/unit_tests/TestStdAlgorithmsTeamInclusiveScan.cpp (diff)
Commit 0254c631bb2496fb80669d050e53d37d258fd4d1 by Damien L-G
Drop pointless Kokkos::Impl::CudaExec forward declaration

There is no defintion for it...
The file was modified core/src/Cuda/Kokkos_Cuda.hpp (diff)
Commit 9393b358fb1300d15c41957c42a566f329bcac85 by noreply
Don't use the compiler launcher script if the compile language is CUDA. (#6704)

* Don't use the compiler launcher script if the compile language is CUDA.

* Updated descriptive comment.
The file was modified cmake/KokkosConfig.cmake.in (diff)
Commit c75d730d203077bf79439c9b3f4a33cbbc18c6a9 by noreply
Deprecate `{Cuda,HIP}::detect_device_count()` and `Cuda::[detect_]device_arch()` (#6710)

* CUDA/HIP: Inline getting device count

get_device_count() -> int cannot be generic. Get rid of the extra
indirection because it brings no benefit.

* Get rid of CudaInternalDevices and cleanup Cuda::print_configuration()

* Get rid of cuda_get_device_{count,properties}_wrapper()

* Missed a few CudaInternalDevices and get rid of Cuda::detect_device_arch()

* Get rid of Cuda::device_arch()

* Fixup

Co-authored-by: Bruno Turcksin <bruno.turcksin@gmail.com>

* Don’t mess with Voodoo

* Be more conservative and deprecate before removing

* Clang-format for suggestion made on GH

* Remove stray const qualifier

* Forgot that device_arch() was static

* Who Let the Bugs Out??

* The same thing we do every night, Pinky - try to take over the world!

* this should fix it

---------

Co-authored-by: Bruno Turcksin <bruno.turcksin@gmail.com>
The file was modified core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff)
The file was modified core/src/HIP/Kokkos_HIP_Instance.cpp (diff)
The file was modified core/src/Cuda/Kokkos_Cuda.hpp (diff)
The file was modified core/src/impl/Kokkos_Core.cpp (diff)
The file was modified core/src/Cuda/Kokkos_Cuda_Instance.hpp (diff)
The file was modified core/src/HIP/Kokkos_HIP.hpp (diff)
Commit 868e42e7beb8a49f06b5c7c505c1e1ab5f9eb1e4 by Damien L-G
Get rid of CudaInternal::cuda_get_error_{name,string}_wrapper
The file was modified core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff)
The file was modified core/src/Cuda/Kokkos_Cuda_Instance.hpp (diff)
Commit e6ff1a46969737a0508df86799c93b4051faa6de by Damien L-G
No need to jump through so many hoops to print the error message
The file was modified core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff)
Commit 4e33b3bf9d73aeebe5e0897eeacb651e6a498d6a by Damien L-G
HIP: Forgot to delete matching brace closing the namespace
The file was modified core/src/HIP/Kokkos_HIP_Instance.cpp (diff)
Commit 35a867d374227522e44c5240e18365ac0635980a by noreply
Make initialize and finalize of the Cuda/HIP singleton less special (#6714)

Make initialization of the Cuda/HIP singleton less special
The file was modified core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff)
The file was modified core/src/HIP/Kokkos_HIP.cpp (diff)
The file was modified core/src/HIP/Kokkos_HIP_Instance.cpp (diff)
Commit 256c0ca62e56216ebd6351b3ed2c7bdacef8c654 by ndellin
Kokkos_HIP.cpp: include Kokkos_Core.hpp to resolve errors

Resolves errors of form:
/home/jenkins/caraway-new/workspace/KokkosKernels_PullRequest_VEGA90A_ROCM560/kokkos/core/src/HIP/Kokkos_HIP.cpp:96:15: error: no member named 'hip_global_unique_token_locks' in namespace 'Kokkos::Impl'
  (void)Impl::hip_global_unique_token_locks(true);

Note: include of Kokkos_HIP_UniqueToken.hpp was insufficient as this triggered new errors:
/home/ndellin/kokkos/core/src/HIP/Kokkos_HIP_UniqueToken.hpp:40:29: error: implicit instantiation of undefined template 'Kokkos::View<unsigned int *, Kokkos::HIPSpace>'
  View<uint32_t*, HIPSpace> m_locks;
The file was modified core/src/HIP/Kokkos_HIP.cpp (diff)
Commit 179d2e67fd42eeb894cf9b5e976104a13de990c2 by noreply
Add bound checks in RangePolicy and MDRangePolicy (#6617)

* Added a bounds check in MDRangePolicy that checks that all lower bounds are less than its upper bound

* Modified the wording on the abort

* Converted the error msg from a stringstream to a string

* Modified abort msg

* Fixed the unit test output based on backend's default iterate direction

* Update core/unit_test/TestMDRangePolicyConstructors.hpp

Formatting.

Co-authored-by: Damien L-G <dalg24+github@gmail.com>

* Updated RangePolicy to have the same precondition as MDRangePolicy

---------

Co-authored-by: Damien L-G <dalg24+github@gmail.com>
The file was modified core/src/Kokkos_ExecPolicy.hpp (diff)
The file was modified core/unit_test/TestMDRangePolicyConstructors.hpp (diff)
The file was modified core/src/KokkosExp_MDRangePolicy.hpp (diff)
The file was modified core/unit_test/TestRangePolicyConstructors.hpp (diff)
Commit f42a8cb0326df4cb68c3e9da87842d4403799386 by Bruno Turcksin
Temporary fix to reenable HIP CI
The file was modified .jenkins (diff)
Commit 86f5bb7d892b74defdcde7da8866bb9a5d5afbd4 by noreply
Let the smart pointer manage the CUDA/HIP stream (#6721)

* Let the smart pointer manage the CUDA/HIP stream

* Fixup do not null the stream in {Cuda/HIP}Internal::finalize()

* Fixup hip not tested either

* Not breaking HIP backend for 3rd time in a row

Co-authored-by: Bruno Turcksin <bruno.turcksin@gmail.com>

---------

Co-authored-by: Bruno Turcksin <bruno.turcksin@gmail.com>
The file was modified core/src/HIP/Kokkos_HIP.cpp (diff)
The file was modified core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff)
The file was modified core/src/HIP/Kokkos_HIP_Instance.hpp (diff)
The file was modified core/src/Cuda/Kokkos_Cuda_Instance.hpp (diff)
The file was modified core/src/HIP/Kokkos_HIP_Instance.cpp (diff)
Commit 9eca17795dfc38564aaf53f5e26899e4f89d114d by Bruno Turcksin
Fix Docker env variables
The file was modified .jenkins (diff)
Commit 8f743cf95c4718abaa65fcca42e0e4f5c55c647f by Damien L-G
Ensure view_allocation_error does not silently ignore that no exception was thrown
The file was modified core/unit_test/TestViewAPI.hpp (diff)
Commit a3aa567af5d335888b65cfdf1948997dec21d69d by Damien L-G
Add RawMemoryAllocationFailure::AllocationMechanism::OpenACCMalloc enumerator
The file was modified core/src/impl/Kokkos_Error.hpp (diff)
Commit f9f3c6e13cde62016b42f856ccc96fe4a8771a28 by Damien L-G
[OpenACC] throw if acc_malloc returned nullptr
The file was modified core/src/OpenACC/Kokkos_OpenACCSpace.cpp (diff)
Commit 3d33665ffc72c8d7a50ba9ffdbb3972530371de3 by Damien L-G
Fixup using declaration
The file was modified core/src/OpenACC/Kokkos_OpenACCSpace.cpp (diff)
Commit 5781d176e8ce86dcc5605c0e668d850a4592bdd1 by Damien L-G
Disable openacc.view_allocation_error test
The file was modified core/unit_test/TestViewAPI_d.hpp (diff)
Commit 6912b3998d4722c6c19db1a5d110e15d882d0ff5 by noreply
Guard `[MD]RangePolicy` precondition check for deprecated code 4 (#6726)

* Guard [MD]RangePolicy precondition check for deprecated code 4

* No good reason to use a raw string literal for the warning msg

* Drop pointless inline specifier

* Restore original behavior when deprecated code 4 is enabled

* first blush

* Fixup obviously wont display both message when aborting

* Trust me, it's not badly written. It's just way above your head.

* Add assertions for the old behavior

* Fiddling with string comparison

* Attempt to resolve death tests error msg matching issues

* Fix that stupid regex
The file was modified core/src/Kokkos_ExecPolicy.hpp (diff)
The file was modified core/src/impl/Kokkos_Error.cpp (diff)
The file was modified core/unit_test/TestMDRangePolicyConstructors.hpp (diff)
The file was modified core/src/impl/Kokkos_Error.hpp (diff)
The file was modified core/unit_test/TestRangePolicyConstructors.hpp (diff)
The file was modified core/src/KokkosExp_MDRangePolicy.hpp (diff)
Commit bd3c0a552c423eb6f11f950cc68771ecc18b44b6 by dev
Add C++26 standard to CMake Setup
The file was modified cmake/kokkos_pick_cxx_std.cmake (diff)
The file was modified cmake/KokkosCore_config.h.in (diff)
The file was modified cmake/kokkos_test_cxx_std.cmake (diff)
Commit 39a0f3d675f79841e2237de57c0c7efa702b663d by Damien L-G
Add support for C++26 in generated makefiles
The file was modified Makefile.kokkos (diff)
Commit 87f32846bb0ac950ae7b639e922369b5754ffb60 by Damien L-G
Add KOKKOS_ENABLE_CXX26 to the configuration metadata
The file was modified core/src/impl/Kokkos_Core.cpp (diff)
The file was modified .jenkins (diff)
Commit 523d70189dd7a04cb2d75dbe7e23cbb904fb578f by Bruno Turcksin
Disabling failing HIP test in the CI
The file was modified core/unit_test/hip/TestHIP_Memory_Requirements.cpp (diff)
Commit 407e18dc8c652ff6d6c6c6796e2866891580ab4f by noreply
Use team_size_max to fix "Team size too large" error in reducer test (#6725)

* Fix test to not use team size too large

For some configurations, one of the reducer test was setting a team size too large.

* Remove team_size calculation from test not using it

* Remove unused typedef

* Use int in for-loop to avoid warning

Co-authored-by: Damien L-G <dalg24+github@gmail.com>

* Fix teamsize request for OPENMPTARGET

* Cleanup test and match TestTeam.hpp

Use 32 as a minimum team size like in TestTeam.hpp. Better match layout
in the 3 tests here.

---------

Co-authored-by: Damien L-G <dalg24+github@gmail.com>
The file was modified core/unit_test/TestReducers.hpp (diff)
Commit 5610068c5d908e94806ce8e9b7a8118ec959b1eb by noreply
Don't touch my records! (refactor Cuda/HIP/SYCL/Threads to not directly mess with `SharedAllocationRecord`) (#6732)

* Do not use SharedAllocationRecord directly

* Purge Cuda/HIP graph implementation from SharedAllocationRecord

* MemorySpace::{free -> allocate} and FIXMEs for size argument

* Properly cast allocated pointer from void* to size_type*

* Fixup Cuda/HIP graph header includes and pointer casting

* Fix the FIXMEs
The file was modified core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff)
The file was modified core/src/HIP/Kokkos_HIP_GraphNodeKernel.hpp (diff)
The file was modified core/src/HIP/Kokkos_HIP_Instance.cpp (diff)
The file was modified core/src/SYCL/Kokkos_SYCL_Instance.cpp (diff)
The file was modified core/src/Cuda/Kokkos_Cuda_GraphNodeKernel.hpp (diff)
The file was modified core/src/Threads/Kokkos_Threads_Instance.cpp (diff)
Commit d18ad8f349634e23fe78db185114ebdc1e9674fb by Damien L-G
Untangle SharedAllocationRecord spaghetti code
The file was modified core/src/HIP/Kokkos_HIP_Space.cpp (diff)
The file was modified core/src/OpenMPTarget/Kokkos_OpenMPTargetSpace.hpp (diff)
The file was modified core/src/impl/Kokkos_HBWSpace.cpp (diff)
The file was modified core/src/impl/Kokkos_SharedAlloc.hpp (diff)
The file was modified core/src/HIP/Kokkos_HIP_SharedAllocationRecord.cpp (diff)
The file was modified core/src/HIP/Kokkos_HIP_SharedAllocationRecord.hpp (diff)
The file was modified core/src/HIP/Kokkos_HIP_Space.hpp (diff)
The file was modified core/src/Cuda/Kokkos_CudaSpace.cpp (diff)
The file was removedcore/src/impl/Kokkos_MemorySpace.hpp
The file was modified core/src/Kokkos_HBWSpace.hpp (diff)
The file was modified core/src/impl/Kokkos_HostSpace.cpp (diff)
The file was modified core/src/Cuda/Kokkos_Cuda_UniqueToken.hpp (diff)
The file was modified core/src/OpenACC/Kokkos_OpenACC_SharedAllocationRecord.cpp (diff)
The file was modified Makefile.targets (diff)
The file was modified core/src/OpenACC/Kokkos_OpenACC_SharedAllocationRecord.hpp (diff)
The file was modified core/src/SYCL/Kokkos_SYCL_Space.hpp (diff)
The file was modified core/src/Kokkos_HostSpace.hpp (diff)
The file was modified core/src/impl/Kokkos_SharedAlloc_timpl.hpp (diff)
The file was modified core/src/SYCL/Kokkos_SYCL_Space.cpp (diff)
The file was modified core/src/HIP/Kokkos_HIP_UniqueToken.hpp (diff)
The file was modified core/src/HIP/Kokkos_HIP_GraphNodeKernel.hpp (diff)
The file was removedcore/src/impl/Kokkos_MemorySpace.cpp
The file was modified core/src/impl/Kokkos_SharedAlloc.cpp (diff)
The file was modified core/src/Cuda/Kokkos_CudaSpace.hpp (diff)
The file was modified core/src/OpenMPTarget/Kokkos_OpenMPTargetSpace.cpp (diff)
The file was modified core/src/OpenACC/Kokkos_OpenACCSpace.cpp (diff)
Commit c17969f3338f9b5866496e31f34796703038d85a by Daniel Arndt
Trilinos: Don't let Kokkos set CMAKE_CXX_FLAGS
The file was modified CMakeLists.txt (diff)
Commit a1a6ea14c181b0e4f88cec82652bf594ceba5e22 by noreply
Fix TestThreadVectorMDRangeParallelReduce (#6734)

* Fix TestThreadVectorMDRangeParallelReduce

* Eliminate teamSum
The file was modified core/unit_test/TestTeamMDRange.hpp (diff)
Commit 2dc7cbcc9cebe83569a644ccf3a7c6e825e6ec4a by noreply
Cuda multi-GPU support: Allow execution space instance constructor to run (#6706)

* Cuda multi-GPU support: Allow execution space instance constructor to run

* Skip a test

* Use cuda_stream/device also for UVM and HostPinned

* Clean up

* Revert test changes
The file was modified core/unit_test/CMakeLists.txt (diff)
The file was modified core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff)
The file was addedcore/unit_test/cuda/TestCuda_InterOp_StreamsMultiGPU.cpp
Commit 57126af31360a14cf1b5e2948a3f5fbf48a861b2 by tccleve
add more warp sync for cuda reductions
The file was modified core/src/Cuda/Kokkos_Cuda_Parallel_MDRange.hpp (diff)
The file was modified core/src/Cuda/Kokkos_Cuda_Parallel_Team.hpp (diff)
Commit d560c4719df6fcf664cdb5c98de29fdd159f27d0 by noreply
Drop support for deprecated command-line arguments and environment variables (#6744)

* Drop support for deprecated cmdline args and env vars

* Deprecate num_devices and skip_device in InitializationSettings

* Fixup rm now unused get_flag utility lambda
The file was modified core/src/impl/Kokkos_DeviceManagement.hpp (diff)
The file was modified core/src/impl/Kokkos_InitializationSettings.hpp (diff)
The file was modified core/unit_test/TestInitializationSettings.cpp (diff)
The file was modified core/src/impl/Kokkos_Core.cpp (diff)
The file was modified core/unit_test/TestParseCmdLineArgsAndEnvVars.cpp (diff)
Commit 650ac40677765a7a8e6951528489a5ff75e9da58 by noreply
Avoid unnecessary zero-memset of the scratch flags in SYCL (#6739)

* SYCL: Homogenize scratch_flags with CUDA and HIP

* Add comments for CUDA and HIP

* Fix typo [ci skip]

---------

Co-authored-by: Daniel Arndt <arndtd@ornl.gov>
The file was modified core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff)
The file was modified core/src/SYCL/Kokkos_SYCL_ParallelReduce_MDRange.hpp (diff)
The file was modified core/src/SYCL/Kokkos_SYCL_Instance.cpp (diff)
The file was modified core/src/SYCL/Kokkos_SYCL_ParallelReduce_Range.hpp (diff)
The file was modified core/src/SYCL/Kokkos_SYCL_ParallelScan_Range.hpp (diff)
The file was modified core/src/SYCL/Kokkos_SYCL_ParallelReduce_Team.hpp (diff)
The file was modified core/src/HIP/Kokkos_HIP_Instance.cpp (diff)
Commit 5403681145d2fa2ee99dc10a5f3d62e4dc036512 by romin.tomasetti
std(remove-if): fixing tmp view alloc + avoid evaluating twice the predicate during final pass
The file was modified algorithms/src/std_algorithms/impl/Kokkos_RemoveAllVariants.hpp (diff)