The file was modified
core/src/Cuda/Kokkos_Cuda_Instance.hpp (diff) The file was modified
core/src/Cuda/Kokkos_Cuda.hpp (diff) The file was modified
core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff) The file was modified
core/src/Cuda/Kokkos_Cuda_Instance.hpp (diff) The file was modified
core/unit_test/CMakeLists.txt (diff) The file was added core/unit_test/cuda/TestCuda_InterOp_StreamsMultiGPU.cpp The file was modified
core/unit_test/cuda/TestCuda_InterOp_StreamsMultiGPU.cpp (diff) The file was modified
core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff) The file was modified
core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff) The file was modified
core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff) The file was modified
core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff) The file was modified
core/unit_test/CMakeLists.txt (diff) The file was removed core/unit_test/cuda/TestCuda_InterOp_StreamsMultiGPU.cpp The file was modified
core/src/Cuda/Kokkos_Cuda.hpp (diff) The file was modified
core/src/Cuda/Kokkos_Cuda_Instance.hpp (diff) The file was modified
core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff) The file was modified
core/src/HIP/Kokkos_HIP_Space.hpp (diff) The file was modified
core/src/Kokkos_Core.hpp (diff) The file was modified
core/src/Kokkos_Core_fwd.hpp (diff) The file was removed core/src/Kokkos_LogicalSpaces.hpp The file was removed core/unit_test/tools/TestLogicalSpaces.hpp The file was modified
core/src/Cuda/Kokkos_CudaSpace.hpp (diff) The file was modified
core/src/Kokkos_HBWSpace.hpp (diff) The file was modified
core/src/SYCL/Kokkos_SYCL_Space.hpp (diff) The file was modified
core/unit_test/CMakeLists.txt (diff) The file was modified
core/src/Kokkos_HostSpace.hpp (diff) The file was modified
core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff) The file was modified
core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff) The file was modified
core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff)
Commit
4d4a343e50bf4d5998efe20f97e8670c93bdf2dd
by tccleve Add warp sync for Cuda parallel reduce. compute-sanitizer --tool=racecheck discovered a potential race condition for Cuda parallel reductions (using range policy) where data was being updated on a single thread inside a warp, but the warp was not being synchronized before being read. The file was modified
core/src/Cuda/Kokkos_Cuda_Parallel_Range.hpp (diff) The file was modified
core/src/Cuda/Kokkos_Cuda_Parallel_Range.hpp (diff) The file was modified
tpls/desul/include/desul/atomics/Lock_Based_Fetch_Op_OpenACC.hpp (diff) The file was modified
tpls/desul/include/desul/atomics/Fetch_Op_OpenACC.hpp (diff) The file was modified
.jenkins (diff) The file was modified
tpls/desul/include/desul/atomics/Compare_Exchange_OpenACC.hpp (diff) The file was modified
algorithms/unit_tests/TestStdAlgorithmsExclusiveScan.cpp (diff) The file was modified
algorithms/unit_tests/TestStdAlgorithmsTransformInclusiveScan.cpp (diff) The file was modified
algorithms/unit_tests/TestStdAlgorithmsInclusiveScan.cpp (diff) The file was modified
algorithms/unit_tests/TestStdAlgorithmsTransformExclusiveScan.cpp (diff) The file was modified
core/unit_test/TestMathematicalFunctions.hpp (diff) The file was modified
containers/src/Kokkos_DualView.hpp (diff) The file was modified
.jenkins (diff) The file was modified
.jenkins (diff) The file was modified
core/src/Kokkos_HostSpace.hpp (diff)
Commit
843fca336a8bc056b6650310f6c5152f06e3b7e7
by noreply OpenMPTarget: clang extensions for dynamic shared memory. (#6380) * OpenMPTarget: clang extensions for dynamic shared memory. * OpenMPTarget: Rebase and use macros for pragma. * OpenMPTarget: Fix bug. Add Macros file. * OpenMPTarget: Fix resize_scratch. * OpenMPTarget: restore map clause. * OpenMPTarget: Fix comment. * OpenMPTarget: change L0 values in resize_scratch. * OpenMPTarget: Fix comment and spaces. --------- Co-authored-by: Rahulkumar Gayatri <rgayatri@lbl.gov> The file was modified
core/src/OpenMPTarget/Kokkos_OpenMPTarget_Parallel.hpp (diff) The file was modified
core/src/OpenMPTarget/Kokkos_OpenMPTarget_ParallelFor_Team.hpp (diff) The file was modified
core/src/OpenMPTarget/Kokkos_OpenMPTarget_Exec.cpp (diff) The file was added core/src/OpenMPTarget/Kokkos_OpenMPTarget_Macros.hpp The file was modified
core/src/OpenMPTarget/Kokkos_OpenMPTarget_Parallel_Common.hpp (diff) The file was modified
core/unit_test/TestMathematicalFunctions.hpp (diff) The file was modified
algorithms/unit_tests/TestStdAlgorithmsCommon.hpp (diff) The file was modified
core/src/Serial/Kokkos_Serial.hpp (diff) The file was modified
simd/unit_tests/include/TestSIMD_GeneratorCtors.hpp (diff) The file was modified
algorithms/unit_tests/TestStdAlgorithmsTeamRemoveIf.cpp (diff) The file was modified
algorithms/unit_tests/TestStdAlgorithmsTeamReplaceIf.cpp (diff) The file was modified
algorithms/unit_tests/TestStdAlgorithmsTeamPartitionCopy.cpp (diff) The file was modified
algorithms/unit_tests/TestStdAlgorithmsTeamCopyIf.cpp (diff) The file was modified
algorithms/unit_tests/TestStdAlgorithmsTeamIsPartitioned.cpp (diff) The file was modified
algorithms/unit_tests/TestStdAlgorithmsTeamReplaceCopyIf.cpp (diff) The file was modified
algorithms/unit_tests/TestStdAlgorithmsTeamPartitionPoint.cpp (diff) The file was modified
algorithms/unit_tests/TestStdAlgorithmsTeamRemoveCopyIf.cpp (diff) The file was modified
algorithms/unit_tests/TestStdAlgorithmsCommon.hpp (diff) The file was modified
simd/unit_tests/include/TestSIMD_GeneratorCtors.hpp (diff) The file was modified
core/unit_test/TestMathematicalFunctions.hpp (diff) The file was modified
core/src/Serial/Kokkos_Serial.hpp (diff)
Commit
ed08974c76ab00a28897c31b876dbd56d283d86d
by noreply Unit test for issue 3371 (negative vector length should not yield a negative max_team_size) (#6076) * #3371: Added TeamPolicy construction test * Check recommended team size * Move size_max and size_recommended tests to TeamPolicy constructor test * Define tag type outside of test body * Add missing overload * tests: remove spurious check * Remove redundant qualifier Co-authored-by: Damien L-G <dalg24+github@gmail.com> * Fix formatting --------- Co-authored-by: Cezary Skrzyński <cezary.skrzynski@ng-analytics.com> Co-authored-by: Cezary Skrzyński <czars1988@gmail.com> Co-authored-by: Damien L-G <dalg24+github@gmail.com> The file was modified
core/unit_test/TestTeamPolicyConstructors.hpp (diff) The file was added benchmarks/stream/CMakeLists.txt The file was modified
benchmarks/CMakeLists.txt (diff) The file was modified
core/src/OpenACC/Kokkos_OpenACC_ParallelReduce_Team.hpp (diff) The file was modified
core/src/OpenACC/Kokkos_OpenACC_ParallelReduce_Range.hpp (diff) The file was modified
core/src/OpenACC/Kokkos_OpenACC_Team.hpp (diff) The file was modified
core/src/OpenACC/Kokkos_OpenACC_ParallelReduce_MDRange.hpp (diff) The file was modified
core/src/Kokkos_Assert.hpp (diff) The file was modified
core/src/OpenMP/Kokkos_OpenMP_Instance.hpp (diff) The file was modified
generate_makefile.bash (diff) The file was modified
Makefile.kokkos (diff) The file was modified
cmake/KokkosCore_config.h.in (diff) The file was modified
core/src/HIP/Kokkos_HIP_Instance.hpp (diff) The file was modified
cmake/kokkos_arch.cmake (diff) The file was modified
core/src/impl/Kokkos_Utilities.hpp (diff) The file was modified
core/src/OpenACC/Kokkos_OpenACC_ParallelReduce_Team.hpp (diff) The file was modified
core/src/OpenACC/Kokkos_OpenACC_ParallelReduce_MDRange.hpp (diff) The file was modified
core/src/OpenACC/Kokkos_OpenACC_Team.hpp (diff) The file was modified
core/src/OpenACC/Kokkos_OpenACC_ParallelReduce_Range.hpp (diff) The file was modified
core/src/Kokkos_CopyViews.hpp (diff) The file was modified
core/unit_test/TestViewCopy_a.hpp (diff) The file was modified
core/src/Kokkos_CopyViews.hpp (diff) The file was modified
core/src/Kokkos_CopyViews.hpp (diff) The file was modified
simd/src/Kokkos_SIMD_AVX2.hpp (diff) The file was modified
core/src/Kokkos_Printf.hpp (diff) The file was modified
core/src/Kokkos_Printf.hpp (diff)
Commit
cbbe09b93db49e081ab4d400975aca8c007c4223
by noreply OpenMP: Use `omp_get_nested` for older gcc versions (#6685) * OpenMP: fix for issue 6670. * OpenMP: Update gcc version when using max_active_level. * OpenMP: Edit execute_in_serial. * Apply suggestions from code review Co-authored-by: Daniel Arndt <arndtd@ornl.gov> * clang-format --------- Co-authored-by: Rahulkumar Gayatri <rgayatri@lbl.gov> Co-authored-by: Damien L-G <dalg24+github@gmail.com> Co-authored-by: Daniel Arndt <arndtd@ornl.gov> The file was modified
core/src/OpenMP/Kokkos_OpenMP_Instance.hpp (diff) The file was modified
core/unit_test/TestMathematicalSpecialFunctions.hpp (diff) The file was modified
algorithms/src/Kokkos_Random.hpp (diff) The file was modified
core/src/Kokkos_Profiling_ProfileSection.hpp (diff) The file was modified
core/src/Kokkos_Profiling_ProfileSection.hpp (diff)
Commit
02b46c09c75697da5a1e4bfed5429a56c1bfb7ef
by noreply #5333: CUDA: Use scratch space appropriate to small reduction elements in Team reductions (#5334) * #5333: Add maybe failing test case? * Revise test to try to target the right code * NOMERGE #5333: Partial implementation of fix * #5333: Update tests * #5333: Fix comparison types in reducers test * #5333: Re-enable tests * #5333: Add test for Scalar with size of 1 * Fix formatting * Remove unrelated changes * Extract TeamPolicy tests into a separate method * Disable new tests (OpenACC, OpenMPTarget, Serial, SYCL, point_t) * Use smaller number of teams for bhalf_t * Use pointer_type for m_scratch_space parameter * Skip failing tests (SYCL) * Fix tests for OpenMPTarget * Fix num_teams_done variable in SYCL TeamPolicy reduction * Restore guards for array_reduce<float, 7> * Also make array_reduce<float, 7> work for SYCL * tests: change point_t default constructor --------- Co-authored-by: Arkadiusz Szczepkowicz <arek.szczepkowicz@ng-analytics.com> Co-authored-by: Cezary Skrzyński <cezary.skrzynski@ng-analytics.com> Co-authored-by: Daniel Arndt <arndtd@ornl.gov> The file was modified
core/src/SYCL/Kokkos_SYCL_Team.hpp (diff) The file was modified
core/src/SYCL/Kokkos_SYCL_ParallelReduce_Team.hpp (diff) The file was modified
core/unit_test/TestReducers.hpp (diff) The file was modified
core/unit_test/TestReducers_d.hpp (diff) The file was modified
core/src/Cuda/Kokkos_Cuda_Parallel_Team.hpp (diff) The file was modified
core/unit_test/TestNonTrivialScalarTypes.hpp (diff) The file was modified
core/src/Cuda/Kokkos_Cuda_ReduceScan.hpp (diff)
Commit
4078a0d8a022ab1ad2291554b2820916fa3bc361
by noreply Cuda: Allocate using the correct device (#6392) * Cuda: Allocate using the correct device * Avoid warning about uninitialized variable * exec_space_provided -> stream_sync_only * Fix up ASYNC support * Only check for errors in synchronization if allocation was successful * Rename arguments and make constructor private --------- Co-authored-by: Daniel Arndt <arndtd@ornl.com> The file was modified
core/src/Cuda/Kokkos_Cuda_Instance.hpp (diff) The file was modified
core/src/Cuda/Kokkos_CudaSpace.hpp (diff) The file was modified
core/src/Cuda/Kokkos_CudaSpace.cpp (diff) The file was modified
core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff)
Commit
f485cfa53cb2d25cab98e7a1d41a116027ec2363
by noreply Let `Profiling::ProfilingSection(std::string)` constructor be explicit and nodiscard (#6690) * Add nodiscard attribute to Profiling::ProfilingSection * Make Profiling::ProfilingSection constructor explicit The file was modified
core/src/Kokkos_Profiling_ProfileSection.hpp (diff) The file was modified
core/src/Kokkos_Profiling_ProfileSection.hpp (diff)
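Note on commit f485cfa5 above: the effect of marking a constructor `explicit` and the class `[[nodiscard]]` can be illustrated with a minimal sketch. `SectionSketch` and `section_name` are hypothetical names used only for illustration; this is not the Kokkos class and makes no claim about its members.

```cpp
#include <cassert>
#include <string>
#include <type_traits>
#include <utility>

// Hypothetical stand-in for a profiling section; this is NOT the Kokkos class.
// [[nodiscard]] on the class lets compilers warn when an object of this type
// is created or returned and then immediately discarded.
class [[nodiscard]] SectionSketch {
 public:
  // explicit: a std::string no longer converts to a section implicitly.
  explicit SectionSketch(std::string name) : m_name(std::move(name)) {}

  const std::string& name() const { return m_name; }

 private:
  std::string m_name;
};

// Direct construction from a std::string still works...
static_assert(std::is_constructible_v<SectionSketch, std::string>);
// ...but implicit conversion from std::string is rejected.
static_assert(!std::is_convertible_v<std::string, SectionSketch>);

// Illustrative helper: builds a named (non-discarded) section object and
// returns the name it stores.
inline std::string section_name(const std::string& label) {
  const SectionSketch section(label);
  assert(section.name() == label);
  return section.name();
}
```

With these two changes, code that passes a bare string where a section is expected stops compiling, and a section that is created without being bound to a name is likely to draw a compiler warning.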
Commit
654a51f60cef8900868f55a93661cee80f451cd9
by noreply GitHub CI: Test with AddressSanitizer (#6676) * GitHub CI: Test with AddressSanitizer * Disable view_allocation_error when running with AddressSanitizer * Try __attribute__((no_sanitize(address))) * Revert "Try __attribute__((no_sanitize(address)))" This reverts commit 335f7f26ae96034ceae221e5a80bd93b4769ed1d. The file was modified
.github/workflows/continuous-integration-workflow.yml (diff) The file was modified
core/unit_test/TestViewAPI_d.hpp (diff) The file was modified
core/src/Kokkos_Array.hpp (diff) The file was modified
core/unit_test/TestArray.cpp (diff) The file was added .github/workflows/windows.yml The file was modified
core/src/SYCL/Kokkos_SYCL_ParallelFor_Team.hpp (diff) The file was modified
core/src/setup/Kokkos_Setup_SYCL.hpp (diff) The file was modified
core/src/SYCL/Kokkos_SYCL_ParallelReduce_Team.hpp (diff) The file was modified
core/unit_test/TestMathematicalFunctions.hpp (diff) The file was modified
core/src/CMakeLists.txt (diff) The file was modified
cmake/kokkos_tpls.cmake (diff) The file was modified
core/unit_test/TestArray.cpp (diff)
Commit
3523bc3e789110445320af86e1feea6862b31b68
by noreply Enable `{transform_}exclusive_scan` in place (#6667) * Enable exclusive_scan in place * add test for inplace operation * improve tests * formatting * fix test for transform_* * simplify args * revise tests --------- Co-authored-by: Francesco Rizzi <fnrizzi@sandia.gov> The file was modified
algorithms/src/std_algorithms/impl/Kokkos_FunctorsForExclusiveScan.hpp (diff) The file was modified
algorithms/unit_tests/TestStdAlgorithmsTransformExclusiveScan.cpp (diff) The file was modified
algorithms/unit_tests/TestStdAlgorithmsTeamTransformExclusiveScan.cpp (diff) The file was modified
algorithms/unit_tests/TestStdAlgorithmsTeamExclusiveScan.cpp (diff) The file was modified
algorithms/unit_tests/TestStdAlgorithmsExclusiveScan.cpp (diff)
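Note on commit 3523bc3e above: an exclusive scan can run in place when each input element is read before its slot is overwritten. The following is a serial sketch of that idea only; `exclusive_scan_sketch` is a hypothetical helper and does not reflect how the Kokkos functors are written.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper, not part of Kokkos: serial exclusive scan that stays
// correct when `out` aliases `in` (the in-place case).
template <class T>
void exclusive_scan_sketch(const T* in, T* out, std::size_t n, T init) {
  assert(n == 0 || (in != nullptr && out != nullptr));
  T running = init;
  for (std::size_t i = 0; i < n; ++i) {
    const T current = in[i];  // read the input before the slot is overwritten
    out[i] = running;         // exclusive: sum of all elements before index i
    running = running + current;
  }
}
```

Writing `out[i]` before reading `in[i]` would lose the input value whenever both pointers refer to the same storage, which is the hazard an in-place scan has to avoid.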
Commit
27286c32d7e0c157d1a4d7f00f9441979ed8eabe
by noreply Add `ATOMICS_BYPASS` configuration option to disable atomics (#6692) * Add NAME_TBD_UNSAFE_ATOMICS configuration option to disable atomics * Rename configuration option and macro NAME_TBD_UNSAFE_ATOMICS -> ATOMICS_BYPASS * Improve option description and configuration time error message Co-Authored-By: Christian Trott <crtrott@sandia.gov> Co-Authored-By: Daniel Arndt <arndtd@ornl.gov> * Mention Kokkos_ENABLE_ATOMICS_BYPASS option in error message * Simplify sanity check for disabling atomics in <Kokkos_Macros.hpp> Co-Authored-By: Daniel Arndt <arndtd@ornl.gov> --------- Co-authored-by: Christian Trott <crtrott@sandia.gov> Co-authored-by: Daniel Arndt <arndtd@ornl.gov> The file was modified
cmake/kokkos_arch.cmake (diff) The file was modified
core/src/Kokkos_Atomics_Desul_Volatile_Wrapper.hpp (diff) The file was modified
cmake/KokkosCore_config.h.in (diff) The file was modified
cmake/kokkos_enable_options.cmake (diff) The file was modified
core/src/Kokkos_Atomics_Desul_Wrapper.hpp (diff) The file was modified
core/src/Kokkos_Macros.hpp (diff) The file was modified
core/src/Serial/Kokkos_Serial.cpp (diff)
Commit
0e4a158a7585f2a93b93d4121788016fe0dba4c9
by noreply Check matching static extents in View constructor (#5190) * Also check static extents * Use separate bools for the two checks and improve error messages * Improve error messages * Refactor test * Add another test * Extend existing tests * Use make_index_sequence * Remove LIVE and DIE * Try removing guards for OpenMPTarget * Use different style in view_construction_with_wrong_static_extents * Remove template template parameter * Also check unmanaged views * Fix DynRankView * Add FIXME * Pass label by const char * * fix terminating define * Deduce rank and dynamic_rank from View * Only call runtime_check_rank when KOKKOS_ENABLE_DEBUG_BOUNDS_CHECK is defined * Guard with if constexpr --------- Co-authored-by: Francesco Rizzi <fnrizzi@sandia.gov> The file was modified
core/unit_test/TestViewAPI.hpp (diff) The file was modified
core/src/Kokkos_View.hpp (diff) The file was modified
core/unit_test/TestViewCtorDimMatch.hpp (diff) The file was modified
containers/src/Kokkos_DynRankView.hpp (diff) The file was modified
core/src/impl/Kokkos_ViewArray.hpp (diff) The file was modified
core/unit_test/TestHalfOperators.hpp (diff) The file was modified
core/src/impl/Kokkos_Half_FloatingPointWrapper.hpp (diff) The file was modified
algorithms/unit_tests/TestStdAlgorithmsTransformInclusiveScan.cpp (diff) The file was modified
algorithms/unit_tests/TestStdAlgorithmsTeamTransformInclusiveScan.cpp (diff)
Commit
0ba8c40fc56c54bc0fd5808a23cf44e48749c187
by noreply Provide `kokkos_swap` as part of Core and deprecate `Experimental::swap` in Algorithms (#6697) * Move Kokkos_Swap.hpp header to core * Include only what we use (in <Kokkos_Swap.hpp>) * Rename header guards in <Kokkos_Swap.hpp> (because I can) * Drop useless comment * Fix noexcept specification of Kokkos::swap * Constexprify Kokkos::swap * Fixup type requirements in Kokkos::swap * Drop the inline (because I can) * fixup! Move Kokkos_Swap.hpp header to core * Promote swap to namespace Kokkos:: * Move and update Kokkos::swap unit test to core * Add support for swapping arrays * Fixup size_t * Provide Kokkos::swap overload for a few classes * Rename Kokkos::swap -> Kokkos::kokkos_swap * For backward compatibility provide deprecated Experimental::swap The file was modified
algorithms/src/std_algorithms/impl/Kokkos_Reverse.hpp (diff) The file was modified
algorithms/src/Kokkos_StdAlgorithms.hpp (diff) The file was modified
core/src/Kokkos_Pair.hpp (diff) The file was modified
algorithms/src/std_algorithms/impl/Kokkos_SwapRanges.hpp (diff) The file was modified
core/src/Kokkos_Core.hpp (diff) The file was added core/src/Kokkos_Swap.hpp The file was modified
core/unit_test/CMakeLists.txt (diff) The file was added core/unit_test/TestSwap.hpp The file was modified
algorithms/unit_tests/TestStdAlgorithmsModOps.cpp (diff) The file was removed algorithms/src/std_algorithms/Kokkos_Swap.hpp The file was modified
core/src/Kokkos_Array.hpp (diff) The file was modified
algorithms/src/sorting/impl/Kokkos_NestedSortImpl.hpp (diff) The file was modified
algorithms/src/std_algorithms/Kokkos_IterSwap.hpp (diff)
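Note on commit 0ba8c40f above: several bullets in that message (a conditional `noexcept` specification, `constexpr`, type requirements, array support) can be pictured with a plain C++17 sketch. The namespace `sketch` and the names `swap_sketch`, `swap_works_at_compile_time`, and `demo_runtime_swap` are hypothetical; the actual `Kokkos::kokkos_swap` signature and its device annotations are not reproduced here.

```cpp
#include <cassert>
#include <cstddef>
#include <type_traits>
#include <utility>

namespace sketch {

// Swap two objects; only participates in overload resolution when T is
// move-constructible and move-assignable, and is noexcept when both moves are.
template <class T>
constexpr std::enable_if_t<std::is_move_constructible_v<T> &&
                           std::is_move_assignable_v<T>>
swap_sketch(T& a, T& b) noexcept(std::is_nothrow_move_constructible_v<T> &&
                                 std::is_nothrow_move_assignable_v<T>) {
  T tmp = std::move(a);
  a = std::move(b);
  b = std::move(tmp);
}

// Swap two one-dimensional arrays element by element.
template <class T, std::size_t N>
constexpr void swap_sketch(T (&a)[N], T (&b)[N]) noexcept(
    std::is_nothrow_move_constructible_v<T> &&
    std::is_nothrow_move_assignable_v<T>) {
  for (std::size_t i = 0; i < N; ++i) swap_sketch(a[i], b[i]);
}

// Compile-time check that both overloads are usable in constant expressions.
constexpr bool swap_works_at_compile_time() {
  int x = 1, y = 2;
  swap_sketch(x, y);
  int p[2] = {1, 2}, q[2] = {3, 4};
  swap_sketch(p, q);
  return x == 2 && y == 1 && p[0] == 3 && p[1] == 4 && q[0] == 1 && q[1] == 2;
}
static_assert(swap_works_at_compile_time());

}  // namespace sketch

// Runtime usage; returns true once the two values have been exchanged.
inline bool demo_runtime_swap() {
  double a = 1.5, b = -2.0;
  sketch::swap_sketch(a, b);
  assert(a == -2.0 && b == 1.5);
  return a == -2.0 && b == 1.5;
}
```

The array overload is selected for built-in arrays because the first overload is removed by its type requirement: an array type is neither move-constructible nor move-assignable.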
Commit
89ba3fbae8ab2ae2dbfd30eef05c3bac387630a5
by noreply Provide new public headers `<Kokkos_Clamp.hpp>` and `<Kokkos_MinMax.hpp>` (#6687) * Promote min, max, clamp to public * Drop unnecessary <Kokkos_MinMaxClamp.hpp> header includes * Split Kokkos_MinMaxClamp.hpp into Kokkos_MinMax.hpp and Kokkos_Clamp.hpp --------- Co-authored-by: Damien L-G <dalg24@gmail.com> The file was modified
core/src/Kokkos_View.hpp (diff) The file was modified
core/src/Cuda/Kokkos_Cuda_Parallel_Range.hpp (diff) The file was added core/src/Kokkos_Clamp.hpp The file was removed core/src/Kokkos_MinMaxClamp.hpp The file was modified
core/src/HIP/Kokkos_HIP_TeamPolicyInternal.hpp (diff) The file was modified
core/src/Kokkos_Core.hpp (diff) The file was modified
core/src/Cuda/Kokkos_Cuda_Parallel_MDRange.hpp (diff) The file was modified
core/src/Cuda/Kokkos_Cuda_Parallel_Team.hpp (diff) The file was added core/src/Kokkos_MinMax.hpp
Commit
ee5cbfc25c81b9efc5b2a4abd82d9a9d5d3d07bd
by noreply Fix TeamThreadMDRange parallel_reduce (#6511) * Fix TeamThreadRangeMD parallel_reduce * Use vector_reduce for Cuda, HIP, and SYCL * Initialize reduction variable * Test with maximal vector length * TeamHandle::execution_space->typename TeamHandle::execution_space * Fix OpenMPTarget * Restore tests * reducer is unused * Workarounds for OpenMPTarget * Reduce * Fix OpenMPTarget tests * Restore tests * Strengthen test case for OpenMPTarget * Add some static_asserts The file was modified
core/src/OpenMPTarget/Kokkos_OpenMPTarget_Parallel.hpp (diff) The file was modified
core/src/Kokkos_ExecPolicy.hpp (diff) The file was modified
core/unit_test/TestTeamMDRange.hpp (diff) The file was modified
algorithms/unit_tests/TestStdAlgorithmsCommon.hpp (diff) The file was modified
algorithms/unit_tests/TestStdAlgorithmsInclusiveScan.cpp (diff) The file was modified
algorithms/unit_tests/TestStdAlgorithmsTeamInclusiveScan.cpp (diff) The file was modified
core/src/Cuda/Kokkos_Cuda.hpp (diff)
Commit
9393b358fb1300d15c41957c42a566f329bcac85
by noreply Don't use the compiler launcher script if the compile language is CUDA. (#6704) * Don't use the compiler launcher script if the compile language is CUDA. * Updated descriptive comment. The file was modified
cmake/KokkosConfig.cmake.in (diff)
Commit
c75d730d203077bf79439c9b3f4a33cbbc18c6a9
by noreply Deprecate `{Cuda,HIP}::detect_device_count()` and `Cuda::[detect_]device_arch()` (#6710) * CUDA/HIP: Inline getting device count get_device_count() -> int cannot be generic. Get rid of the extra indirection because it brings no benefit. * Get rid of CudaInternalDevices and cleanup Cuda::print_configuration() * Get rid of cuda_get_device_{count,properties}_wrapper() * Missed a few CudaInternalDevices and get rid of Cuda::detect_device_arch() * Get rid of Cuda::device_arch() * Fixup Co-authored-by: Bruno Turcksin <bruno.turcksin@gmail.com> * Don’t mess with Voodoo * Be more conservative and deprecate before removing * Clang-format for suggestion made on GH * Remove stray const qualifier * Forgot that device_arch() was static * Who Let the Bugs Out?? * The same thing we do every night, Pinky - try to take over the world! * this should fix it --------- Co-authored-by: Bruno Turcksin <bruno.turcksin@gmail.com> The file was modified
core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff) The file was modified
core/src/HIP/Kokkos_HIP_Instance.cpp (diff) The file was modified
core/src/Cuda/Kokkos_Cuda.hpp (diff) The file was modified
core/src/impl/Kokkos_Core.cpp (diff) The file was modified
core/src/Cuda/Kokkos_Cuda_Instance.hpp (diff) The file was modified
core/src/HIP/Kokkos_HIP.hpp (diff) The file was modified
core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff) The file was modified
core/src/Cuda/Kokkos_Cuda_Instance.hpp (diff) The file was modified
core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff) The file was modified
core/src/HIP/Kokkos_HIP_Instance.cpp (diff) The file was modified
core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff) The file was modified
core/src/HIP/Kokkos_HIP.cpp (diff) The file was modified
core/src/HIP/Kokkos_HIP_Instance.cpp (diff)
Commit
256c0ca62e56216ebd6351b3ed2c7bdacef8c654
by ndellin Kokkos_HIP.cpp: include Kokkos_Core.hpp to resolve errors Resolves errors of form: /home/jenkins/caraway-new/workspace/KokkosKernels_PullRequest_VEGA90A_ROCM560/kokkos/core/src/HIP/Kokkos_HIP.cpp:96:15: error: no member named 'hip_global_unique_token_locks' in namespace 'Kokkos::Impl' (void)Impl::hip_global_unique_token_locks(true); Note: include of Kokkos_HIP_UniqueToken.hpp was insufficient as this triggered new errors: /home/ndellin/kokkos/core/src/HIP/Kokkos_HIP_UniqueToken.hpp:40:29: error: implicit instantiation of undefined template 'Kokkos::View<unsigned int *, Kokkos::HIPSpace>' View<uint32_t*, HIPSpace> m_locks; The file was modified
core/src/HIP/Kokkos_HIP.cpp (diff)
Commit
179d2e67fd42eeb894cf9b5e976104a13de990c2
by noreply Add bound checks in RangePolicy and MDRangePolicy (#6617) * Added a bounds check in MDRangePolicy that checks that all lower bounds are less than its upper bound * Modified the wording on the abort * Converted the error msg from a stringstream to a string * Modified abort msg * Fixed the unit test output based on backend's default iterate direction * Update core/unit_test/TestMDRangePolicyConstructors.hpp Formatting. Co-authored-by: Damien L-G <dalg24+github@gmail.com> * Updated RangePolicy to have the same precondition as MDRangePolicy --------- Co-authored-by: Damien L-G <dalg24+github@gmail.com> The file was modified
core/src/Kokkos_ExecPolicy.hpp (diff) The file was modified
core/unit_test/TestMDRangePolicyConstructors.hpp (diff) The file was modified
core/src/KokkosExp_MDRangePolicy.hpp (diff) The file was modified
core/unit_test/TestRangePolicyConstructors.hpp (diff) The file was modified
.jenkins (diff)
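Note on commit 179d2e67 above: the precondition described there compares each lower bound of a range with its upper bound. A minimal sketch of such a check follows. It assumes that an empty range (lower equal to upper) is acceptable and that only lower > upper is rejected, which the commit message does not spell out; `bounds_are_ordered` and `iteration_count` are hypothetical helpers, not Kokkos functions, and the sketch returns a flag instead of aborting.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical helper, not the Kokkos implementation: true when no lower
// bound exceeds the upper bound of the same dimension.
template <std::size_t Rank>
constexpr bool bounds_are_ordered(
    const std::array<std::int64_t, Rank>& lower,
    const std::array<std::int64_t, Rank>& upper) {
  for (std::size_t d = 0; d < Rank; ++d) {
    if (lower[d] > upper[d]) return false;
  }
  return true;
}

// Number of index tuples in the half-open box [lower, upper); relies on the
// precondition above so that no extent is negative.
template <std::size_t Rank>
std::int64_t iteration_count(const std::array<std::int64_t, Rank>& lower,
                             const std::array<std::int64_t, Rank>& upper) {
  assert(bounds_are_ordered(lower, upper));
  std::int64_t count = 1;
  for (std::size_t d = 0; d < Rank; ++d) count *= upper[d] - lower[d];
  return count;
}
```

Checking the bounds at construction time turns a silently negative extent into an immediate, diagnosable failure.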
Commit
86f5bb7d892b74defdcde7da8866bb9a5d5afbd4
by noreply Let the smart pointer manage the CUDA/HIP stream (#6721) * Let the smart pointer manage the CUDA/HIP stream * Fixup do not null the stream in {Cuda/HIP}Internal::finalize() * Fixup hip not tested either * Not breaking HIP backend for 3rd time in a row Co-authored-by: Bruno Turcksin <bruno.turcksin@gmail.com> --------- Co-authored-by: Bruno Turcksin <bruno.turcksin@gmail.com> The file was modified
core/src/HIP/Kokkos_HIP.cpp (diff) The file was modified
core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff) The file was modified
core/src/HIP/Kokkos_HIP_Instance.hpp (diff) The file was modified
core/src/Cuda/Kokkos_Cuda_Instance.hpp (diff) The file was modified
core/src/HIP/Kokkos_HIP_Instance.cpp (diff) The file was modified
.jenkins (diff) The file was modified
core/unit_test/TestViewAPI.hpp (diff) The file was modified
core/src/impl/Kokkos_Error.hpp (diff) The file was modified
core/src/OpenACC/Kokkos_OpenACCSpace.cpp (diff) The file was modified
core/src/OpenACC/Kokkos_OpenACCSpace.cpp (diff) The file was modified
core/unit_test/TestViewAPI_d.hpp (diff)
Commit
6912b3998d4722c6c19db1a5d110e15d882d0ff5
by noreply Guard `[MD]RangePolicy` precondition check for deprecated code 4 (#6726) * Guard [MD]RangePolicy precondition check for deprecated code 4 * No good reason to use a raw string literal for the warning msg * Drop pointless inline specifier * Restore original behavior when deprecated code 4 is enabled * first blush * Fixup obviously wont display both message when aborting * Trust me, it's not badly written. It's just way above your head. * Add assertions for the old behavior * Fiddling with string comparison * Attempt to resolve death tests error msg matching issues * Fix that stupid regex The file was modified
core/src/Kokkos_ExecPolicy.hpp (diff) The file was modified
core/src/impl/Kokkos_Error.cpp (diff) The file was modified
core/unit_test/TestMDRangePolicyConstructors.hpp (diff) The file was modified
core/src/impl/Kokkos_Error.hpp (diff) The file was modified
core/unit_test/TestRangePolicyConstructors.hpp (diff) The file was modified
core/src/KokkosExp_MDRangePolicy.hpp (diff) The file was modified
cmake/kokkos_pick_cxx_std.cmake (diff) The file was modified
cmake/KokkosCore_config.h.in (diff) The file was modified
cmake/kokkos_test_cxx_std.cmake (diff) The file was modified
Makefile.kokkos (diff) The file was modified
core/src/impl/Kokkos_Core.cpp (diff) The file was modified
.jenkins (diff) The file was modified
core/unit_test/hip/TestHIP_Memory_Requirements.cpp (diff)
Commit
407e18dc8c652ff6d6c6c6796e2866891580ab4f
by noreply Use team_size_max to fix "Team size too large" error in reducer test (#6725) * Fix test to not use team size too large For some configurations, one of the reducer tests was setting a team size too large. * Remove team_size calculation from test not using it * Remove unused typedef * Use int in for-loop to avoid warning Co-authored-by: Damien L-G <dalg24+github@gmail.com> * Fix teamsize request for OPENMPTARGET * Cleanup test and match TestTeam.hpp Use 32 as a minimum team size like in TestTeam.hpp. Better match layout in the 3 tests here. --------- Co-authored-by: Damien L-G <dalg24+github@gmail.com> The file was modified
core/unit_test/TestReducers.hpp (diff)
Commit
5610068c5d908e94806ce8e9b7a8118ec959b1eb
by noreply Don't touch my records! (refactor Cuda/HIP/SYCL/Threads to not directly mess with `SharedAllocationRecord`) (#6732) * Do not use SharedAllocationRecord directly * Purge Cuda/HIP graph implementation from SharedAllocationRecord * MemorySpace::{free -> allocate} and FIXMEs for size argument * Properly cast allocated pointer from void* to size_type* * Fixup Cuda/HIP graph header includes and pointer casting * Fix the FIXMEs The file was modified
core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff) The file was modified
core/src/HIP/Kokkos_HIP_GraphNodeKernel.hpp (diff) The file was modified
core/src/HIP/Kokkos_HIP_Instance.cpp (diff) The file was modified
core/src/SYCL/Kokkos_SYCL_Instance.cpp (diff) The file was modified
core/src/Cuda/Kokkos_Cuda_GraphNodeKernel.hpp (diff) The file was modified
core/src/Threads/Kokkos_Threads_Instance.cpp (diff) The file was modified
core/src/HIP/Kokkos_HIP_Space.cpp (diff) The file was modified
core/src/OpenMPTarget/Kokkos_OpenMPTargetSpace.hpp (diff) The file was modified
core/src/impl/Kokkos_HBWSpace.cpp (diff) The file was modified
core/src/impl/Kokkos_SharedAlloc.hpp (diff) The file was modified
core/src/HIP/Kokkos_HIP_SharedAllocationRecord.cpp (diff) The file was modified
core/src/HIP/Kokkos_HIP_SharedAllocationRecord.hpp (diff) The file was modified
core/src/HIP/Kokkos_HIP_Space.hpp (diff) The file was modified
core/src/Cuda/Kokkos_CudaSpace.cpp (diff) The file was removed core/src/impl/Kokkos_MemorySpace.hpp The file was modified
core/src/Kokkos_HBWSpace.hpp (diff) The file was modified
core/src/impl/Kokkos_HostSpace.cpp (diff) The file was modified
core/src/Cuda/Kokkos_Cuda_UniqueToken.hpp (diff) The file was modified
core/src/OpenACC/Kokkos_OpenACC_SharedAllocationRecord.cpp (diff) The file was modified
Makefile.targets (diff) The file was modified
core/src/OpenACC/Kokkos_OpenACC_SharedAllocationRecord.hpp (diff) The file was modified
core/src/SYCL/Kokkos_SYCL_Space.hpp (diff) The file was modified
core/src/Kokkos_HostSpace.hpp (diff) The file was modified
core/src/impl/Kokkos_SharedAlloc_timpl.hpp (diff) The file was modified
core/src/SYCL/Kokkos_SYCL_Space.cpp (diff) The file was modified
core/src/HIP/Kokkos_HIP_UniqueToken.hpp (diff) The file was modified
core/src/HIP/Kokkos_HIP_GraphNodeKernel.hpp (diff) The file was removed core/src/impl/Kokkos_MemorySpace.cpp The file was modified
core/src/impl/Kokkos_SharedAlloc.cpp (diff) The file was modified
core/src/Cuda/Kokkos_CudaSpace.hpp (diff) The file was modified
core/src/OpenMPTarget/Kokkos_OpenMPTargetSpace.cpp (diff) The file was modified
core/src/OpenACC/Kokkos_OpenACCSpace.cpp (diff) The file was modified
CMakeLists.txt (diff) The file was modified
core/unit_test/TestTeamMDRange.hpp (diff)
Commit
2dc7cbcc9cebe83569a644ccf3a7c6e825e6ec4a
by noreply Cuda multi-GPU support: Allow execution space instance constructor to run (#6706) * Cuda multi-GPU support: Allow execution space instance constructor to run * Skip a test * Use cuda_stream/device also for UVM and HostPinned * Clean up * Revert test changes The file was modified
core/unit_test/CMakeLists.txt (diff) The file was modified
core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff) The file was added core/unit_test/cuda/TestCuda_InterOp_StreamsMultiGPU.cpp The file was modified
core/src/Cuda/Kokkos_Cuda_Parallel_MDRange.hpp (diff) The file was modified
core/src/Cuda/Kokkos_Cuda_Parallel_Team.hpp (diff)
Commit
d560c4719df6fcf664cdb5c98de29fdd159f27d0
by noreply Drop support for deprecated command-line arguments and environment variables (#6744) * Drop support for deprecated cmdline args and env vars * Deprecate num_devices and skip_device in InitializationSettings * Fixup rm now unused get_flag utility lambda The file was modified
core/src/impl/Kokkos_DeviceManagement.hpp (diff) The file was modified
core/src/impl/Kokkos_InitializationSettings.hpp (diff) The file was modified
core/unit_test/TestInitializationSettings.cpp (diff) The file was modified
core/src/impl/Kokkos_Core.cpp (diff) The file was modified
core/unit_test/TestParseCmdLineArgsAndEnvVars.cpp (diff)
Commit
650ac40677765a7a8e6951528489a5ff75e9da58
by noreply Avoid unnecessary zero-memset of the scratch flags in SYCL (#6739) * SYCL: Homogenize scratch_flags with CUDA and HIP * Add comments for CUDA and HIP * Fix typo [ci skip] --------- Co-authored-by: Daniel Arndt <arndtd@ornl.gov> The file was modified
core/src/Cuda/Kokkos_Cuda_Instance.cpp (diff) The file was modified
core/src/SYCL/Kokkos_SYCL_ParallelReduce_MDRange.hpp (diff) The file was modified
core/src/SYCL/Kokkos_SYCL_Instance.cpp (diff) The file was modified
core/src/SYCL/Kokkos_SYCL_ParallelReduce_Range.hpp (diff) The file was modified
core/src/SYCL/Kokkos_SYCL_ParallelScan_Range.hpp (diff) The file was modified
core/src/SYCL/Kokkos_SYCL_ParallelReduce_Team.hpp (diff) The file was modified
core/src/HIP/Kokkos_HIP_Instance.cpp (diff) The file was modified
algorithms/src/std_algorithms/impl/Kokkos_RemoveAllVariants.hpp (diff)