1. 02 Nov, 2020 3 commits
    • Bryce Adelstein Lelbach aka wash's avatar
      Retune some kernels. · a3d89dcc
      Bryce Adelstein Lelbach aka wash authored
      Retune radix sort, run length encoding, reduce by key, scan, select if, and
      histogram for SM70 and SM80.
      a3d89dcc
    • Andrew Corrigan's avatar
    • Allison Vacanti's avatar
      Update DeviceScan to pass Thrust's scan tests. · 7e6f33b1
      Allison Vacanti authored
      Thrust will be switching to `cub::DeviceScan` to replace its custom scan
      implementation. This patch addresses some issues found by the Thrust
      tests:
      
      - Initialize unused `BlockLoad` items with values known to be in the input
        set. This fixes the `TestInclusiveScanWithIndirection` Thrust test by
        keeping the `plus_mod3` functor indices valid.
      - Use `OffsetT` instead of `int` to hold indicies in `AgentScan`. This
        fixes the `Test*ScanWithBigIndexes` Thrust tests by not truncating
        the input problem size.
      - Use `BLOCK_[STORE|LOAD]_WARP_TRANSPOSED_TIMESLICED` instead of
        `BLOCK_[STORE|LOAD]_WARP_TRANSPOSED` when the intermediate type is
        larger than 128 bytes. This keeps shared memory buffers from growing
        too large in the `TestScanWithLargeTypes` Thrust test.
      7e6f33b1
  2. 30 Oct, 2020 1 commit
  3. 19 Oct, 2020 1 commit
    • Allison Vacanti's avatar
      Update CUB CMake to match recent changes to Thrust. · ea48955f
      Allison Vacanti authored
      Specifically, this ports the following PRs to CUB:
      
      - NVIDIA/thrust#1297 Add install rule option for add_subdirectory users.
      - NVIDIA/thrust#1298 Enforce semantic versioning in CMake version configs.
      - NVIDIA/thrust#1300 Use FindPackageHandleStandardArgs in CMake config.
      ea48955f
  4. 14 Oct, 2020 1 commit
  5. 05 Oct, 2020 1 commit
  6. 25 Sep, 2020 3 commits
  7. 23 Sep, 2020 7 commits
  8. 21 Sep, 2020 3 commits
  9. 16 Sep, 2020 1 commit
  10. 09 Sep, 2020 1 commit
  11. 17 Aug, 2020 2 commits
  12. 12 Aug, 2020 2 commits
  13. 06 Aug, 2020 1 commit
  14. 28 Jul, 2020 1 commit
  15. 17 Jul, 2020 1 commit
  16. 16 Jul, 2020 1 commit
  17. 14 Jul, 2020 1 commit
  18. 11 Jul, 2020 2 commits
  19. 09 Jul, 2020 1 commit
    • Allison Vacanti's avatar
      CMake update and associated warning fixes. · 620de2a1
      Allison Vacanti authored
      - Deprecation warnings are disabled due to use of deprecated CUDA APIs
        such as cudaBindTexture. Filed NVlabs/cub#191 to track this.
      - Bump CMake minimum version to 3.15.
      - Remove workarounds for old cmake/compilers.
      - Switch to CUB_ENABLE_DIALECT_CPPXX instead of CMAKE_CXX_STANDARD.
        - Like Thrust multiconfig, all can be enabled at once.
        - Uses Thrust's settings when Thrust multiconfig is enabled.
      - Add CUB_ENABLE_* options to disable portions of the build.
        - HEADER_TESTING, TESTING, EXAMPLES
      - Add sm_37, sm_80, remove sm_30, sm_32.
        - When building CUB in Thrust, just use thrust's flags.
      - Sanitize variable names in AppendOptionIfAvailable.
        - GCC was failing to detect `-Werror`. Long story short, this fixes it.
      - Add cub.compiler_interface target to store warning suppressions, etc.
      - Find and use Thrust. We should move the Thrust code into Thrust.
      - Check for THOROUGH, QUICK and QUICKER variants of tests, create new
        targets for them when found.
      - Rem...
      620de2a1
  20. 03 Jul, 2020 2 commits
  21. 23 Jun, 2020 1 commit
  22. 10 Jun, 2020 2 commits
  23. 27 May, 2020 1 commit
    • Hugh Winkler's avatar
      Use placement new to construct item on uninitialized memory. · 5217bb11
      Hugh Winkler authored
      Formerly used the assignment operator to copy to uninitialized
      memory. But a non-trivial assignment operator requires the destination
      object be in a valid state. So use placement new to construct the item
      on the uninitialized bits.
      
      Partial fix for thrust issue 1153
      Also a similar fix in thrust.
      5217bb11