[CUDA] 使用thrust::sort()函数排序

本文主要是介绍[CUDA] 使用thrust::sort()函数排序，希望对大家解决编程问题提供一定的参考价值，需要的开发者们随着小编来一起学习吧！

一、简介

本文介绍了如何使用thrust::sort()函数对device_vector<int>容器，或者int*指针表示的内存（显存）处的数据进行排序。

二、示例代码

1. 排序 `device_vector<int>`中的数据

main.cu文件内容：

#include <iostream>
#include <cuda_runtime.h>
#include <thrust/device_vector.h>
#include <thrust/device_ptr.h>
#include <thrust/host_vector.h>
#include <thrust/sort.h>
#include <algorithm>
#include <random>
#include <chrono>using namespace std;
int main()
{int min = 0, max = 100;random_device seed;                           // 硬件生成随机数种子ranlux48 engine(seed());                      // 利用种子生成随机数引擎uniform_int_distribution<> distrib(min, max); // 设置随机数范围，并为均匀分布int n = 10;thrust::host_vector<int> data_host(n);thrust::device_vector<int> data_device(n);/* 在host_vector上生成随机数 */for (int i = 0; i < n; i++){data_host[i] = distrib(engine);}std::cout << "Before sorting:\n";for (int i = 0; i < n; i++){std::cout << data_host[i] << ",";}std::cout << "\n";/* 在device端进行排序 */data_device = data_host;                              // 将host端的数据拷贝到device端thrust::sort(data_device.begin(), data_device.end()); // 进行排序data_host = data_device;                              // 将排序完成后的device端的数据拷贝回host端std::cout << "After sorting:\n";for (int i = 0; i < n; i++){std::cout << data_host[i] << ",";}std::cout << "\n";return 0;
}

CMakeLists.txt文件内容：

cmake_minimum_required(VERSION 3.8 FATAL_ERROR)
project(HelloWolrd LANGUAGES CXX CUDA)
set(CMAKE_CXX_STANDARD 14)
set(CMAKE_CXX_STANDARD_REQUIRED ON)
# CUDA设置
find_package(CUDA REQUIRED)
# 指定生成目标
cuda_add_executable(HelloWorld main.cu)

程序输出结果：

Before sorting:
85,73,8,0,77,68,93,31,85,76,
After sorting:
0,8,31,68,73,76,77,85,85,93,

2. 排序 `int*`指针中的数据

main.cu文件内容：

#include <iostream>
#include <cuda_runtime.h>
#include <thrust/device_vector.h>
#include <thrust/device_ptr.h>
#include <thrust/host_vector.h>
#include <thrust/sort.h>
#include <algorithm>
#include <random>
#include <chrono>using namespace std;
int main()
{int min = 0, max = 100;random_device seed;                           // 硬件生成随机数种子ranlux48 engine(seed());                      // 利用种子生成随机数引擎uniform_int_distribution<> distrib(min, max); // 设置随机数范围，并为均匀分布int n = 10;int *data_host = new int[n];/* 生成随机数 */for (int i = 0; i < n; i++){data_host[i] = distrib(engine);}std::cout << "Befor sorting:\n";for (int i = 0; i < n; i++){std::cout << data_host[i] << ",";}std::cout << "\n";    /*数据拷贝到device端*/int *data_device = nullptr;cudaMalloc((void **)&data_device, n * sizeof(int));cudaMemcpy(data_device, data_host, n * sizeof(int), cudaMemcpyHostToDevice);/*使用thrust::sort()排序*/// thrust::sort()只支持迭代器，因此需要将int*指针转为device_ptr<int>容器类型thrust::device_ptr<int> t_ptr(data_device);thrust::sort(t_ptr, t_ptr + n);/*数据拷贝回host端*/cudaMemcpy(data_host, data_device, n * sizeof(int), cudaMemcpyDeviceToHost);std::cout << "After sorting:\n";for (int i = 0; i < n; i++){std::cout << data_host[i] << ",";}std::cout << "\n";/*释放内存*/delete[] data_host;cudaFree(data_device);return 0;
}

CMakeLists.txt文件内容同上。
程序输出结果：

Befor sorting:
5,82,71,3,90,69,8,0,23,81,
After sorting:
0,3,5,8,23,69,71,81,82,90,

三、自定义排序

更多自定义排序请参考thrust官方代码仓库：NVIDIA/thrust。

这篇关于[CUDA] 使用thrust::sort()函数排序的文章就介绍到这儿，希望我们推荐的文章对编程师们有所帮助！

[CUDA] 使用thrust::sort()函数排序

一、简介

二、示例代码

1. 排序 `device_vector<int>`中的数据

2. 排序 `int*`指针中的数据

三、自定义排序

相关文章

vue使用docxtemplater导出word

Linux换行符的使用方法详解

使用Jackson进行JSON生成与解析的新手指南

使用Python实现快速搭建本地HTTP服务器

Elasticsearch 在 Java 中的使用教程

使用C#代码在PDF文档中添加、删除和替换图片

Kotlin 作用域函数apply、let、run、with、also使用指南

Java中List的contains()方法的使用小结

C#使用SQLite进行大数据量高效处理的代码示例

Android中Dialog的使用详解

[CUDA] 使用thrust::sort()函数排序

一、简介

二、示例代码

1. 排序 device_vector<int>中的数据

2. 排序 int*指针中的数据

三、自定义排序

相关文章

1. 排序 `device_vector<int>`中的数据

2. 排序 `int*`指针中的数据