文章索引
CUDA Learning Path
You can start by reading this blog for a simple introduction: Introductory Blog
1.0 Parallel Computing and Computer Architecture
2018-02-14 | CUDA , Freshman | 0 |
1.1 Heterogeneous Computing and CUDA
2018-02-15 | CUDA , Freshman | 0 |
2.0 CUDA Programming Model Overview (Part I)
2018-02-15 | CUDA , Freshman | 0 |
2.1 CUDA Programming Model Overview (Part II)
2018-02-16 | CUDA , Freshman | 0 |
2.2 Timing Kernel Functions
2018-03-08 | CUDA , Freshman | 0 |
2.3 Organizing Parallel Threads
2018-03-09 | CUDA , Freshman | 0 |
2.4 GPU Device Information
2018-03-10 | CUDA , Freshman | 0 |
3.1 CUDA Execution Model Overview
2018-03-12 | CUDA , Freshman | 0 |
3.2 Understanding the Essence of Warp Execution (Part I)
2018-03-14 | CUDA , Freshman | 0 |
3.2 Understanding the Essence of Warp Execution (Part II)
2018-03-15 | CUDA , Freshman | 0 |
3.3 Parallelism and Performance
Published 2018-04-15 | Category 0 | Read count:
3.4 Avoiding Branch Divergence
| CUDA | Freshman |
3.5 Loop Unrolling
| CUDA | Freshman |
3.6 Dynamic Parallelism
| CUDA | Freshman |
4.0 Global Memory
| CUDA | Freshman |
4.1 Memory Model Overview
| CUDA | Freshman |
4.2 Memory Management
Published on 2018-05-01 | Category 0 | Views:
4.3 Memory Access Patterns
Published on 2018-05-03 | Category 0 | Views:
4.4 Achievable Kernel Bandwidth
Published on 2018-05-13 | Category 0 | Views:
4.5 Vector Addition with Unified Memory
Published on 2018-05-14 | Category 0 | Views:
5.0 Shared Memory and Constant Memory
Published on 2018-06-01 | Category 0 | Views:
5.1 CUDA Shared Memory Overview
Published on 2018-06-01 | Category 0 | Views:
5.2 Data Layout in Shared Memory
Published on 2018-06-02 | Category 0 | Views:
5.3 Reducing Global Memory Accesses
Published on 2018-06-04 | Category 0 | Views:
5.4 Coalesced Global Memory Access
Published on 2018-06-04 | Category 0 | Views:
5.5 Constant Memory
Published on 2018-06-06 | Category 0 | Views:
5.6 Warp Shuffle Instructions
Published on 2018-06-06 | Category 0 | Views:
6.0 Streams and Concurrency
Published on 2018-06-10 | Category 0 | Views:
6.1 Stream and Event Overview
Published on 2018-06-10 | Category 0 | Views:
6.2 Concurrent Kernel Execution
Published on 2018-06-18 | Category 0 | Views:
6.3 Overlapping Kernel Execution and Data Transfer
Published on 2018-06-20 | Category 0 | Views:
6.4 Overlapping GPU and CPU Execution
Published on 2018-06-20 | Category 0 | Views:
6.5 Stream Callbacks
Published on 2018-06-20 | Category 0 | Views: