Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Any CUDA primer you recommend in particular? I had this same question.


Here's an article on syncing in CUDA via cooperative groups: https://developer.nvidia.com/blog/cooperative-groups/

There's also explicit warp synchronization, i.e. __syncwarp(). More on warp primitives here: https://developer.nvidia.com/blog/using-cuda-warp-level-prim...


Probably https://www.youtube.com/watch?v=nOxKexn3iBo (or just skimming the attached colab).


This is terrific, thanks!




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: