If you’re delving into GPU computing with NVIDIA CUDA, understanding your hardware’s capabilities and interconnections is crucial. The CUDA samples provide an excellent starting point for this exploration. This guide will walk you through downloading these samples, compiling them, and then using them to assess your GPU’s performance and connectivity.
This guide will walk you through setting up a server or cluster for deep learning tasks, particularly for Large Language Models (LLMs). The content was originally documented in my wolai notes and is now shared here. Feel free to ask questions in the comments!