URL: https://dev.to/t/cuda
Cuda - DEV Community
TensorCircuit-NG vs cuQuantum on H200: JIT compilation beats the "magic GPU library" assumption
Shixin Zhang
Jun 7
#python #gpu #cuda
4 min read
Learn CUDA and GPU programming without owning a GPU
I Want To Learn Programming
Jun 9
#cuda #gpu #parallel
2 min read
Notes on CUDA Tensor Core GEMM (WMMA)
member_2e5ba30f
May 31
#cuda #gpu #cpp #performance
4 min read
Where Tensor-Parallel Inference Hits the NVLink Wall
member_2e5ba30f
May 31
#cuda #gpu #machinelearning #performance
2 min read
The Microsecond Lie: Why your Go timers are lying about the GPU
Eitamos Ring
May 23
#ai #programming #go #cuda
3 min read
Profiling a CUDA Python Program with GPUFlight
Myoungho Shin
May 22
#performance #python #cuda #gpu
10 min read
TensorRT `trt.Dims` SIGSEGV inside a GStreamer Python plugin: root cause and fix
Michał Warian
May 20
#tensorrt #gstreamer #python #cuda
4 min read
Calling CUDA from Go without cgo
Eitamos Ring
May 16
#ai #softwareengineering #go #cuda
1 reaction
2 min read
Why CUDA kernels silently corrupt memory and how to catch the bug
Alan West
May 12
#cuda #rust #debugging #gpu
5 min read
How I optimized a Solana vanity address grinder to 44M keys/sec on GPU
Anton
Apr 29
#cuda #solana #gpu #cryptocurrency
2 min read
From Black Magic to Science: The Evolution of the CUDA Optimization Skill
aa24aa
Apr 22
#cuda #agents #cutlass #triton
11 min read
Learning Resources Tech
cookie
Apr 22
#webdev #cuda #programming #beginners
1 min read
512MiB 512MB: the silent trtexec bug
Tushar Thokdar
Apr 12
#tensorrt #jetson #cuda #debugging
2 min read
Memory Coalescing: Same computation, 6x Performance Difference
Myoungho Shin
Apr 9
#cuda #gpu #aiops #cpp
6 min read
Setting Up NVIDIA Drivers and CUDA for ML/DL on Ubuntu 22.04
Abraham Audu
Apr 6
#nvidia #cuda #ubuntu #machinelearning
1 reaction
3 min read