A quick snapshot capturing a few of the highlights from an article written by our company intern (Alex) and published on Towards Data Science. The article goes into depth on how we achieve a 1700x speedup on the k-means algorithm written for our proprietary AI model.
<aside> π Article: bit.ly/cuda-kmeans
</aside>
Batched K-Means with Python Numba and CUDA C
<aside> πββοΈ Bonus Tip: donβt over-utilize the synchthread() function to avoid deadlocks
</aside>
<aside> π Link to this page: bit.ly/fast-gpu
</aside>