Faster and Memory-Efficient PyTorch models using AMP and Tensor Cores | by Rahul Agarwal | Towards Data Science
Accelerating Inference Up to 6x Faster in PyTorch with Torch-TensorRT | NVIDIA Technical Blog
Video Series: Mixed-Precision Training Techniques Using Tensor Cores for Deep Learning | NVIDIA Technical Blog
Tensor Cores and mixed precision *matrix multiplication* - output in float32 - PyTorch Forums
Understanding Tensor Cores
Pytorch Tutorial from Basic to Advance Level: A NumPy replacement and Deep Learning Framework that provides maximum flexibility with speed | by Kunal Bhashkar | Medium
PyTorch 1.0 preview (Dec 6, 2018) packages with full CUDA 10 support for your Ubuntu 18.04 x86_64 systems. - vxlabs
Types oNVIDIA GPU Architectures For Deep Learning
Understanding Tensor Cores
Deep Tensorized Learning — TensorLy-Torch 0.4.0 documentation
Accelerating AI Training with NVIDIA TF32 Tensor Cores | NVIDIA Technical Blog