Document Details


1805.02867v2.pdf
Download View Text Delete
Clip: arXiv:1805.02867v2 [cs.PF] 28 Jul 2018 Online normalizer calculation for softmax Maxim Milakov NVIDIA mmilakov@nvidia.com Natalia Gimelshein NVIDIA ngimelshein@nvidia.com Abstract The Softmax function is ubiquitous in machine learning, multiple previous works suggested faster alternatives for it. In this paper we propose a way to compute classical Softmax with fewer memory accesses and hypothesize that this reduction in memory accesses should improve Softmax performance on actual hardware. The benchmarks confirm this hypothesis: Softmax accelerates by up to1.3x and Softmax+TopK combined and fused by up to5x.
Filename: 1805.02867v2.pdf
Filetype: application/pdf
Size: 147347 bytes
Uploaded On: 2025-10-24
Abstract:
Summary:
Tags:
Notes:
Visible: 1
Status: Parsed
Producer: dvips + GPL Ghostscript GIT PRERELEASE 9.22
CreationDate: 2018-07-30T21:05:21-04:00
ModDate: 2018-07-30T21:05:21-04:00
Creator: LaTeX with hyperref package
Title:
Subject:
Author:
Keywords:
Pages: 9

Return to Document Library