- What exactly is an XPU? - Artificial Intelligence Stack Exchange
XPU is a device abstraction for Intel heterogeneous computation architectures, which can be mapped to CPU, GPU, FPGA and other accelerators The "X" from XPU is just like a variable, like in maths, so you can do X=C and you get CPU accceleration, or X=G and you get GPU acceleration
- What exactly happens in gradient clipping by norm?
Gradient clipping is a technique that tackles exploding gradients The idea of gradient clipping is very simple: If the gradient gets too large, we rescale it to keep it small
|