Feature
· Precision: 16bit INT, fixed-point unit
·
높은 model accuracy 유지 (low accuracy drop of 0.1 or less for GLUE tasks)
·
높은 정확도 유지를 위한 internal mixed precision (upto 32bit)
·
self-attention의 matrix multiplication 연산을 위한 2 output stationary systolic arrays (per single head core)
·
output stationary systolic arrarys의 streaming input/output을 매 cycle마다 produce/comsume 할 수 있는, pipelined softmax module
·
CORDIC algorithm-based exponential unit (using less BRAM than LUT-based)
·
기존의 CORDIC algorithm의 limited output range (-1~1) 문제를 해결하기 위한 extended CORDIC 적용
·
Freq : 215MHz
Business Area
Accelerator
Category
Arithmetic & Mathematic > Arithmetic & Logic Unit
Deliverables
· SW integration guide
Validation Status
· RTL simulation-based validation, FPGA-based validation, SW modeling & model accuracy evaluation