FastDeploy/custom_ops/gpu_ops/fp8_deep_gemm

DeepGEMM

DeepGEMM Installation Procedure

Installation

First, install the custom operators. Make sure CUTLASS has already been cloned with git into custom_ops/third_party/cutlass.

Then install deep_gemm:
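As a quick sanity check before building, the sketch below tests whether a CUTLASS checkout appears to be present at the path named above. The helper name `check_cutlass` and its optional repository-root argument are hypothetical and not part of FastDeploy. It relies only on the fact that the CUTLASS repository keeps its headers under `include/`.

```shell
# Hypothetical helper (not part of FastDeploy): succeeds if a CUTLASS checkout
# appears to exist under the given repository root (default: current directory).
check_cutlass() {
    root="${1:-.}"
    dir="$root/custom_ops/third_party/cutlass"
    # CUTLASS ships its headers under include/; a missing include/ directory
    # suggests the repository was never cloned into this location.
    if [ -d "$dir/include" ]; then
        echo "CUTLASS found at $dir"
    else
        echo "CUTLASS not found at $dir; clone it before building" >&2
        return 1
    fi
}
```

Calling `check_cutlass` from the directory that contains custom_ops, or passing that directory as the first argument, returns a non-zero status when the checkout is missing.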

# Make symbolic links for third-party (CUTLASS and CuTe) include directories
python setup.py develop

# Add the project path to PYTHONPATH
export PYTHONPATH=$(pwd):$PYTHONPATH

# or install directly
python setup.py install
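
After either install path, it can be useful to confirm that the interpreter can actually locate the package. The following is a minimal sketch, assuming the installed module is named `deep_gemm` (taken from the install step above). The helper `module_importable` is hypothetical. It uses only the standard-library `importlib.util.find_spec`, so it looks the module up without importing it.

```shell
# Hypothetical helper: succeeds if the named Python module can be located
# by the first python/python3 interpreter found on PATH.
module_importable() {
    # Return 2 if no interpreter is available at all.
    py=$(command -v python || command -v python3) || return 2
    # find_spec returns None for a top-level module that cannot be found.
    "$py" -c 'import importlib.util, sys; sys.exit(0 if importlib.util.find_spec(sys.argv[1]) else 1)' "$1"
}
```

For example, `module_importable deep_gemm && echo "deep_gemm is visible"` prints the message only when the lookup succeeds. With the develop-mode install, this depends on PYTHONPATH being exported in the same shell.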

Test

# Test all GEMM implementations (normal, contiguous-grouped and masked-grouped)
python tests/test_core.py