Compilation
- tests.compilation.get_compilation_output(*, source: Path, cwd: Path, arch: NVIDIAArch, cmake_file_api: FileAPI, object_file: bool = True, resource_usage: bool = False, ptx: bool = False) tuple[Path, str]View on GitHub
Compile the source in cwd for arch.
- Parameters:
object_file – Whether to compile into an object file.
- tests.compilation.get_cubin_name(*, compiler_id: str, file: Path, arch: NVIDIAArch, object_file: bool = False) strView on GitHub
When the compilation is not into an object file, the resulting file may contain:
more than one (usually 2) embedded CUDA binary files when using
nvcconly one when using
clang
For
nvcc, the first cubin is usually nearly empty.