1/8/2024 0 Comments Cmake ubuntu packageMore information about the cache can be found here. If the compiler changes, or the user modifies the kernels then the cache must be deleted for the MIOpen version in use e.g., rm -rf ~/.cache/miopen/. Users can also disable the cache during runtime using the environmental variable set as MIOPEN_DISABLE_CACHE=1. Users can change the location of the cache directory during configuration using the flag -DMIOPEN_CACHE_DIR=. In the cache directory there exists a directory for each version of MIOpen. MIOpen by default caches the device programs in the location ~/.cache/miopen/. More information about the performance database can be found here. ĭatabase paths can be explicitly customized by means of MIOPEN_SYSTEM_DB_PATH (System PerfDb) and MIOPEN_USER_DB_PATH (User PerfDb) cmake variables. To install the kernels package for your GPU architecture, use the following command:Ĭmake -DMIOPEN_BACKEND=OpenCL -DBUILD_DEV=On. Precompiled kernels do not reduce startup time on subsequent runs. Note that all compiled kernels are locally cached in the folder $HOME/.cache/miopen/, so precompiled kernels reduce the startup latency only for the first execution of a neural network. These precompiled kernels comprise a select set of popular input configurations and will expand in future release to contain additional coverage. MIOpen provides an optional pre-compiled kernels package to reduce the startup latency. If a different backend other than what currently exists on the system is desired, please uninstall the existing backend completely and then install the new backend. MIOpen can be installed on Ubuntu using apt-get.įor OpenCL backend: apt-get install miopen-openclįor HIP backend: apt-get install miopen-hipĬurrently both the backends cannot be installed on the same system simultaneously. Installing MIOpen with pre-built packages Composable Kernel - C++ templated device library for GEMM-like and reduction-like operators.MLIR - (Multi-Level Intermediate Representation) with its MIOpen dialect to support and complement kernel development.Minimum version branch for post-ROCm 3.5 master-rocm-3.5.Minimum version branch for pre-ROCm 3.5 master-rocm-2.10.rocBLAS - AMD library for Basic Linear Algebra Subprograms (BLAS) on the ROCm platform.MIOpenTENSILE - users can enable this library using the cmake configuration flag -DMIOPEN_USE_MIOPENTENSILE=On.lbzip2 - multi-threaded compress or decompress utility.SQLite3 - reading and writing performance database.Version 1.79 is recommended, older version may need patches to work on newer systems, e.g.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |