aelitta's picture
Upload folder using huggingface_hub
4bdb245 verified
Class Documentation and C++ Reference
========
This section provides a breakdown of the cpp classes and what each of their functions provide. It is partially generated and augomented from the Doxygen autodoc content. You can also go directly to the `raw doxygen docs <../doxygen/annotated.html>`_.
Below is a diagram that provides insights on the relationship between Kompute objects and Vulkan SDK resources, which primarily encompass ownership of either CPU and/or GPU memory.
.. image:: ../images/kompute-vulkan-architecture.jpg
:width: 100%
Manager
-------
The Kompute Manager provides a high level interface to simplify interaction with underlying :class:`kp::Sequences` of :class:`kp::Operations`.
.. image:: ../images/kompute-vulkan-architecture-manager.jpg
:width: 100%
.. doxygenclass:: kp::Manager
:members:
Sequence
-------
The Kompute Sequence consists of batches of :class:`kp::Operations`, which are executed on a respective GPU queue. The execution of sequences can be synchronous or asynchronous, and it can be coordinated through its respective vk::Fence.
.. image:: ../images/kompute-vulkan-architecture-sequence.jpg
:width: 100%
.. doxygenclass:: kp::Sequence
:members:
Tensor
-------
The :class:`kp::Tensor` is the atomic unit in Kompute, and it is used primarily for handling Host and GPU Device data.
.. image:: ../images/kompute-vulkan-architecture-tensor.jpg
:width: 100%
.. doxygenclass:: kp::Tensor
:members:
Algorithm
-------
The :class:`kp::Algorithm` consists primarily of the components required for shader code execution, including the relevant vk::DescriptorSet relatedresources as well as vk::Pipeline and all the relevant Vulkan SDK resources as outlined in the architectural diagram.
.. image:: ../images/kompute-vulkan-architecture-algorithm.jpg
:width: 100%
.. doxygenclass:: kp::Algorithm
:members:
OpBase
-------
The :class:`kp::OpBase` provides a top level class for an operation in Kompute, which is the step that is executed on a GPU submission. The Kompute operations can consist of one or more :class:`kp::Tensor`.
.. image:: ../images/kompute-vulkan-architecture-operations.jpg
:width: 100%
.. doxygenclass:: kp::OpBase
:members:
OpAlgoDispatch
-------
The `vk::OpAlgoDispatch` extends the `vk::OpBase` class, and provides the base for shader-based operations. Besides of consisting of one or more `vk::Tensor` as per the `vk::OpBase`, it also contains a unique `vk::Algorithm`.
.. image:: ../images/kompute-vulkan-architecture-opmult.jpg
:width: 100%
.. doxygenclass:: kp::OpAlgoDispatch
:members:
OpMult
-------
The :class:`kp::OpMult` operation is a sample implementation of the :class:`kp::OpAlgoBase` class. This class shows how it is possible to create a custom vk::OpAlgoBase that can compile as part of the binary. The :class:`kp::OpMult` operation uses the shader-to-cpp-header-file script to convert the script into cpp header files.
.. image:: ../images/kompute-vulkan-architecture-opmult.jpg
:width: 100%
.. doxygenclass:: kp::OpMult
:members:
OpTensorCopy
-------
The :class:`kp::OpTensorCopy` is a tensor only operation that copies the GPU memory buffer data from one :class:`kp::Tensor` to one or more subsequent tensors.
.. doxygenclass:: kp::OpTensorCopy
:members:
OpTensorSyncLocal
-------
The :class:`kp::OpTensorSyncLocal` is a tensor only operation that maps the data from the GPU device memory into the local host vector.
.. doxygenclass:: kp::OpTensorSyncLocal
:members:
OpTensorSyncDevice
-------
The :class:`kp::OpTensorSyncDevice` is a tensor only operation that maps the data from the local host vector into the GPU device memory.
.. doxygenclass:: kp::OpTensorSyncDevice
:members:
OpMemoryBarrier
-------
The :class:`kp::OpMemoryBarrier` is a tensor only operation which adds memory barriers to the tensors provided with the access and stage masks provided.
.. doxygenclass:: kp::OpTensorSyncDevice
:members: