File size: 3,377 Bytes
e45d058
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
cff-version: 1.2.0
title: CUTLASS
message: >-
  If you use this software, please cite using the

  following metadata.

type: software
authors:

  - given-names: Vijay
    family-names: Thakkar
    email: [email protected]
    affiliation: NVIDIA
  - given-names: Pradeep
    family-names: Ramani
    email: [email protected]
    affiliation: NVIDIA
  - given-names: Cris
    family-names: Cecka
    email: [email protected]
    affiliation: NVIDIA
  - given-names: Aniket
    family-names: Shivam
    email: [email protected]
    affiliation: NVIDIA
  - given-names: Honghao
    family-names: Lu
    email: [email protected]
    affiliation: NVIDIA
  - given-names: Ethan
    family-names: Yan
    email: [email protected]
    affiliation: NVIDIA
  - given-names: Jack
    family-names: Kosaian
    email: [email protected]
    affiliation: NVIDIA
  - given-names: Mark
    family-names: Hoemmen
    email: [email protected]
    affiliation: NVIDIA
  - given-names: Haicheng
    family-names: Wu
    email: [email protected]
    affiliation: NVIDIA
  - given-names: Andrew
    family-names: Kerr
    email: [email protected]
    affiliation: NVIDIA
  - given-names: Matt
    family-names: Nicely
    email: [email protected]
    affiliation: NVIDIA
  - given-names: Duane
    family-names: Merrill
    email: [email protected]
    affiliation: NVIDIA
  - given-names: Dustyn
    family-names: Blasig
    email: [email protected]
    affiliation: NVIDIA
  - given-names: Fengqi
    family-names: Qiao
    email: [email protected]
    affiliation: NVIDIA
  - given-names: Piotr
    family-names: Majcher
    email: [email protected]
    affiliation: NVIDIA
  - given-names: Paul
    family-names: Springer
    email: [email protected]
    affiliation: NVIDIA
  - given-names: Markus
    family-names: Hohnerbach
    affiliation: NVIDIA
    email: [email protected]
  - given-names: Jin
    family-names: Wang
    email: [email protected]
    affiliation: NVIDIA
  - given-names: Manish
    family-names: Gupta
    affiliation: Google
    email: [email protected]


repository-code: 'https://github.com/NVIDIA/cutlass'
abstract: >-
  CUTLASS is a collection of CUDA C++ template

  abstractions for implementing high-performance

  matrix-multiplication (GEMM) and related

  computations at all levels and scales within CUDA.

  It incorporates strategies for hierarchical

  decomposition and data movement similar to those

  used to implement cuBLAS and cuDNN. CUTLASS

  decomposes these "moving parts" into reusable,

  modular software components abstracted by C++

  template classes. These thread-wide, warp-wide,

  block-wide, and device-wide primitives can be

  specialized and tuned via custom tiling sizes, data

  types, and other algorithmic policy. The resulting

  flexibility simplifies their use as building blocks

  within custom kernels and applications.

keywords:

  - 'cutlass, tensor cores, cuda, cute, nvidia, gpu, linear algebra, matrix computations'
license: BSD-3-Clause
license-url: https://github.com/NVIDIA/cutlass/blob/v3.0.0/LICENSE.txt
version: '3.0.0'
date-released: '2023-01-23'
identifiers:

  - type: url
    value: "https://github.com/NVIDIA/cutlass/tree/v3.0.0"
    description: The GitHub release URL of tag 3.0.0