Cerebras AI Day Deck :: A closer look at the world’s fastest AI Chip
deniztortop
1,023 views
147 slides
Mar 21, 2024
Slide 1 of 147
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
About This Presentation
At Cerebras AI Day, we unveiled the next chapter of the Cerebras AI platform, new state-of-the-art AI models, and our latest AI supercomputers.
> Cerebras announces CS-3, the world’s fastest AI Chip with a whopping 4 trillion transistors
> Cerebras selects Qualcomm to deliver unprecedented p...
At Cerebras AI Day, we unveiled the next chapter of the Cerebras AI platform, new state-of-the-art AI models, and our latest AI supercomputers.
> Cerebras announces CS-3, the world’s fastest AI Chip with a whopping 4 trillion transistors
> Cerebras selects Qualcomm to deliver unprecedented performance in AI Inference
> Cerebras and G42 break ground on Condor Galaxy 3, an 8 exaFLOPs AI Supercomputer
Cerebras AI Applications
& Research Panel
Praneetha Elugunti
Mayo Clinic
Jim Culver
GSK
Tim Bishop
Mayo Clinic
Irinia Rish
University of Montreal
Andy Hock
Cerebras
Cerebras x
Qualcomm
Fireside Chat with
Rashid Attar, VP of Cloud Computing,
Qualcomm
Cerebras xQualcomm Technology Partnership
ReducingInference Cost by 10x
Cerebras CS-3
AI Training
Qualcomm Cloud AI100 Ultra
AI Inference
Jointly optimized software stack for
cost efficient LLMs
Cerebras Stack Qualcomm Stack
Sparse trainingSparse inference
Train in FP16Compile & run in MX6
Train large + small modelsApply speculative decoding
Network Architecture
Search
Compile & run on Ultra AI 100
Cerebras x G42
Fireside Chat with
Kiril Evtimov, Group CTO G42 & CEO
Core42
G42 across the Entire AI Value Chain
Customer &
Industry Tailored
Solutions
Data
Centers
Compute
Infrastructure
Cloud
Platforms
AI Model
Development
Cloud &
Enterprise AI
Deployment
Application
Development
476B Arabic tokens
1.63T Total tokens
The world’s largest
open-source Arabic LLM30B parameter, bilingual
Arabic-English model
Trained on the
Condor Galaxy 1 and 2
AI Supercomputer