
Commit 263dec2

Update docs with NVAIE messaging (#6162)
1 parent 0529476 commit 263dec2

File tree

3 files changed: +62 −9 lines changed

README.md (+10 −7)
@@ -38,13 +38,16 @@ and corresponds to the 23.07 container release on

 ----
 Triton Inference Server is an open source inference serving software that
-streamlines AI inferencing. Triton enables teams to deploy any AI model from
-multiple deep learning and machine learning frameworks, including TensorRT,
-TensorFlow, PyTorch, ONNX, OpenVINO, Python, RAPIDS FIL, and more. Triton
-supports inference across cloud, data center,edge and embedded devices on NVIDIA
-GPUs, x86 and ARM CPU, or AWS Inferentia. Triton delivers optimized performance
-for many query types, including real time, batched, ensembles and audio/video
-streaming.
+streamlines AI inferencing. Triton enables teams to deploy any AI model from
+multiple deep learning and machine learning frameworks, including TensorRT,
+TensorFlow, PyTorch, ONNX, OpenVINO, Python, RAPIDS FIL, and more. Triton
+Inference Server supports inference across cloud, data center, edge and embedded
+devices on NVIDIA GPUs, x86 and ARM CPU, or AWS Inferentia. Triton Inference
+Server delivers optimized performance for many query types, including real time,
+batched, ensembles and audio/video streaming. Triton Inference Server is part of
+[NVIDIA AI Enterprise](https://www.nvidia.com/en-us/data-center/products/ai-enterprise/),
+a software platform that accelerates the data science pipeline and streamlines
+the development and deployment of production AI.

 Major features include:
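The paragraph added above describes Triton as serving models from many frameworks behind a common endpoint. For context only (not part of this commit), a minimal inference request against a running server using the official tritonclient Python package might look like the sketch below; the model name "simple" and the INPUT0/OUTPUT0 tensor names and shapes are illustrative assumptions, so substitute the names from your own model repository.

```python
# Minimal client-side sketch: one HTTP inference request to a running Triton
# server, using the official Python client (`pip install tritonclient[http]`).
# Model name "simple" and tensor names/shapes are assumptions for illustration.
import numpy as np
import tritonclient.http as httpclient

# Triton serves HTTP on port 8000 by default (gRPC on 8001).
client = httpclient.InferenceServerClient(url="localhost:8000")

# Declare the input tensor and attach the request data.
input0 = httpclient.InferInput("INPUT0", [1, 16], "FP32")
input0.set_data_from_numpy(np.random.rand(1, 16).astype(np.float32))

# Request the output tensor by name and run inference.
output0 = httpclient.InferRequestedOutput("OUTPUT0")
response = client.infer(model_name="simple", inputs=[input0], outputs=[output0])

print(response.as_numpy("OUTPUT0"))
```

The same call works unchanged whether the server is running on NVIDIA GPUs, a CPU-only host, or an edge device, which is the portability point the paragraph makes.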

docs/index.md (+11 −2)
@@ -58,9 +58,18 @@ Triton Inference Server is an open source inference serving software that stream
 <iframe width="560" height="315" src="https://www.youtube.com/embed/NQDtfSi5QF4" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>
 </div>

-# Triton
+# Triton Inference Server

-Triton enables teams to deploy any AI model from multiple deep learning and machine learning frameworks, including TensorRT, TensorFlow, PyTorch, ONNX, OpenVINO, Python, RAPIDS FIL, and more. Triton supports inference across cloud, data center,edge and embedded devices on NVIDIA GPUs, x86 and ARM CPU, or AWS Inferentia. Triton delivers optimized performance for many query types, including real time, batched, ensembles and audio/video streaming.
+Triton Inference Server enables teams to deploy any AI model from multiple deep
+learning and machine learning frameworks, including TensorRT, TensorFlow,
+PyTorch, ONNX, OpenVINO, Python, RAPIDS FIL, and more. Triton supports inference
+across cloud, data center, edge and embedded devices on NVIDIA GPUs, x86 and ARM
+CPU, or AWS Inferentia. Triton Inference Server delivers optimized performance
+for many query types, including real time, batched, ensembles and audio/video
+streaming. Triton Inference Server is part of
+[NVIDIA AI Enterprise](https://www.nvidia.com/en-us/data-center/products/ai-enterprise/),
+a software platform that accelerates the data science pipeline and streamlines
+the development and deployment of production AI.

 Major features include:

docs/user_guide/faq.md (+41)
@@ -162,3 +162,44 @@ looking at the gdb trace for the segfault.

 When opening a GitHub issue for the segfault with Triton, please include
 the backtrace to better help us resolve the problem.
+
+## What are the benefits of using [Triton Inference Server](https://developer.nvidia.com/triton-inference-server) as part of the [NVIDIA AI Enterprise Software Suite](https://www.nvidia.com/en-us/data-center/products/ai-enterprise/)?
+
+NVIDIA AI Enterprise enables enterprises to implement full AI workflows by
+delivering an entire end-to-end AI platform. Four key benefits:
+
+### Enterprise-Grade Support, Security & API Stability
+
+Business-critical AI projects stay on track with NVIDIA Enterprise Support,
+available globally to assist both IT teams with deploying and managing the
+lifecycle of AI applications and developer teams with building AI
+applications. Support includes maintenance updates, dependable SLAs and
+response times. Regular security reviews and priority notifications mitigate
+the potential risk of unmanaged open-source software and ensure compliance
+with corporate standards. Finally, long-term support and regression testing
+ensure API stability between releases.
+
+### Speed Time to Production with AI Workflows & Pretrained Models
+
+To reduce the complexity of developing common AI applications, NVIDIA AI
+Enterprise includes
+[AI workflows](https://www.nvidia.com/en-us/launchpad/ai/workflows/), which are
+reference applications for specific business outcomes such as Intelligent
+Virtual Assistants and Digital Fingerprinting for real-time cybersecurity
+threat detection. AI workflow reference applications may include
+[AI frameworks](https://docs.nvidia.com/deeplearning/frameworks/index.html),
+[pretrained models](https://developer.nvidia.com/ai-models),
+[Helm charts](https://catalog.ngc.nvidia.com/helm-charts),
+[Jupyter notebooks](https://developer.nvidia.com/run-jupyter-notebooks) and
+[documentation](https://docs.nvidia.com/ai-enterprise/index.html#overview).
+
+### Performance for Efficiency and Cost Savings
+
+Using accelerated compute for AI workloads such as data processing with the
+[NVIDIA RAPIDS Accelerator](https://developer.nvidia.com/rapids) for Apache
+Spark and inference with Triton Inference Server delivers better performance,
+which also improves efficiency and reduces operation and infrastructure costs,
+including savings from reduced time and energy consumption.
+
+### Optimized and Certified to Deploy Everywhere
+
+Optimized and certified for the cloud, the data center, and the edge, ensuring
+reliable performance whether your AI runs in the public cloud, in virtualized
+data centers, or on DGX systems.
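The "Performance for Efficiency and Cost Savings" entry above pairs Triton with the RAPIDS Accelerator for Apache Spark on the data-processing side. As a rough sketch only, assuming a Spark cluster whose executors have NVIDIA GPUs and the rapids-4-spark plugin jar already on the classpath, enabling the accelerator from PySpark looks roughly like this:

```python
# Illustrative sketch: enable the RAPIDS Accelerator for Apache Spark.
# Assumes the rapids-4-spark plugin jar is on the Spark classpath and that
# executors have access to NVIDIA GPUs.
from pyspark.sql import SparkSession

spark = (
    SparkSession.builder
    .appName("gpu-accelerated-etl")
    # The plugin intercepts supported SQL/DataFrame operations and runs
    # them on the GPU; unsupported operations fall back to the CPU.
    .config("spark.plugins", "com.nvidia.spark.SQLPlugin")
    .config("spark.rapids.sql.enabled", "true")
    .getOrCreate()
)

# A simple aggregation the plugin can execute on the GPU.
df = spark.range(100_000_000).selectExpr("id % 1000 AS key", "id AS value")
df.groupBy("key").sum("value").show(5)
```

On the inference side, Triton's RAPIDS FIL backend plays the analogous role for tree-based models, which is why the FAQ entry mentions the two together.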
