Cisco IT needed to build an AI cluster to support model training and inferencing for multiple teams across the business.
Cisco product teams needed a way to run the AI workloads used to develop and test new AI capabilities in Cisco products, but existing data center facilities weren’t built to power modern AI.
Cisco IT designed an AI infrastructure with Cisco compute, best-in-class GPUs from NVIDIA, and Cisco networking. They built a front-end Ethernet network and a back-end lossless Ethernet network for a reliable, high-performing, scalable network.
Jon Woolwine, Technical Systems Engineering, and Jag Kahlon, Principal IT Engineer, explain how this new infrastructure was launched in only three months and expanded to support dozens of use cases.