HPC System Reliability Engineering
HPC System Reliability Engineering
Corporate Enterprise
Broadwing stabilizes a top50 10k+ node hybrid-cloud supercomputer SituationSeeking to deliver its most complex HPC environment ever, a globally recognizable HPC manufacturer reached out to Broadwing for help. ActionBroadwing engineers integrated seamlessly as part of the larger team co-developing the automated healthchecks, triaging the findings, resolving system configuration issues, and coordinating the replacement of hundreds…
More
DevOps Pipelines for HPC
Corporate Enterprise
Broadwing reduces HPC softwaretesting time from 1 month to 1 hour SituationA leading HPC product manufacturer faced the challenge of improving software quality while reducing the manual effort required to deploy product increments. To solve this, they turned to Broadwing for help. ActionBroadwing stepped in and overhauled their CI/CD pipelines and artifact warehousing, modernizing the…
More