Webinar

Building a scalable ML model serving API with Ray Serve

The demo will show how to:
- Deploy a trained Python model and scale it to a cluster using Ray Serve
- Improve the HTTP API using Ray Serve’s native FastAPI integration
- Compose multiple independently scalable models into a single model and run them in parallel to minimize latency
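
To make the last point concrete, here is a minimal sketch of parallel model composition. It uses plain `asyncio` from the Python standard library as a stand-in and is not Ray Serve's API; `model_a`, `model_b`, `composed`, and `run_demo` are hypothetical names, and the sleeps only simulate inference time.

```python
import asyncio
import time


async def model_a(x: float) -> float:
    # Hypothetical model: sleeps to simulate 0.2 s of inference latency.
    await asyncio.sleep(0.2)
    return x * 2


async def model_b(x: float) -> float:
    # Second hypothetical model with the same simulated latency.
    await asyncio.sleep(0.2)
    return x + 1


async def composed(x: float) -> float:
    # Fan the input out to both models concurrently, then combine the
    # results. Total latency is roughly that of the slower model rather
    # than the sum of both.
    a, b = await asyncio.gather(model_a(x), model_b(x))
    return a + b


def run_demo(x: float = 3.0):
    # Returns the combined prediction and the wall-clock time it took.
    start = time.perf_counter()
    result = asyncio.run(composed(x))
    elapsed = time.perf_counter() - start
    return result, elapsed


if __name__ == "__main__":
    print(run_demo())
```

Calling the two models one after the other would take at least 0.4 s here; awaiting them together keeps the request close to 0.2 s. In the webinar's setting each model would be its own independently scaled deployment rather than a local coroutine.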

Speakers

Tricia Fu

Product Manager, Anyscale
