Ray Serve is an open-source library for serving machine learning models at scale. In this talk, we will demonstrate some new features of Ray Serve, including (1) an integration with the open-source ML lifecycle management platform MLflow, and (2) an easy way to use Ray Serve to scale up an existing Python web server.