Ray Serve is a flexible and scalable framework for ML application development that overcomes the operational and scaling burdens of typical model microservice or monolithic architectures.