Call for Papers for the vLLM Featured Track at Ray Summit is now Open! Submit your presentation by July 30
Anyscale
  • Tutorials
  • Pricing
Get Started with $100 Credit
HomeBlogBlog Detail

blog post 123

By    |   June 1, 2023

test

Sharing

Sign up for product updates

Recommended content

Open Source RL Libraries for LLMs GraphThe architecture of a Reinforcement Learning (RL) library is split into two primary components: Generation and Training. During the generation phase, an LLM Engine performs multi-turn rollouts within an environment to produce data and reward signals. This output is then fed into the training phase to update the model's parameters. This process forms a feedback loop, where the progressively improved model generates the next iteration of data for continuous refinement.

Open Source RL Libraries for LLMs

Figure 24: TFCC inference runtimeFigure 24: TFCC inference runtime

Large-Scale Deployment of Ray in Tencent’s Weixin AI Infrastructure

Your Data and AI Frameworks Evolved – What About Your Compute Framework? thumbnail

Your Data and AI Frameworks Evolved – What About Your Distributed Compute Framework?

Ready to try Anyscale?

Access Anyscale today to see how companies using Anyscale and Ray benefit from rapid time-to-market and faster iterations across the entire AI lifecycle.

© Anyscale, Inc 2025 - Privacy Policy

Follow Anyscale

Follow Ray

Company

  • About Us
  • News
  • Careers
  • Contact Sales

Learn

  • Resources
  • Case Studies
  • Blog
  • Events
  • Ray Training
  • Ray Docs
  • Anyscale Docs

Products

  • Anyscale Platform
  • Anyscale Support
  • Ray Open Source
  • Integrations