.. trlX documentation master file, created by
   sphinx-quickstart on Mon Oct 3 21:21:33 2022.
   You can adapt this file completely to your liking, but it should at least
   contain the root `toctree` directive.

Welcome to trlX's documentation!
================================

trlX is a library for training large language models with reinforcement learning. It
currently supports training with PPO or ILQL for models of up to 20B parameters using Accelerate.

.. toctree::
   :maxdepth: 2
   :caption: Contents:

   data
   models
   configs
   pipeline
   examples

Indices and tables
==================

* :ref:`genindex`
* :ref:`modindex`
* :ref:`search`