PySpect

© 2025 PySpect

Package profile

trl

  • Summary: Train transformer language models with reinforcement learning.
  • Author: Leandro von Werra
  • Homepage: https://github.com/huggingface/trl
  • Source: https://github.com/huggingface/trl (Repo profile)
  • Number of releases: 63
  • First release: 0.0.1 on 2020-03-30
  • Latest release: 0.19.1 on 2025-07-08

Releases

[Chart: Dates and sizes of releases. x-axis: Release Date (2021 to 2025); y-axis: Size in MB (0.10 to 0.50)]

PyPI Downloads

[Chart: Weekly downloads over the last 3 months. x-axis: Date (February to June); y-axis: thousand downloads per week (0 to 400)]

Dependencies

trl has 23 dependencies, 20 of which are optional.
Dependencies of trl (23).

Dependency            Optional
accelerate            false
datasets              false
transformers          false
bitsandbytes          true
deepspeed             true
diffusers             true
fastapi               true
joblib                true
liger-kernel          true
llm-blender           true
openai                true
parameterized         true
peft                  true
Pillow                true
pydantic              true
pytest                true
pytest-cov            true
pytest-rerunfailures  true
pytest-xdist          true
requests              true
scikit-learn          true
uvicorn               true
vllm                  true
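
The required/optional split above can be approximated locally from installed package metadata. The following is a minimal sketch, not how PySpect computes its table: it assumes trl is installed in the current environment and treats any requirement whose environment marker mentions an extra as optional. The helper name split_requirements is hypothetical, and the counts may differ from the table depending on the installed trl version.

```python
import re
from importlib import metadata


def split_requirements(requirements):
    """Split requirement strings into (required, optional) name lists.

    Heuristic (an assumption of this sketch): a requirement is optional
    when its environment marker mentions an extra,
    e.g. 'somepkg; extra == "dev"'.
    """
    required, optional = set(), set()
    for req in requirements:
        # Everything after ';' is the environment marker, if any.
        spec, _, marker = req.partition(";")
        # The distribution name is the leading run of name characters.
        match = re.match(r"\s*([A-Za-z0-9][A-Za-z0-9._-]*)", spec)
        if not match:
            continue
        name = match.group(1)
        (optional if "extra" in marker else required).add(name)
    # A package listed both with and without an extra counts as required.
    optional -= required
    return sorted(required), sorted(optional)


if __name__ == "__main__":
    try:
        reqs = metadata.requires("trl") or []
    except metadata.PackageNotFoundError:
        reqs = []
        print("trl is not installed in this environment")
    required, optional = split_requirements(reqs)
    print(f"required ({len(required)}): {required}")
    print(f"optional ({len(optional)}): {optional}")
```

Optional dependencies are only installed when the matching extra is requested at install time, so a plain install of trl pulls in just the packages marked false in the table.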
