PySpect

Home

lists

Frequently asked questions

© 2025 PySpect

Package profile

trl

  • Summary: Train transformer language models with reinforcement learning.
  • Author: Leandro von Werra
  • Homepage: https://github.com/huggingface/trl
  • Source: https://github.com/huggingface/trl (Repo profile)
  • Number of releases: 68
  • First release: 0.0.1 on 2020-03-30
  • Latest release: 0.22.2 on 2025-09-03

Releases

Dates and sizes of releases20212022202320242025Release Date0.100.200.300.400.50Size in MB

PyPI Downloads

Loading PyPI statistics...

Dependencies

Trl has 26 dependencies, 23 of which optional.
Dependencies of trl (26).
DependencyOptional
acceleratefalse
datasetsfalse
transformersfalse
bitsandbytestrue
deepspeedtrue
diffuserstrue
fastapitrue
ftfytrue
joblibtrue
liger-kerneltrue
llm-blendertrue
num2wordstrue
openaitrue
parameterizedtrue
pefttrue
Pillowtrue
pydantictrue
pytesttrue
pytest-covtrue
pytest-rerunfailurestrue
pytest-xdisttrue
requeststrue
scikit-learntrue
torchvisiontrue
uvicorntrue
vllmtrue

Details